mosesdecoder/moses/ScoreProducer.h

// $Id$

#ifndef moses_ScoreProducer_h
#define moses_ScoreProducer_h

#include <set>
#include <string>
#include <vector>

#include "FeatureVector.h"

namespace Moses
{

 /*
 * @note do not confuse this with a producer/consumer pattern.
 * this is not a producer in that sense.
 */
class ScoreProducer
{
private:
  std::string m_description;
  bool m_reportSparseFeatures;
  size_t m_numScoreComponents;
  //In case there's multiple producers with the same description
  static std::multiset<std::string> description_counts;
	ScoreProducer(const ScoreProducer&);  // don't implement
	
protected:
	ScoreProducer(const std::string& description, size_t numScoreComponents);
	virtual ~ScoreProducer();

public:

  static const size_t unlimited;

  static void ResetDescriptionCounts() {
    description_counts.clear();
  }

	//! returns the number of scores that a subclass produces.
	//! For example, a language model conventionally produces 1, a translation table some arbitrary number, etc
  //! sparse features returned unlimited
	size_t GetNumScoreComponents() const {return m_numScoreComponents;}

	//! returns a string description of this producer
	const std::string& GetScoreProducerDescription() const {return m_description;}

  //! returns the weight parameter name of this producer (used in n-best list)
  virtual std::string GetScoreProducerWeightShortName(unsigned idx=0) const = 0;

  //! returns the number of scores gathered from the input (0 by default)
  virtual size_t GetNumInputScores() const {
    return 0;
  };

	virtual bool IsStateless() const = 0;

  void SetSparseFeatureReporting() { m_reportSparseFeatures = true; }
  bool GetSparseFeatureReporting() const { return m_reportSparseFeatures; } 

  virtual float GetSparseProducerWeight() const { return 1; }
};


}

#endif
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00			`// $Id$`

Use portable include guard instead of pragma once git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2939 1f5c12ca-751b-0410-a591-d2e778427230 2010-02-24 14:15:44 +03:00			`#ifndef moses_ScoreProducer_h`
			`#define moses_ScoreProducer_h`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00
Fixed so that multiple phrase tables works. Passes regression! git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/mira-mtm5@3614 1f5c12ca-751b-0410-a591-d2e778427230 2010-10-11 18:09:39 +04:00			`#include <set>`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00			`#include <string>`
Goodbye ScoreIndexManager. Compiles ok, but haven't dared to run regression yet. git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/mira-mtm5@3608 1f5c12ca-751b-0410-a591-d2e778427230 2010-10-07 02:06:49 +04:00			`#include <vector>`

			`#include "FeatureVector.h"`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00
create namespace git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1897 1f5c12ca-751b-0410-a591-d2e778427230 2008-10-09 03:51:26 +04:00			`namespace Moses`
			`{`

Goodbye ScoreIndexManager. Compiles ok, but haven't dared to run regression yet. git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/mira-mtm5@3608 1f5c12ca-751b-0410-a591-d2e778427230 2010-10-07 02:06:49 +04:00			`/*`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00			`* @note do not confuse this with a producer/consumer pattern.`
			`* this is not a producer in that sense.`
			`*/`
			`class ScoreProducer`
			`{`
			`private:`
Fixed so that multiple phrase tables works. Passes regression! git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/mira-mtm5@3614 1f5c12ca-751b-0410-a591-d2e778427230 2010-10-11 18:09:39 +04:00			`std::string m_description;`
added reporting of sparse features in n-best list git-svn-id: http://svn.statmt.org/repository/mira@3926 cc96ff50-19ce-11e0-b349-13d7f0bd23df 2011-08-07 04:58:56 +04:00			`bool m_reportSparseFeatures;`
set num score components in ScoreProducer ctor 2011-11-09 01:22:34 +04:00			`size_t m_numScoreComponents;`
Fixed so that multiple phrase tables works. Passes regression! git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/mira-mtm5@3614 1f5c12ca-751b-0410-a591-d2e778427230 2010-10-11 18:09:39 +04:00			`//In case there's multiple producers with the same description`
			`static std::multiset<std::string> description_counts;`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00			`ScoreProducer(const ScoreProducer&); // don't implement`

			`protected:`
set num score components in ScoreProducer ctor 2011-11-09 01:22:34 +04:00			`ScoreProducer(const std::string& description, size_t numScoreComponents);`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00			`virtual ~ScoreProducer();`

			`public:`
Implementation and testing of target bigram feature git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/mira-mtm5@3624 1f5c12ca-751b-0410-a591-d2e778427230 2010-10-15 01:52:35 +04:00
			`static const size_t unlimited;`

remove caching of wp weight and translation weights, clean up mira code 2012-04-29 08:37:48 +04:00			`static void ResetDescriptionCounts() {`
			`description_counts.clear();`
			`}`

move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00			`//! returns the number of scores that a subclass produces.`
			`//! For example, a language model conventionally produces 1, a translation table some arbitrary number, etc`
Implementation and testing of target bigram feature git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/mira-mtm5@3624 1f5c12ca-751b-0410-a591-d2e778427230 2010-10-15 01:52:35 +04:00			`//! sparse features returned unlimited`
set num score components in ScoreProducer ctor 2011-11-09 01:22:34 +04:00			`size_t GetNumScoreComponents() const {return m_numScoreComponents;}`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00
			`//! returns a string description of this producer`
Fixed so that multiple phrase tables works. Passes regression! git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/mira-mtm5@3614 1f5c12ca-751b-0410-a591-d2e778427230 2010-10-11 18:09:39 +04:00			`const std::string& GetScoreProducerDescription() const {return m_description;}`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00
run beautify.perl. Consistent formatting for .h & .cpp files git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3901 1f5c12ca-751b-0410-a591-d2e778427230 2011-02-24 16:14:42 +03:00			`//! returns the weight parameter name of this producer (used in n-best list)`
change to print the corrett name of the features with InputScores git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4168 1f5c12ca-751b-0410-a591-d2e778427230 2011-08-30 16:25:50 +04:00			`virtual std::string GetScoreProducerWeightShortName(unsigned idx=0) const = 0;`
generalized n-best list reporting for feature functions, added experimental version of global lexical model git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2343 1f5c12ca-751b-0410-a591-d2e778427230 2009-05-26 23:30:35 +04:00
run beautify.perl. Consistent formatting for .h & .cpp files git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3901 1f5c12ca-751b-0410-a591-d2e778427230 2011-02-24 16:14:42 +03:00			`//! returns the number of scores gathered from the input (0 by default)`
			`virtual size_t GetNumInputScores() const {`
			`return 0;`
			`};`
Feature function overhaul. Each feature function is computed in one of three ways: 1) Stateless feature functions from the phrase table/generation table: these are computed when the TranslationOption is created. They become part of the ScoreBreakdown object contained in the TranslationOption and are added to the feature value vector when a hypothesis is extended. 2) Stateless feature functions that are computed during state exploration. Currently, only WordPenalty falls into this category, but these functions implement a method Evaluate which do does not receive a Hypothesis or any contextual information. 3) Stateful feature functions: these features receive the arc information (translation option), compute some value and then return some context information. The context information created by a particular feature function is passed back to it as the previous context when a hypothesis originating at the node where the previous edge terminates is created. States in the search space may be recombined if the context information is identical. The context information must be stored in an object implementing the FFState interface. TODO: 1) the command line interface / MERT interface needs to go to named parameters that are otherwise opaque 2) StatefulFeatureFunction's Evaluate method should just take a TranslationOption and a context object. It is not good that it takes a hypothesis, because then people may be tempted to access information about the "previous" hypothesis without "declaring" this dependency. 3) Future cost estimates should be handled using feature functions. All stateful feature functions need some kind of future cost estimate. 4) Philipp's poor-man's cube pruning is broken. git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2087 1f5c12ca-751b-0410-a591-d2e778427230 2009-02-06 18:43:06 +03:00
			`virtual bool IsStateless() const = 0;`

added reporting of sparse features in n-best list git-svn-id: http://svn.statmt.org/repository/mira@3926 cc96ff50-19ce-11e0-b349-13d7f0bd23df 2011-08-07 04:58:56 +04:00			`void SetSparseFeatureReporting() { m_reportSparseFeatures = true; }`
			`bool GetSparseFeatureReporting() const { return m_reportSparseFeatures; }`
-show-weights: output sparseProducerWeight if != 1, otherwise 'sparse' 2011-12-13 23:13:13 +04:00
			`virtual float GetSparseProducerWeight() const { return 1; }`
move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00			`};`

create namespace git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1897 1f5c12ca-751b-0410-a591-d2e778427230 2008-10-09 03:51:26 +04:00
			`}`

move cube pruning moses lib to trunk git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1848 1f5c12ca-751b-0410-a591-d2e778427230 2008-06-11 14:52:57 +04:00			`#endif`