mosesdecoder/vw/README.md

Vowpal Wabbit for Moses
=======================

This is an attempt to integrate Vowpal Wabbit with Moses as a stateless feature
function.

Compatible with this frozen version of VW:

    https://github.com/moses-smt/vowpal_wabbit
    
To enable VW, you need to provide a path where VW was installed (using `make install`) to bjam:

    ./bjam --with-vw=<path/to/vw/installation>

Implemented classifier features
-------------------------------

* `VWFeatureSourceBagOfWords`: This creates a feature of form bow^token for every
source sentence token.
* `VWFeatureSourceExternalFeatures column=0`: when used with -inputtype 5 (`TabbedSentence`) this can be used to supply additional feature to VW. The input is a tab-separated file, the first column is the usual input sentence, all other columns can be used for meta-data. Parameter column=0 counts beginning with the first column that is not the input sentence.  
* `VWFeatureSourceIndicator`: Ass a feature for the whole source phrase.
* `VWFeatureSourcePhraseInternal`: Adds a separate feature for every word of the source phrase.
* `VWFeatureSourceWindow size=3`: Adds source words in a window of size 3 before and after the source phrase as features. These do not overlap with `VWFeatureSourcePhraseInternal`.
* `VWFeatureTargetIndicator`: Adds a feature for the whole target phrase.
* `VWFeatureTargetPhraseInternal`: Adds a separate feature for every word of the target phrase.

Configuration
-------------

To use the classifier edit your moses.ini

    [features]
    ...
    VW path=/home/username/vw/classifier1.vw
    VWFeatureSourceBagOfWords
    VWFeatureTargetIndicator
    VWFeatureSourceIndicator
    ...
     
    [weights]
    ...
    VW0= 0.2
    ...

If you change the name of the main VW feature, remember to tell the VW classifier
features which classifier they belong to:

    [features]
    ...
    VW name=bart path=/home/username/vw/classifier1.vw 
    VWFeatureSourceBagOfWords used-by=bart
    VWFeatureTargetIndicator used-by=bart
    VWFeatureSourceIndicator used-by=bart
    ...
    
    [weights]
    ...
    bart= 0.2
    ...

You can also use multiple classifiers:

    [features]
    ...
    VW name=bart path=/home/username/vw/classifier1.vw 
    VW path=/home/username/vw/classifier2.vw
    VW path=/home/username/vw/classifier3.vw
    VWFeatureSourceBagOfWords used-by=bart,VW0 
    VWFeatureTargetIndicator used-by=VW1,VW0,bart
    VWFeatureSourceIndicator used-by=bart,VW1
    ...
    
    [weights]
    ...
    bart= 0.2
    VW0= 0.2
    VW1= 0.2
    ...
    
Training the classifier
-----------------------

To train a classifier, run `vwtrainer` (a limited version of the `moses` binary). Configure your features in the `moses.ini` file (see above) and set the `train` flag:

     [features]
     ... 
     VW name=bart path=/home/username/vw/features.txt train=1
     ...

The `path` variable points to the file (prefix) where features will be written. Currently, threads write to separate files (maybe subject to change sooner or later): `features.txt.1`, `features.txt.2` etc.

`vwtrainer` creates the translation option collection for each input sentence but does not run decoding. Therefore, you probably want to disable expensive feature functions such as the language model (LM score is not used by VW features at the moment).

Currently, classification is implemented using VW's `csoaa_ldf` scheme with quadratic features which take the product of the source namespace (`s`, contains label-independent features) and the target namespace (`t`,  contains label-dependent features).

To train VW in this setting, use the command:

    cat features.txt.* | vw --hash all --noconstant -b 26 -q st --csoaa_ldf mc -f classifier1.vw
Added README.md for VW 2015-01-09 17:01:38 +03:00			`Vowpal Wabbit for Moses`
			`=======================`

			`This is an attempt to integrate Vowpal Wabbit with Moses as a stateless feature`
			`function.`

VW version note 2015-01-09 17:47:00 +03:00			`Compatible with this frozen version of VW:`

			`https://github.com/moses-smt/vowpal_wabbit`
minor 2015-01-09 17:54:29 +03:00
			To enable VW, you need to provide a path where VW was installed (using `make install`) to bjam:

			`./bjam --with-vw=<path/to/vw/installation>`
VW version note 2015-01-09 17:47:00 +03:00
Added README.md for VW 2015-01-09 17:01:38 +03:00			`Implemented classifier features`
			`-------------------------------`

Update README.md 2015-01-09 17:20:22 +03:00			* `VWFeatureSourceBagOfWords`: This creates a feature of form bow^token for every
verbatim text in readme 2015-01-09 17:05:39 +03:00			`source sentence token.`
Update README.md 2015-01-09 17:20:22 +03:00			* `VWFeatureSourceExternalFeatures column=0`: when used with -inputtype 5 (`TabbedSentence`) this can be used to supply additional feature to VW. The input is a tab-separated file, the first column is the usual input sentence, all other columns can be used for meta-data. Parameter column=0 counts beginning with the first column that is not the input sentence.
			* `VWFeatureSourceIndicator`: Ass a feature for the whole source phrase.
			* `VWFeatureSourcePhraseInternal`: Adds a separate feature for every word of the source phrase.
Update README.md 2015-01-09 17:21:41 +03:00			* `VWFeatureSourceWindow size=3`: Adds source words in a window of size 3 before and after the source phrase as features. These do not overlap with `VWFeatureSourcePhraseInternal`.
Update README.md 2015-01-09 17:20:22 +03:00			* `VWFeatureTargetIndicator`: Adds a feature for the whole target phrase.
			* `VWFeatureTargetPhraseInternal`: Adds a separate feature for every word of the target phrase.
Added README.md for VW 2015-01-09 17:01:38 +03:00
			`Configuration`
			`-------------`

			`To use the classifier edit your moses.ini`

verbatim text in readme 2015-01-09 17:04:10 +03:00			`[features]`
			`...`
			`VW path=/home/username/vw/classifier1.vw`
			`VWFeatureSourceBagOfWords`
			`VWFeatureTargetIndicator`
verbatim text in readme 2015-01-09 17:05:39 +03:00			`VWFeatureSourceIndicator`
verbatim text in readme 2015-01-09 17:04:10 +03:00			`...`

			`[weights]`
			`...`
			`VW0= 0.2`
			`...`

Added README.md for VW 2015-01-09 17:01:38 +03:00			`If you change the name of the main VW feature, remember to tell the VW classifier`
			`features which classifier they belong to:`

verbatim text in readme 2015-01-09 17:04:10 +03:00			`[features]`
			`...`
			`VW name=bart path=/home/username/vw/classifier1.vw`
			`VWFeatureSourceBagOfWords used-by=bart`
			`VWFeatureTargetIndicator used-by=bart`
verbatim text in readme 2015-01-09 17:05:39 +03:00			`VWFeatureSourceIndicator used-by=bart`
verbatim text in readme 2015-01-09 17:04:10 +03:00			`...`

			`[weights]`
			`...`
			`bart= 0.2`
			`...`
Added README.md for VW 2015-01-09 17:01:38 +03:00
			`You can also use multiple classifiers:`

verbatim text in readme 2015-01-09 17:04:10 +03:00			`[features]`
			`...`
			`VW name=bart path=/home/username/vw/classifier1.vw`
			`VW path=/home/username/vw/classifier2.vw`
			`VW path=/home/username/vw/classifier3.vw`
			`VWFeatureSourceBagOfWords used-by=bart,VW0`
			`VWFeatureTargetIndicator used-by=VW1,VW0,bart`
			`VWFeatureSourceIndicator used-by=bart,VW1`
			`...`

			`[weights]`
			`...`
			`bart= 0.2`
			`VW0= 0.2`
			`VW1= 0.2`
			`...`
training described 2015-01-09 17:33:52 +03:00
Added README.md for VW 2015-01-09 17:01:38 +03:00			`Training the classifier`
Update README.md 2015-01-09 17:13:59 +03:00			`-----------------------`
training described 2015-01-09 17:33:52 +03:00
			To train a classifier, run `vwtrainer` (a limited version of the `moses` binary). Configure your features in the `moses.ini` file (see above) and set the `train` flag:

			`[features]`
			`...`
			`VW name=bart path=/home/username/vw/features.txt train=1`
			`...`

			The `path` variable points to the file (prefix) where features will be written. Currently, threads write to separate files (maybe subject to change sooner or later): `features.txt.1`, `features.txt.2` etc.

note on LM 2015-01-09 17:37:20 +03:00			`vwtrainer` creates the translation option collection for each input sentence but does not run decoding. Therefore, you probably want to disable expensive feature functions such as the language model (LM score is not used by VW features at the moment).

training described 2015-01-09 17:33:52 +03:00			Currently, classification is implemented using VW's `csoaa_ldf` scheme with quadratic features which take the product of the source namespace (`s`, contains label-independent features) and the target namespace (`t`, contains label-dependent features).

			`To train VW in this setting, use the command:`

			`cat features.txt.* \| vw --hash all --noconstant -b 26 -q st --csoaa_ldf mc -f classifier1.vw`