mirror of
https://github.com/moses-smt/mosesdecoder.git
Vowpal Wabbit for Moses
This is an attempt to integrate Vowpal Wabbit with Moses as a stateless feature function.
Implemented classifier features
- VWFeatureSourceBagOfWords: Creates a feature of the form bow^token for every token of the source sentence.
- VWFeatureSourceExternalFeatures: (not quite finished yet) When used with -inputtype 5, this can supply additional features to VW. The input is a tab-separated file: the first column is the usual input sentence, and all other columns can be used for meta-data. By default, the second column is read in and its features are split on whitespace.
- VWFeatureSourceIndicator: Adds a feature for the whole source phrase.
- VWFeatureSourcePhraseInternal: Adds a separate feature for every word of the source phrase.
- VWFeatureSourceWindow: Adds source words in a window before and after the source phrase as features. These do not overlap with VWFeatureSourcePhraseInternal.
- VWFeatureTargetIndicator: Adds a feature for the whole target phrase.
- VWFeatureTargetPhraseInternal: Adds a separate feature for every word of the target phrase.
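To make the source-side features above more concrete, here is a minimal Python sketch of what they might produce for one input line. It is not the Moses implementation. Only the bow^token form and the tab-separated input layout come from the descriptions above; the prefixes sind^, sin^ and win^, the window size of 2, and all function names are hypothetical and chosen purely for illustration.

```python
def parse_input_line(line):
    """Split a tab-separated input line into source tokens and external features.

    Column 1 is the sentence; column 2 (if present) holds whitespace-separated
    external features, as described for VWFeatureSourceExternalFeatures.
    """
    columns = line.rstrip("\n").split("\t")
    tokens = columns[0].split()
    external = columns[1].split() if len(columns) > 1 else []
    return tokens, external


def source_bag_of_words(tokens):
    # One bow^token feature per source sentence token (form taken from the README).
    return ["bow^" + tok for tok in tokens]


def source_indicator(tokens, start, end):
    # One feature for the whole source phrase tokens[start:end].
    # The "sind^" prefix and "_" joiner are hypothetical.
    return ["sind^" + "_".join(tokens[start:end])]


def source_phrase_internal(tokens, start, end):
    # One feature per word inside the source phrase. "sin^" is hypothetical.
    return ["sin^" + tok for tok in tokens[start:end]]


def source_window(tokens, start, end, size=2):
    # Words up to `size` positions before and after the phrase, never inside it.
    # The "win^" prefix and the default size are hypothetical.
    feats = []
    for offset in range(1, size + 1):
        left = start - offset
        right = end - 1 + offset
        if left >= 0:
            feats.append(f"win^-{offset}^{tokens[left]}")
        if right < len(tokens):
            feats.append(f"win^+{offset}^{tokens[right]}")
    return feats


if __name__ == "__main__":
    tokens, external = parse_input_line("the small house\ttopic=news len=3\n")
    print(source_bag_of_words(tokens))
    print(source_indicator(tokens, 1, 2))
    print(source_phrase_internal(tokens, 1, 2))
    print(source_window(tokens, 1, 2))
    print(external)
```

Here the phrase is the single word at position 1, so the window features cover only its neighbours, which illustrates why the window and phrase-internal features do not overlap.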
Configuration
To use the classifier, edit your moses.ini:
[features]
...
VW path=/home/username/vw/classifier1.vw
VWFeatureSourceBagOfWords
VWFeatureTargetIndicator
VWFeatureSourceIndicator
...
[weights]
...
VW0= 0.2
...
If you change the name of the main VW feature, remember to tell the VW classifier features which classifier they belong to:
[features]
...
VW name=bart path=/home/username/vw/classifier1.vw
VWFeatureSourceBagOfWords used-by=bart
VWFeatureTargetIndicator used-by=bart
VWFeatureSourceIndicator used-by=bart
...
[weights]
...
bart= 0.2
...
You can also use multiple classifiers:
[features]
...
VW name=bart path=/home/username/vw/classifier1.vw
VW path=/home/username/vw/classifier2.vw
VW path=/home/username/vw/classifier3.vw
VWFeatureSourceBagOfWords used-by=bart,VW0
VWFeatureTargetIndicator used-by=VW1,VW0,bart
VWFeatureSourceIndicator used-by=bart,VW1
...
[weights]
...
bart= 0.2
VW0= 0.2
VW1= 0.2
...
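When several classifiers share features, it can help to check which features each classifier actually receives. The Python sketch below resolves the used-by lists from the [features] lines. It is an illustration only, not part of Moses: the function name resolve_used_by is hypothetical, and two behaviours are assumptions read off the examples above, namely that unnamed VW features are numbered VW0, VW1, ... in order of appearance and that a classifier feature without used-by belongs to VW0.

```python
from collections import defaultdict


def resolve_used_by(feature_lines):
    """Map each VW classifier name to the classifier features assigned to it.

    Assumptions (inferred from the README examples, not from Moses source):
    unnamed VW lines are named VW0, VW1, ... in order, and a VWFeature* line
    without used-by= defaults to VW0.
    """
    classifiers = []
    used_by = defaultdict(list)
    unnamed = 0
    for line in feature_lines:
        parts = line.split()
        if not parts:
            continue
        kind = parts[0]
        # Collect key=value arguments; tokens without "=" are ignored.
        args = dict(p.split("=", 1) for p in parts[1:] if "=" in p)
        if kind == "VW":
            name = args.get("name")
            if name is None:
                name = f"VW{unnamed}"
                unnamed += 1
            classifiers.append(name)
            used_by[name]  # make sure the classifier appears even with no features
        elif kind.startswith("VWFeature"):
            for target in args.get("used-by", "VW0").split(","):
                used_by[target].append(kind)
    return classifiers, dict(used_by)


if __name__ == "__main__":
    lines = [
        "VW name=bart path=/home/username/vw/classifier1.vw",
        "VW path=/home/username/vw/classifier2.vw",
        "VW path=/home/username/vw/classifier3.vw",
        "VWFeatureSourceBagOfWords used-by=bart,VW0",
        "VWFeatureTargetIndicator used-by=VW1,VW0,bart",
        "VWFeatureSourceIndicator used-by=bart,VW1",
    ]
    names, mapping = resolve_used_by(lines)
    for name in names:
        print(name, mapping[name])
```

Any key in the returned mapping that is not in the list of classifier names points to a used-by entry that refers to a classifier missing from the configuration.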