Commit Graph

13484 Commits

Author SHA1 Message Date
Ales Tamchyna
e932e96db7 avoid duplicate generation of source-side features when training VW 2015-02-04 11:07:07 +01:00
Philipp Koehn
f69c1dab02 more efficient default recaser training 2015-02-04 09:18:09 +00:00
Phil Williams
6b9da6c585 filter-rule-table: merge changes from t2s branch (still WIP) 2015-02-03 11:33:10 +00:00
Hieu Hoang
78f79632b9 script to convert moses.ini v2 to v1 /Tom Hoar 2015-02-03 10:59:38 +00:00
Hieu Hoang
6bfdddfe49 eclipse 2015-02-03 10:26:51 +00:00
Kenneth Heafield
081fb5e181 Fix argument parser: ignore lead 2015-02-02 17:34:57 -05:00
Hieu Hoang
464686e242 Merge pull request #94 from aminorex/master
remove spurious endl
2015-02-02 21:01:27 +00:00
akimbal1
362e6a9374 remove spurious endl 2015-02-02 15:57:04 -05:00
Hieu Hoang
80b2128a59 Merge pull request #93 from aminorex/master
alignment for hieu
2015-02-02 20:18:26 +00:00
akimbal1
8ea1c9fd40 alignment for hieu 2015-02-02 12:55:21 -05:00
Phil Williams
ac8f01bc3d Partial merge of t2s branch (implements t2s and f2s algorithms)
Implements search algorithms 7, 8, and 9:

-search-algorithm 7
  tree-to-string (STSG-based, currently a special-case of forest-to-string)

-search-algorithm 8
  tree-to-string (SCFG-based)

-search-algorithm 9
  forest-to-string (STSG-based)
2015-02-02 17:07:54 +00:00
Phil Williams
c8ad84fa55 New input type, ForestInput, for weighted forests.
* Uses Egret format (https://sites.google.com/site/zhangh1982/egret)
 * Used by forest-to-string search algorithm (-search-algorithm 9)
2015-02-02 16:34:17 +00:00
Phil Williams
984936f05f Placeholders for new syntax-based search algorithms 2015-02-02 15:39:00 +00:00
Phil Williams
e177131a22 Remove -s2t option (use -search-algorithm=6 instead) 2015-02-02 15:01:59 +00:00
Kenneth Heafield
e542986ed5 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-02-02 09:26:08 -05:00
Kenneth Heafield
ee881734fe Rewrite KenLM argument parser to consume one = 2015-02-02 09:25:02 -05:00
Hieu Hoang
884a0b1c90 forgot to add Parameters.cpp. Change c++11 to c++0x to support older compilers (on Ubuntu 12.04 etc). 2015-01-30 17:45:20 +00:00
Phil Williams
2975e7a0d3 Minor refactoring in moses/Syntax 2015-01-30 11:56:08 +00:00
Kenneth Heafield
925565a0b9 "just put it in. I'll verify it if i can be bovvered" --Hieu /usr/bin/env 2015-01-29 18:37:05 -05:00
Matthias Huck
449d9b294b Revert "env perl shebang"
This reverts commit 34f2801f8a.

Caused problems because /bin/env doesn't exist on Ubuntu 12.04.
/usr/bin/env does, though.
2015-01-29 21:15:20 +00:00
Matthias Huck
a2efb9afc7 PhraseOrientationFeature: more sparse scores 2015-01-29 20:23:41 +00:00
Ales Tamchyna
c33e9e81da vw verbose 2015-01-29 17:40:22 +01:00
Ales Tamchyna
2b0aa56677 VW verbose outputs 2015-01-29 17:33:35 +01:00
Ales Tamchyna
79f7e7bd16 Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-01-29 16:52:27 +01:00
Ales Tamchyna
bef4795ad9 optionally leave one out in VW training 2015-01-29 16:52:21 +01:00
Matthias Huck
7a7d12b038 PhraseOrientationFeature: can produce some sparse scores 2015-01-28 20:50:24 +00:00
Matthias Huck
91dfd36c8b use FEATUREVERBOSE 2015-01-28 20:49:04 +00:00
Kenneth Heafield
34f2801f8a env perl shebang 2015-01-27 18:35:54 -05:00
Matthias Huck
4cc5c677b3 Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-01-27 18:53:13 +00:00
Hieu Hoang
7691657c35 Merge branch 'RC-3.1' 2015-01-27 18:24:36 +00:00
Hieu Hoang
5244a7b607 shut up 2015-01-27 18:24:15 +00:00
Matthias Huck
afb6ac903b Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-01-27 18:16:14 +00:00
Matthias Huck
9f562e0fd4 SoftSourceSyntacticConstraintsFeature: better config parameter names 2015-01-27 18:15:51 +00:00
Hieu Hoang
646d3bd055 Merge branch 'RC-3.1' 2015-01-27 16:49:01 +00:00
Hieu Hoang
d7f5bb41fa don't define TEST_DYN_LINK. Doesn't work on OSX with manually compiled boost 2015-01-27 16:22:15 +00:00
Matthias Huck
0a0ea437bb use pragma once; avoid using cerr directly 2015-01-26 22:12:44 +00:00
Matthias Huck
eb9d8134a7 PhraseOrientationFeature: Heuristic score for boundary non-terminals is basically a lookahead. Compute a lookahead for everything.
(+ Refined feature state comparison.)
2015-01-26 21:11:37 +00:00
Matthias Huck
e51714ff7a a plain dense unaligned word count feature with two scores (source and target unaligned words) 2015-01-26 21:06:12 +00:00
Matthias Huck
c66d6a9b86 using pragma once and VERBOSE in SourceWordDeletionFeature/TargetWordInsertionFeature 2015-01-26 20:45:08 +00:00
Nicola Bertoldi
fa00c99aa3 fixings to the IRSTLM interface for textual input; code cleanup 2015-01-26 18:24:12 +01:00
Nicola Bertoldi
5d186874f4 minor fixing to the comparison script of the regression tests 2015-01-26 15:52:29 +01:00
Nicola Bertoldi
18eaf62ce3 fixings to the IRSTLM interface for textual input 2015-01-26 15:51:08 +01:00
Hieu Hoang
4202ad473c Merge branch 'master' of github.com:moses-smt/mosesdecoder 2015-01-25 15:02:51 +00:00
Hieu Hoang
1dea58e945 separate parameters into it's own class 2015-01-25 15:02:33 +00:00
Hieu Hoang
5d2b0224d6 Jamfile for tokenizer 2015-01-25 14:00:35 +00:00
XapaJIaMnu
6ca1a4718c Expose learning rate as a parameter 2015-01-25 02:13:47 +00:00
Matthias Huck
55f6bbb14a Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-01-23 18:45:31 +00:00
Matthias Huck
9987beb453 SoftSourceSyntacticConstraintsFeature: Now for both non-terminals (as before) _and_ terminals.
Also added score components based on relative frequency.
(TODO: logprobs right now; are plain probabilities better?)
2015-01-23 18:41:18 +00:00
Kenneth Heafield
98c352ed3a Merge branch 'master' of https://github.com/moses-smt/mosesdecoder 2015-01-23 13:38:35 -05:00
akimbal1
d38dcd89bb add glib-2.0 for better unicodification and faster implementation 2015-01-23 13:35:09 -05:00