Hieu Hoang
4f6f127486
Merge pull request #53 from pengli09/master
...
Fix the bug in phrase-extract/extract-main.cpp: the authors forgot to change three variable names
2013-11-20 03:04:41 -08:00
Peng Li
f53825c71e
Fix the bug in phrase-extract/extract-main.cpp: the authors forgot to change inBottomRight/outBottomRight to inBottomLeft/outBottomLeft in the second loops in getOrientPhraseModel() and getOrientHierModel()
2013-11-20 16:22:15 +08:00
Hieu Hoang
ccf9662748
Merge branch 'master' of ../mosesdecoder
2013-11-15 14:03:05 +00:00
Phil Williams
6bee77e207
extract-ghkm: use square brackets for glue rule internal tree structure
2013-11-12 15:49:49 +00:00
Hieu Hoang
477314cda4
Merge branch 'master' of github.com:hieuhoang/mosesdecoder
2013-11-12 12:26:35 +00:00
Hieu Hoang
24f95297fc
compiles with clang
2013-10-31 12:46:41 +00:00
Hieu Hoang
125e9a8569
add debug argument
2013-10-05 10:48:01 +01:00
Hieu Hoang
902741681a
reverse 7d3de78500
2013-10-04 21:27:53 +01:00
Hieu Hoang
7d3de78500
minor error with placeholder
2013-10-04 19:29:16 +01:00
Phil Williams
d6aa123d03
score: write sparse features to third field.
2013-09-29 18:58:20 +01:00
Phil Williams
2a28d1a73e
Merge branch 'master' into GHKMStruct
...
Conflicts:
moses-chart-cmd/IOWrapper.cpp
moses-chart-cmd/IOWrapper.h
moses/FF/Factory.cpp
moses/Parameter.cpp
moses/StaticData.h
phrase-extract/extract-ghkm/ScfgRuleWriter.cpp
phrase-extract/score-main.cpp
2013-09-29 15:27:09 +01:00
Phil Williams
20b96fd0a7
Oops, fix e497dc485...
2013-09-29 15:23:37 +01:00
Phil Williams
e497dc4857
Remove NT length code missed in commit cdd9df19...
2013-09-29 15:09:14 +01:00
Hieu Hoang
31ce9b510e
beautify
2013-09-27 09:35:24 +01:00
Phil Williams
940591a1a3
extract-ghkm: allow trailing whitespace in alignment file
...
Thanks to Matt Post for reporting the problem.
2013-09-26 15:49:08 +01:00
Phil Williams
29c1089283
consolidate: don't assume input contains key-value field
2013-09-24 09:45:49 +01:00
Phil Williams
74ed066569
consolidate: expect key-value pairs in 7th field, not 6th
2013-09-20 15:50:03 +01:00
Phil Williams
23488e1adb
extract-ghkm: use square brackets for --TreeFragments
...
Use square brackets instead of round brackets for internal tree
structure. This avoids the need for additional escaping since
square brackets are already escaped in Moses.
Also: tweak code style to match the rest of the source file, and
output less whitespace to make the extract files (marginally)
smaller.
2013-09-20 14:57:40 +01:00
Phil Williams
ab863d1f16
consolidate: write key-value field to rule table
2013-09-20 09:42:13 +01:00
Hieu Hoang
98bb4fa1c7
placeholders work in extract
2013-09-19 12:24:57 +02:00
Hieu Hoang
a40d9082cd
more placeholder code and 'NO BEST TRANSLATION' to stderr for pb
2013-09-18 23:47:50 +02:00
Matthias Huck
a6d172e0f1
command line option for extract-ghkm: --TreeFragments
2013-09-16 20:06:02 +01:00
maria nadejde
7cc284a743
comment
2013-09-14 10:50:33 +02:00
maria nadejde
df86f0e78b
Merge branch 'GHKMStruct' of github.com:moses-smt/mosesdecoder into GHKMStruct
2013-09-14 10:46:17 +02:00
maria nadejde
5f37a545b1
fixed sparse feature output
2013-09-14 10:44:35 +02:00
Phil Williams
296eb6804a
Merge master
2013-09-13 22:32:45 +01:00
Phil Williams
cdd9df19d2
Remove --OutputNTLengths from extract-rules, etc.
...
The option isn't used in master and the output is compatible with the
current rule table format. If anyone wants this in master it should
probably be fixed in the span-length branch then merged.
2013-09-13 22:16:42 +01:00
maria nadejde
bf5c32df6c
stuff that probably doesn't work
2013-09-13 19:43:04 +02:00
Matthias Huck
643fa18805
Merge branch 'GHKMStruct' of github.com:moses-smt/mosesdecoder into GHKMStruct
2013-09-13 17:13:20 +02:00
Matthias Huck
c39bed60c0
Tree fragments in GHKM glue rules;
...
output of LHS tag in tree fragments for UNKs;
GHKMParse info is now denoted as Tree info
2013-09-13 17:10:21 +02:00
maria nadejde
fad57a60a7
comment for Equal implementation
2013-09-13 16:13:36 +02:00
maria nadejde
5615a11766
sparse feature weight file
2013-09-13 16:06:48 +02:00
maria nadejde
bff123635e
added Dense and Sparse feature to scorer
2013-09-13 12:45:46 +02:00
maria nadejde
43a9323d0f
add feature files
2013-09-12 18:46:40 +02:00
maria nadejde
67b873b67d
mock feature
2013-09-12 18:40:08 +02:00
Matthias Huck
96d14555fc
GHKM tree output during extraction: modified extract-ghkm and score tools
2013-09-11 16:46:37 +02:00
Matthias Huck
004c44faf1
prototype GHKM tree output from extract-ghkm (still flawed)
2013-09-10 15:41:26 +02:00
Rico Sennrich
b421f7c9b0
refactoring to minimize overhead from flexibility score code (if off)
2013-09-07 23:04:40 +02:00
Rico Sennrich
7138056b8f
flexibility scores
2013-09-07 23:04:01 +02:00
Hieu Hoang
77872f7521
beautify
2013-07-30 15:04:37 +01:00
Hieu Hoang
9cdcf713a6
phrase penalty now has it's own ff. No longer in the phrase table
2013-07-29 12:55:44 +01:00
Hieu Hoang
9e8402dedd
add placeholder support to extract
2013-07-26 15:46:15 +01:00
Hieu Hoang
e3917f911b
add placeholder support to extract
2013-07-26 15:44:29 +01:00
Hieu Hoang
2ba7a372e8
add placeholder support to extract
2013-07-26 14:12:27 +01:00
Hieu Hoang
4fde5f7ea2
eclipse file for extract-rules
2013-07-26 12:27:55 +01:00
Phil Williams
f0b603e6b5
extract-ghkm: write glue grammars for all sentence offsets
...
extract-parallel now merges separate glue grammars, so remove
previous workaround.
2013-07-25 13:53:32 +01:00
Phil Williams
b5584fdecf
extract-ghkm: workaround for extract-parallel issue
...
Don't write glue grammar or unknown word label files unless the sentence
offset is 0. This prevents multiple instances of extract-ghkm writing
to the same two files when extract-parallel is used.
TODO Better solutions might be:
1. modify extract-parallel so that it only configures one instance of
extract-ghkm to write the glue / unknown-lhs files (like the current
workaround, this assumes file chunks are representative of the whole)
2. add multithreading support directly to extract-ghkm
3. write distinct output files for each extract-ghkm instance and
combine them on completion
2013-07-23 14:55:16 +01:00
Hieu Hoang
310b26f989
beautify
2013-07-08 20:52:14 +01:00
Hieu Hoang
3eba5782c2
beautify
2013-07-08 20:25:47 +01:00
Hieu Hoang
dc33fa3d3d
redo parsing of feature function parameters
2013-06-20 12:50:41 +01:00