bojar
55e3ee4a30
just setting the executable bit
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2795 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-29 19:49:37 +00:00
bojar
2097e45edd
a handy script for calculating out-of-vocabulary rate of n-grams
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2794 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-29 19:48:29 +00:00
naditomeh
ad3b0760b2
adding extract.cpp
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/hierarchical-reo@2770 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-29 13:59:04 +00:00
naditomeh
03de8a99d8
adding extract.cpp
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/hierarchical-reo@2769 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-29 13:41:34 +00:00
sarst
4eec020d5b
bugfixes to train-factored-phrase-model.perl
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/hierarchical-reo@2764 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-29 12:11:10 +00:00
sarst
a9ef19edf0
updated train-factored-phrase-model.perl to work with the new hierarchical reordering framework
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/branches/hierarchical-reo@2759 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-29 11:29:39 +00:00
bojar
9f784c6bf8
a handy script to get many translations from Google (can continue interrupted sessions)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2744 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-29 01:48:13 +00:00
bojar
ed18df8dc7
allow env.var to override BINDIR and TARGETDIR
...
exclude memscore from the released things if failed to compile
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2740 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-29 00:51:28 +00:00
phkoehn
4d814e53d2
del
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2728 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-28 17:38:26 +00:00
phkoehn
f1f395e05d
added web interface, organized files in sub directories
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2724 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-28 17:24:39 +00:00
hieuhoang1972
53e54def0b
indent
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2723 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-28 17:17:19 +00:00
hieuhoang1972
f5ebdbcec8
xcode
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2721 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-28 16:43:36 +00:00
phkoehn
244e334b3d
bug fix
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2690 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-27 14:55:17 +00:00
bojar
0889b9efff
renaming .pl -> .perl
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2674 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-26 17:23:41 +00:00
bojar
0e26f91865
don't organize to stacks by default, accept --organize-to-stacks
...
read from stdin as well
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2673 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-26 17:20:28 +00:00
bojar
536c7bdbcc
committing a script by Loic Barrault to display moses search graph (-output-search-graph) using graphviz dot
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2672 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-26 17:01:12 +00:00
phkoehn
def35604af
initial release of experiment.perl
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2669 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-25 17:38:53 +00:00
chardmeier
6317633148
Added memscore phrase scorer.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2653 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-18 14:52:34 +00:00
nicolabertoldi
3ad833d136
program to compute counts for phrase pairs
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2647 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-08 17:16:37 +00:00
nicolabertoldi
34d9feccc8
now it is possible to perform mert on a subset of features
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2646 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-08 15:56:45 +00:00
mphi
850e54f17d
added a switch to the training script, which allows using different word alignment models: --final-alignment-model X, where X is either 1/2/3/4/5 (for the ibm models 1 to 5) or hmm; the latter is equivalent to using --hmm-align.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2644 1f5c12ca-751b-0410-a591-d2e778427230
2010-01-08 12:00:28 +00:00
bhaddow
d8864fa6ad
support for mgiza
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2640 1f5c12ca-751b-0410-a591-d2e778427230
2009-12-15 17:36:06 +00:00
eisele
12dda84589
test, please ignore
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2598 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-23 15:45:35 +00:00
nicolabertoldi
5ad52827e3
minor change
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2576 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-09 07:47:14 +00:00
nicolabertoldi
427d421cf9
small change to be compliant with the previous change (2571->2572)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2573 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-07 18:09:20 +00:00
nicolabertoldi
7b77734e3c
the ordered list of feature names is now stored in a file after each step and reloaded when the training is restarted
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2572 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-07 18:03:41 +00:00
nicolabertoldi
6d45d03f48
removed local references
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2571 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-07 17:34:34 +00:00
nicolabertoldi
3731f83b8d
minor changes
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2570 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-07 16:43:07 +00:00
nicolabertoldi
98387244c1
added a new regression test for --continue option of mert-moses-new.pl
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2569 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-07 16:37:25 +00:00
nicolabertoldi
124f88e55a
enabled the --continue option to re-start an interrupted mert from the last finished step
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2568 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-07 16:35:57 +00:00
nicolabertoldi
e25b8c41b7
minor change
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2567 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-07 16:33:32 +00:00
nicolabertoldi
40f9b00bab
use of compressed data
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2566 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-07 12:47:40 +00:00
nicolabertoldi
2d1e4697f2
adding a new regression test
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2564 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-05 18:01:57 +00:00
nicolabertoldi
a93041d3d8
adding a new regression test
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2563 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-05 18:01:15 +00:00
nicolabertoldi
572f54f474
removing useless files
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2562 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-05 17:59:41 +00:00
nicolabertoldi
fa6a5bfc35
with this change, the usage of initial points for mert works properly
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2558 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-01 16:07:13 +00:00
nicolabertoldi
d4083b1119
adding very basic regression tests (regr-tests) for mert-moses-new, which use a virtual decoder simulating the generation of the nbest lists
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2557 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-01 15:42:55 +00:00
nicolabertoldi
3484a1fd93
fixed minor bugs
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2550 1f5c12ca-751b-0410-a591-d2e778427230
2009-10-01 07:44:30 +00:00
hieuhoang1972
9b18ec4a29
add release files
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2527 1f5c12ca-751b-0410-a591-d2e778427230
2009-08-25 10:13:47 +00:00
nicolabertoldi
f75e3993ac
correction of parameter description
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2462 1f5c12ca-751b-0410-a591-d2e778427230
2009-08-05 16:54:33 +00:00
nicolabertoldi
8384857f8c
changes to mert-moses-new.pl to work with different reference length policies for BLEU: shortest, average, closest (either "--shortest", "--average", or "--closest"), and with case-sensitive/insensitive evaluation ("--nocase")
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2461 1f5c12ca-751b-0410-a591-d2e778427230
2009-08-05 16:39:06 +00:00
hieuhoang1972
568e973b2e
make gcc and make calls consistent for eric to use in ubuntu package
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2347 1f5c12ca-751b-0410-a591-d2e778427230
2009-05-27 12:54:04 +00:00
phkoehn
8833098925
generalized n-best list reporting for feature functions, added experimental version of global lexical model
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2343 1f5c12ca-751b-0410-a591-d2e778427230
2009-05-26 19:30:35 +00:00
mphi
17c3cfffac
added unpaired significance evaluation
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2328 1f5c12ca-751b-0410-a591-d2e778427230
2009-05-12 18:56:01 +00:00
bhaddow
981e440cc2
Fix detection of binarised reordering table
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2240 1f5c12ca-751b-0410-a591-d2e778427230
2009-03-13 12:28:34 +00:00
bhaddow
e1d7bb986c
Add option for predictable seeding
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2220 1f5c12ca-751b-0410-a591-d2e778427230
2009-02-25 19:31:17 +00:00
phkoehn
8d5aef137b
bug fix
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2113 1f5c12ca-751b-0410-a591-d2e778427230
2009-02-09 16:00:35 +00:00
phkoehn
a62f8ee316
added truecaser
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2112 1f5c12ca-751b-0410-a591-d2e778427230
2009-02-09 15:32:34 +00:00
jdschroeder
cc95706045
mert-moses.pl now supports multiple input weights for lattices and confusion networks, using the --inputweights argument.
...
I'll leave it to someone who knows mert-moses-new.pl better to make the changes there.
"zcat" is now abstracted as a $ZCAT variable in these files, and is set to "gzip -cd" which should work on more platforms (notably on the mac, where zcat fails unless an archive name ends in ".Z").
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@2082 1f5c12ca-751b-0410-a591-d2e778427230
2009-02-05 17:39:36 +00:00
phkoehn
98381c0193
fixed xml removal
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1995 1f5c12ca-751b-0410-a591-d2e778427230
2009-01-24 05:21:36 +00:00
phkoehn
616842f278
fixed multi-bleu documentation
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1971 1f5c12ca-751b-0410-a591-d2e778427230
2009-01-08 00:47:10 +00:00
nicolabertoldi
2075f9dda1
modification to mert script to allow the use of fewer nbest lists; features and scores are no longer gzipped
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1965 1f5c12ca-751b-0410-a591-d2e778427230
2008-12-30 17:33:16 +00:00
bojar
091c9ece28
raising line_max_length
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1953 1f5c12ca-751b-0410-a591-d2e778427230
2008-12-05 10:01:05 +00:00
bojar
586d7e2f84
minor fix when handling gzipped corpora
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1952 1f5c12ca-751b-0410-a591-d2e778427230
2008-12-04 17:49:59 +00:00
bojar
2c900c8bd7
uncompress input files for phrase extract, if needed
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1951 1f5c12ca-751b-0410-a591-d2e778427230
2008-11-27 10:39:28 +00:00
bhaddow
1e13f6d2d6
Weights can sometimes be in exponential format.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1947 1f5c12ca-751b-0410-a591-d2e778427230
2008-11-25 09:54:45 +00:00
hieuhoang1972
2807bc48ad
absolute file name check, provided by Eric Kow
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1944 1f5c12ca-751b-0410-a591-d2e778427230
2008-11-20 13:03:59 +00:00
hieuhoang1972
254284e57e
patch to fix fiddly env variable and directory stuff, provided by Eric Kow
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1943 1f5c12ca-751b-0410-a591-d2e778427230
2008-11-20 13:01:49 +00:00
phkoehn
abb2fc37b1
proper binarization of lexicalized reordering model
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1938 1f5c12ca-751b-0410-a591-d2e778427230
2008-11-10 16:03:16 +00:00
phkoehn
bfbbefd710
bug fixes
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1917 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-26 03:32:29 +00:00
mphi
8a4c6a2c63
put significance test into proper location
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1915 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-23 09:16:33 +00:00
mphi
88d3b775ce
altered the bootstrap significance script algorithm according to (Riezler and Maxwell 2005 @ MTSE'05)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1914 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-23 09:03:41 +00:00
phkoehn
a09242ad16
bug fix with phrase table name in moses.ini, when using hmm alignment
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1913 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-20 21:37:57 +00:00
mphi
f033e32979
Added implementation of Koehn's 2004 EMNLP paired bootstrap resampling
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1911 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-20 11:55:12 +00:00
phkoehn
1c7b305152
bug fix
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1910 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-19 07:32:48 +00:00
phkoehn
1b5d99ad26
added headers for standard compliance (gcc 4.3 on 64 bit linux)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1905 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-16 21:14:38 +00:00
phkoehn
3a5981ce9d
major improvements, see email to moses-support
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1904 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-15 23:25:14 +00:00
phkoehn
614876771d
extended extract/score, to allow for one big file, not just parts
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1903 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-15 22:12:56 +00:00
jdschroeder
78534c1518
made all zcat calls through ZCAT variable.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1875 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-15 16:15:51 +00:00
jdschroeder
7a2ebedc20
minor bugfixes and error checking
...
-added -rootdir option to enhanced-mert
-fixed float regex in score-nbest.py and mert-moses.pl
-allow for extra weights in constructing ini in mert-moses.pl
-additional NFS bug checks in mert-moses.pl
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1869 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-01 10:24:01 +00:00
bojar
6a087d59c4
removed SCRIPTS_ROOTDIR from this 'my' declaration, it was obscuring previous declaration!
...
lines wrapped
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1865 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-14 16:24:18 +00:00
bojar
2afe9e0357
avoid coredump files in parallel moses (usually just kills NFS for a while), debug on a smaller scale, if needed
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1864 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-14 13:58:42 +00:00
bojar
c20e682f18
Avoid NFS race condition:
...
explicitly remove old cmert output files (hoping that they will be correctly
replaced by a 'mv' in the shell script submitted to SGE by qsubwrapper
occasionally reveals a race condition in NFS => weights seem unchanged =>
mert finishes too early)
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1862 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-10 11:47:55 +00:00
bhaddow
83f234cf17
Implementation of Cer et al. mert regularisation. Use with argument such as --scconfig regtype:min,regwin:3 in extractor and mert. Only tested on toy example so far.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1860 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-24 19:27:18 +00:00
hieuhoang1972
52c2843e6c
perl regexpr bug, submitted by German Sanchis Trilles
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1855 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-19 21:57:29 +00:00
bhaddow
4195b70247
First cut of new mert outer loop
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1842 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-10 09:07:20 +00:00
hieuhoang1972
1b44c7c445
most popular alignment outputted, finally
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1818 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-04 14:49:56 +00:00
hieuhoang1972
8554a7c89d
most popular alignment outputted, finally
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1817 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-04 14:42:51 +00:00
hieuhoang1972
3832f68fed
most popular alignment outputted, finally
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1816 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-04 14:40:04 +00:00
hieuhoang1972
bf34eb891d
don't output alignment if inverse
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1813 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-03 12:25:37 +00:00
hieuhoang1972
b48ce341e9
output most aligned instead of merged
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1798 1f5c12ca-751b-0410-a591-d2e778427230
2008-05-28 13:49:04 +00:00
phkoehn
7498f469ab
get scripts rootdir by FindBin
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1745 1f5c12ca-751b-0410-a591-d2e778427230
2008-05-16 15:54:02 +00:00
hieuhoang1972
a2a3d33103
explicitly use bash
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1693 1f5c12ca-751b-0410-a591-d2e778427230
2008-05-15 08:50:22 +00:00
hieuhoang1972
3fc0b8ddb4
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1603 1f5c12ca-751b-0410-a591-d2e778427230
2008-05-04 12:52:52 +00:00
nicolabertoldi
def5f419c2
- handling of word graph generation in the parallel environment (-output-word-graph)
...
- handling of '-' (i.e. /dev/stdout) for word graphs
- if either translations, nbests, searchgraphs or wordgraphs are output to stdout
they are concatenated in this order
BUT I STRONGLY RECOMMEND NOT TO DO THAT
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1598 1f5c12ca-751b-0410-a591-d2e778427230
2008-04-23 10:03:45 +00:00
nicolabertoldi
d514c277df
- handling of search graph generation in the parallel environment (-output-search-graph)
...
- modification of the parameter for nbest generation in the parallel environment:
I make it similar to moses parameter (-n-best-list)
- handling of '-' (i.e. /dev/stdout) for nbest and search graphs
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1597 1f5c12ca-751b-0410-a591-d2e778427230
2008-04-23 08:37:44 +00:00
hieuhoang1972
a822d61d8f
prevent -inf in lex re-ordering. Code contributed by Christian Hardmeier
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1596 1f5c12ca-751b-0410-a591-d2e778427230
2008-04-18 09:04:38 +00:00
nicolabertoldi
1aff3d2382
correct handling of binary phrase tables
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1579 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-28 14:33:15 +00:00
nicolabertoldi
def0fff5cd
changes to handle lattice input format
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1578 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-28 08:55:40 +00:00
hieuhoang1972
0bb92c2e79
merge properly
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1577 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-27 19:01:38 +00:00
hieuhoang1972
cb1f0e56dc
optionally output which lines are retained
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1576 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-27 18:38:31 +00:00
bojar
f056bdbfde
fixed to correctly handle models in [distortion-file] section
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1572 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-26 10:24:53 +00:00
bojar
3957dc6b4c
default to reordering factors of 0-0 even if decoding steps are set (users might have explicitly said e.g. t0-0!)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1571 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-26 10:20:09 +00:00
hieuhoang1972
cced54cf7d
win32 fix provided by jc read
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1569 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-22 21:47:32 +00:00
bojar
f7a1fb5b9c
corpus compression correctly used even for generation step
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1568 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-22 16:14:30 +00:00
bojar
7f3e34207a
added some heuristics for Czech quotation marks
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1567 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-22 15:07:46 +00:00
bojar
6af3140978
added optional sentence uppercasing (use -u)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1566 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-22 14:50:43 +00:00
bojar
8b3d44b2e2
SAFE_GETLINE made safer: will exit if the line does not fit into the buffer instead of just going on and getting the src/tgt/alignment files out of sync
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1565 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-22 14:42:01 +00:00
bojar
f89ab590ec
added Nicola's enhancedmert to released files
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1564 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-22 13:30:05 +00:00
redpony
25750c6555
if giza returns sentences that have different lengths in different directions (due to truncation or other errors), don't silently fail. print a blank line instead.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1562 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-19 20:48:14 +00:00
bojar
fa31d83421
even factors that are being added can be gzipped
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1561 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-19 17:32:51 +00:00
bojar
eec1bdb623
added support to open gzipped files
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1560 1f5c12ca-751b-0410-a591-d2e778427230
2008-02-19 16:05:11 +00:00
nicolabertoldi
ae319da62b
revert to /bin/sh for enhanced-mert; use of setenv (instead of export) in the csh scripts created by qsub-wrapper.pl
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1515 1f5c12ca-751b-0410-a591-d2e778427230
2007-11-21 14:31:21 +00:00
nicolabertoldi
0176c5f8ec
use of csh instead of sh
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1514 1f5c12ca-751b-0410-a591-d2e778427230
2007-11-21 07:59:17 +00:00
bojar
89ea9828ba
added ttable iterator to this script, too
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1498 1f5c12ca-751b-0410-a591-d2e778427230
2007-11-06 03:33:41 +00:00
bojar
09d8b5e657
improving documentation and allowing environment variables to override the default paths
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1497 1f5c12ca-751b-0410-a591-d2e778427230
2007-11-06 03:16:17 +00:00
nicolabertoldi
568f92b310
bug fixed
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1491 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-30 15:42:26 +00:00
jdschroeder
e52040bc12
added str length check to stop std::out_of_range error in a few more spots - similar bug to one corrected in v. 1319
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1488 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-26 12:42:55 +00:00
nicolabertoldi
fd3ecd4334
bug fixed
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1487 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-25 14:34:49 +00:00
nicolabertoldi
1b0576ba6c
small bug fixed: temporary concatenated sorted file is now deleted only at the end
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1486 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-24 07:50:51 +00:00
nicolabertoldi
8710cc9bc9
features can be activated using a comma- or blank-separated list
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1485 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-23 16:55:02 +00:00
nicolabertoldi
9e70b5ffd8
Features are activated using their names
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1484 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-23 16:30:02 +00:00
nicolabertoldi
8fe62f2b95
some small bugs fixed and clean up
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1483 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-23 14:15:19 +00:00
nicolabertoldi
4720d1cb9f
bug fixed
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1482 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-22 17:01:53 +00:00
nicolabertoldi
918dae011a
bug fixed
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1481 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-22 14:00:10 +00:00
nicolabertoldi
e7ac20d4d6
bug fixed in the name of a temporary file
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1480 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-22 13:34:12 +00:00
nicolabertoldi
db9d0fc539
Added a more time-efficient (but more memory-intensive) method to rescore nbest list
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1479 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-22 10:08:07 +00:00
nicolabertoldi
b827d51870
changes to cope with the new mert suite (enhanced-mert)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1478 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-22 09:45:19 +00:00
jdschroeder
a969197e16
Fixed passing decoder parameters when tuning on single machine.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1477 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-22 09:27:09 +00:00
nicolabertoldi
5759005857
Suite of scripts to perform MERT on a subset of features.
...
Look at the directory example to learn about its use.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1476 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-22 09:04:03 +00:00
redpony
0cf583e249
add --hmm-align option. Allows using Giza++'s HMM word alignment model as the underlying word alignment. It is much faster than Model 4 alignment and not much worse.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1474 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-19 03:44:05 +00:00
nicolabertoldi
901823d83a
explicit export of PYTHONPATH variable
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1473 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-18 15:33:04 +00:00
nicolabertoldi
81b439d728
minor changes in passing parameters to moses-parallel
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1472 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-18 12:36:49 +00:00
hieuhoang1972
4e1cad4bbe
fixed sync/async bug
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1471 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-04 17:31:31 +00:00
redpony
57dcaa8e80
performance fixes for scorer
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1470 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-02 21:43:54 +00:00
hieuhoang1972
9cbc2922b4
separate word penalty for each decode step for async
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1469 1f5c12ca-751b-0410-a591-d2e778427230
2007-10-02 12:52:00 +00:00
hieuhoang1972
d2d03c33e7
fixed bug which prevented mert working when phrase table NOT filtered or binarised
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1449 1f5c12ca-751b-0410-a591-d2e778427230
2007-08-10 15:48:58 +00:00
hieuhoang1972
53fa2cb18a
async - don't use binarising or filtering
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1443 1f5c12ca-751b-0410-a591-d2e778427230
2007-08-05 20:29:46 +00:00
hieuhoang1972
9eba034662
turn off debugging
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1435 1f5c12ca-751b-0410-a591-d2e778427230
2007-07-25 10:05:34 +00:00
hieuhoang1972
2beb0c44e9
mkdir before doing generation
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1425 1f5c12ca-751b-0410-a591-d2e778427230
2007-07-14 09:54:05 +00:00
nicolabertoldi
75afdf04a5
I corrected direction of alignment
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1421 1f5c12ca-751b-0410-a591-d2e778427230
2007-06-27 17:57:55 +00:00
nicolabertoldi
ac91cb78cc
two additional (and simpler) ways of extracting alignments: source-to-target and target-to-source
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1420 1f5c12ca-751b-0410-a591-d2e778427230
2007-06-27 16:34:14 +00:00
nicolabertoldi
7f9c2856c2
changes to reduce disk memory consumption during training
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1419 1f5c12ca-751b-0410-a591-d2e778427230
2007-06-27 16:30:20 +00:00
phkoehn
960bebdd4a
fixed clean script to handle '|'s
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1416 1f5c12ca-751b-0410-a591-d2e778427230
2007-06-18 15:50:04 +00:00
redpony
c747cdd505
fix dumb error
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1414 1f5c12ca-751b-0410-a591-d2e778427230
2007-06-01 16:27:56 +00:00
redpony
1f050e198a
fix compile error, enable optimizations
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1413 1f5c12ca-751b-0410-a591-d2e778427230
2007-06-01 16:26:26 +00:00
redpony
564bb5a64e
make scorer use compiler optimization
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1412 1f5c12ca-751b-0410-a591-d2e778427230
2007-06-01 16:22:30 +00:00
hieuhoang1972
aa25c7341d
fixed bug with non-ascii data, received from Jaakko Väyrynen
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1392 1f5c12ca-751b-0410-a591-d2e778427230
2007-05-21 13:06:40 +00:00
bojar
74954cb0ae
prefer hardlinking, dropped dependency on a proprietary script
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1381 1f5c12ca-751b-0410-a591-d2e778427230
2007-05-09 00:54:07 +00:00
bojar
31def05428
- added a comment where the binarizer is
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1380 1f5c12ca-751b-0410-a591-d2e778427230
2007-05-07 07:19:02 +00:00
abarun
ba90a05233
Added script to perform Minimum Bayes Risk reranking
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1372 1f5c12ca-751b-0410-a591-d2e778427230
2007-05-02 17:26:01 +00:00
hieuhoang1972
13e07cef5f
multiple distance based distortion for async decoder
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1370 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-23 19:17:38 +00:00
redpony
485bda2db5
andreas zollman's changes to write span information
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1367 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-20 14:53:46 +00:00
jdschroeder
3e1aabc487
Removed a few errant svn diff lines that found their way into the file.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1366 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-19 10:52:17 +00:00
jdschroeder
752d148c6e
Changed initial setting of number of distortion weights from 0 to 1. For models with lexicalized reordering, this script was generating one too few weights in moses.ini
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1365 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-18 19:38:14 +00:00
redpony
c80d8b8d47
Support for the decoding of arbitrary word lattices. Must be given in the form of a "plf" file, which is a little tricky. I'll add documentation at some point; for now, refer to the example plf file in the "lattice-surface" regression test.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1359 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-18 14:08:46 +00:00
hieuhoang1972
45dde20c54
comment out psyco library
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1354 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-12 17:48:48 +00:00
hieuhoang1972
75c20e7609
Add alignment info to phrase table
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1352 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-11 19:58:38 +00:00
hieuhoang1972
b84191c9d3
Add alignment info to phrase table
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1351 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-11 19:56:43 +00:00
hieuhoang1972
b9d2288c22
compilable with visual studio
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1349 1f5c12ca-751b-0410-a591-d2e778427230
2007-04-11 11:35:36 +00:00