Barry Haddow
d57d0c0656
Merge branch 'master' into miramerge
...
Conflicts:
Makefile.am
config.h.in
configure.in
kenlm
moses-chart-cmd/src/IOWrapper.cpp
moses/src/Parameter.cpp
scripts/Makefile
2011-11-17 17:46:17 +00:00
Barry Haddow
79de3c8699
Complete initial version of pro extractor
2011-11-16 14:54:23 +00:00
Barry Haddow
58268cc3dd
Test for pro
2011-11-14 16:03:56 +00:00
Barry Haddow
42a3f28b42
Speed up decoding by reducing score copies.
...
Hypothesis gets the weighted score from previous,
and lazily computes full breakdown. Changes lex
reorder scores very slightly (third decimal place),
hence test change.
2011-11-03 22:33:05 +00:00
Barry Haddow
90820ad0c5
Merge branch 'master' into miramerge.
...
Also fix mert-moses.pl to use correct flag for specifying
weights of non-core features.
Conflicts:
.gitignore
configure.in
ltmain.sh
moses/src/LM/Factory.cpp
moses/src/LMList.cpp
moses/src/LMList.h
moses/src/LanguageModel.cpp
moses/src/LanguageModel.h
moses/src/LanguageModelKen.h
moses/src/Makefile.am
moses/src/Manager.cpp
moses/src/PhraseDictionaryMemory.cpp
moses/src/PhraseDictionaryTree.cpp
moses/src/StaticData.cpp
moses/src/TargetPhrase.h
2011-10-28 15:54:23 +01:00
Barry Haddow
d51f9c281a
Reinstate lex reorder bin test, and bump up data version.
2011-10-25 15:47:00 +01:00
Barry Haddow
39e1927d09
Disable failing tests
2011-10-25 10:42:57 +01:00
Ales Tamchyna
3e765d259a
Merge branch 'mtm6_cruise_control'
2011-10-21 13:58:47 +01:00
hieuhoang1972
9b00ae425c
add regression test for binary models with lex reordering. Fix other regression tests - changed to account for new LM prob from IRST or use KenLM
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4367 1f5c12ca-751b-0410-a591-d2e778427230
2011-10-15 03:51:26 +00:00
hieuhoang1972
195cc1a2c1
add regression test for binary models with lex reordering. Fix other regression tests - changed to account for new LM prob from IRST or use KenLM
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4366 1f5c12ca-751b-0410-a591-d2e778427230
2011-10-15 03:50:00 +00:00
Barry Haddow
c83166087e
Merge branch 'master' into miramerge
...
Conflicts:
moses/src/LanguageModel.cpp
moses/src/TargetPhrase.h
moses/src/TrellisPath.h
moses/src/Util.h
scripts/training/train-model.perl
2011-10-12 17:14:23 +01:00
hieuhoang1972
7fa74c1eb2
roll back kenlm tests. Binarized files are OS-dependent
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4287 1f5c12ca-751b-0410-a591-d2e778427230
2011-10-01 12:22:24 +00:00
hieuhoang1972
e1c808ad9a
roll back kenlm tests. Binarized files are OS-dependent
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4286 1f5c12ca-751b-0410-a591-d2e778427230
2011-10-01 11:33:19 +00:00
hieuhoang1972
6faf20707c
roll back kenlm tests. Binarized files are OS-dependent
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4285 1f5c12ca-751b-0410-a591-d2e778427230
2011-10-01 11:09:19 +00:00
hieuhoang1972
f51239cf68
kenlm regression tests
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4284 1f5c12ca-751b-0410-a591-d2e778427230
2011-09-30 14:12:18 +00:00
bhaddow
5b7c5ebdb5
Merge branch 'master' into miramerge
...
Conflicts:
moses-chart-cmd/src/Main.cpp
moses-cmd/src/IOWrapper.cpp
moses-cmd/src/Main.cpp
moses/src/DummyScoreProducers.cpp
moses/src/DummyScoreProducers.h
moses/src/GenerationDictionary.cpp
moses/src/GenerationDictionary.h
moses/src/GlobalLexicalModel.h
moses/src/LMList.h
moses/src/LanguageModel.cpp
moses/src/LanguageModel.h
moses/src/LanguageModelImplementation.h
moses/src/LanguageModelKen.h
moses/src/LanguageModelMultiFactor.cpp
moses/src/LanguageModelMultiFactor.h
moses/src/LanguageModelSingleFactor.cpp
moses/src/LanguageModelSingleFactor.h
moses/src/LexicalReordering.h
moses/src/PhraseDictionary.cpp
moses/src/PhraseDictionary.h
moses/src/ScoreIndexManager.cpp
moses/src/ScoreProducer.h
moses/src/StaticData.cpp
moses/src/StaticData.h
moses/src/TranslationSystem.cpp
2011-09-20 11:23:38 +01:00
hieuhoang1972
358720c24a
don't score with good-turing. phi has moved this calc to another file
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4220 1f5c12ca-751b-0410-a591-d2e778427230
2011-09-13 18:14:14 +00:00
bhaddow
d53350dd95
Regression test for lm oov feature
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4218 1f5c12ca-751b-0410-a591-d2e778427230
2011-09-12 20:53:22 +00:00
bhaddow
8d558d2e04
Fix test script
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4194 1f5c12ca-751b-0410-a591-d2e778427230
2011-09-08 18:03:30 +00:00
bhaddow
ed8d85acc1
moving mert tests
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4193 1f5c12ca-751b-0410-a591-d2e778427230
2011-09-08 17:51:09 +00:00
Ales Tamchyna
94fa872b6d
return '2' if a test failed
2011-09-07 11:16:44 +02:00
bojar
ca1912961d
first draft of cruise control for Moses
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4166 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-29 06:20:25 +00:00
bhaddow
7b132651e8
Merge branch 'master' into miramerge
2011-08-22 16:16:32 +01:00
bhaddow
1f3c11b441
Merge branch 'master' into miramerge
...
Passes all regression except for the score tests, as I still have to merge
in the missing files for these tests.
Conflicts:
.gitignore
Makefile.am
configure.in
moses-chart/moses-chart.xcodeproj/project.pbxproj
moses/src/ChartHypothesis.cpp
moses/src/ChartTranslationOptionCollection.cpp
moses/src/ChartTrellisPath.cpp
moses/src/DotChart.cpp
moses/src/LMList.h
moses/src/LanguageModel.cpp
moses/src/LanguageModel.h
moses/src/LanguageModelFactory.cpp
moses/src/LanguageModelRemote.h
moses/src/LexicalReordering.h
moses/src/Makefile.am
moses/src/Manager.cpp
moses/src/Parameter.cpp
moses/src/RuleCubeQueue.h
moses/src/ScoreIndexManager.cpp
moses/src/StaticData.cpp
moses/src/StaticData.h
moses/src/TargetPhrase.cpp
moses/src/TargetPhrase.h
moses/src/TranslationOptionCollection.cpp
2011-08-22 14:52:02 +01:00
hieuhoang1972
3763b2466b
run scorer regression test from any directory
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4155 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-22 07:56:46 +00:00
hieuhoang1972
1873030d24
forgot to add these files for regression tests
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4154 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-22 07:40:14 +00:00
bhaddow
cf81f9188f
Merged up to 3895 from trunk
...
git-svn-id: http://svn.statmt.org/repository/mira@3934 cc96ff50-19ce-11e0-b349-13d7f0bd23df
2011-08-18 12:59:38 +02:00
bgottesman
436a285f18
stop using 'subtest' because it doesn't work for everyone, e.g. Hieu reports it doesn't work on a Mac even with an up-to-date Test::Simple module
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4144 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-16 10:51:43 +00:00
hieuhoang1972
87216f55be
rename & make executable
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4136 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-09 09:07:34 +00:00
bgottesman
0fe1c629da
if we fail to make the output directory for a test, just abort the test, don't exit the whole script
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4135 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-08 18:26:57 +00:00
bgottesman
24f5bf6723
when detokenizing, remove whitespace between a pair of CJK (Chinese/Japanese/Korean) words
...
This gets the Chinese and Japanese tests working, so remove the failure expectation.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4134 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-08 15:30:54 +00:00
bgottesman
14587cdafc
fix a detokenization bug that was preventing the removal of the whitespace following a contracted French or Italian article/pronoun (e.g. "l' immigration") when the contraction was the second-last word in the segment
...
remove the expectation of failure on the corresponding unit test
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4133 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-08 15:02:56 +00:00
bgottesman
9d9977bc6f
add TODO tests for detokenization of Chinese and Japanese
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4131 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-08 13:21:05 +00:00
bgottesman
c030dae094
Allow a test case to have an undefined language, since the detokenizer doesn't require a language to be passed in and, indeed, errors if a language is passed in for which there are no special rules (which seems dubious to me ...). Add test case TEST_GERMAN_NONASCII with an undefined language.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4130 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 19:14:01 +00:00
bgottesman
024bbe0bcc
- factor out class DetokenizerTestCase
...
- create an array of all of the test cases before running any of them
- in the case of an expected failure, move the TODO block deeper, just around the validation of the results
I'm not 100% sure I like this change; I think it makes the code slightly more elegant, but it also makes it longer.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4128 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 18:48:37 +00:00
bgottesman
d521287a3f
move commas to after here-docs, to hopefully make test cases more readable; and remove unused import
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4125 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 16:37:49 +00:00
bgottesman
76c3ef4dba
a few more detokenization tests, including a TODO one that exposes a bug
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4124 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 16:23:47 +00:00
bgottesman
eda0f4e370
An initial test suite for detokenizer.perl.
...
I realize this doesn't quite fit the paradigm of the existing Moses test suite. On the other hand, it's self-contained, easy to run, easy to add tests to (just follow the pattern in the section titled 'Definitions of individual test cases'), and uses an established Perl testing framework. I don't think it will be infeasible to incorporate it into the existing test suite.
Usage:
run-test-detokenizer.t --results-dir <RESULTS-DIRECTORY>
where <RESULTS-DIRECTORY> is an empty existing directory where the output can be written
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4121 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 14:32:39 +00:00
hieuhoang1972
30ca534b86
faster scorer
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4119 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 10:27:15 +00:00
hieuhoang1972
b4c79f721e
regression test for scorer
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4118 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 10:18:07 +00:00
hieuhoang1972
b618aadf8d
regression test for scorer
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4117 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 09:23:48 +00:00
hieuhoang1972
b8a0b09206
regression test for scorer
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4116 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-05 02:48:30 +00:00
hieuhoang1972
fc176801d6
regression test for score
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4112 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-04 09:15:43 +00:00
hieuhoang1972
e988361d62
regression test for score
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4111 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-04 08:11:40 +00:00
nicolabertoldi
579d8b0760
added a few regression tests explicitly working with IRSTLM; modified a few regression tests wrongly working with IRSTLM/SRILM; modified the required data archive (version is now 6)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4105 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-03 09:19:48 +00:00
hieuhoang1972
49e56f35bb
regression test for score
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4102 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-02 10:28:36 +00:00
hieuhoang1972
d45a29d9c7
data for score regression test
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4101 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-02 10:10:09 +00:00
hieuhoang1972
ed4367ceb0
data for score regression test
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4100 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-02 10:04:53 +00:00
hieuhoang1972
69fe991923
data for score regression test
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4099 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-02 09:53:25 +00:00
hieuhoang1972
1ae8c53a08
executable perl script
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@4098 1f5c12ca-751b-0410-a591-d2e778427230
2011-08-02 09:31:01 +00:00