Commit Graph

1404 Commits

Author SHA1 Message Date
phkoehn
bfbbefd710 bug fixes
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1917 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-26 03:32:29 +00:00
mphi
8a4c6a2c63 pus significance test into proper location
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1915 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-23 09:16:33 +00:00
mphi
88d3b775ce altered the bootstrap significance script algorithm according to (Riezler and Maxwell 2005 @ MTSE'05)
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1914 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-23 09:03:41 +00:00
phkoehn
a09242ad16 bug fix with phrase table name in moses.ini, when using hmm alignment
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1913 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-20 21:37:57 +00:00
mphi
f033e32979 Added implementation of Koehn's 2004 EMNLP paired bootsrap resampling
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1911 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-20 11:55:12 +00:00
phkoehn
1c7b305152 bug fix
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1910 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-19 07:32:48 +00:00
phkoehn
1b5d99ad26 added headers for standard compliance (gcc 4.3 on 64 bit linux)
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1905 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-16 21:14:38 +00:00
phkoehn
3a5981ce9d major improvements, see email to moses-support
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1904 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-15 23:25:14 +00:00
phkoehn
614876771d extended extract/score, to allow for one big file, not just parts
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1903 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-15 22:12:56 +00:00
hieuhoang1972
d9d1b8f748 fix caching bug.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1902 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-14 19:25:18 +00:00
hieuhoang1972
a88239c5c0 unix build
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1901 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-10 18:31:07 +00:00
hieuhoang1972
254005bcee unix build
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1900 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-10 18:30:59 +00:00
hieuhoang1972
6a61ffa9bd unix build
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1899 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-10 18:30:51 +00:00
hieuhoang1972
868428fe66 unix build
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1898 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-09 00:20:39 +00:00
hieuhoang1972
928d771085 create namespace
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1897 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-08 23:51:26 +00:00
redpony
76090cdda0 more normalization of feature names
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1896 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-05 19:34:49 +00:00
redpony
624e87a08f add more logging
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1895 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-05 18:12:42 +00:00
redpony
849538f73b make feature names the same.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1894 1f5c12ca-751b-0410-a591-d2e778427230
2008-10-05 16:37:42 +00:00
hieuhoang1972
1ea1d4f9b1 always with unix line endings
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1893 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-29 19:34:54 +00:00
hieuhoang1972
68a2461cb3 vs build
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1892 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-24 17:04:29 +00:00
redpony
a81390c5bb fix a couple names
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1891 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-24 17:03:07 +00:00
redpony
232dc9889c enable moses to accept a file that lists feature name and weight pairs.
enable moses to export its search graph as a phrase lattice encoded serialized in a Google protocol buffer. This requires protoc (http://code.google.com/p/protobuf/) to function, disabled by default.


git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1890 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-24 16:48:23 +00:00
redpony
bb0ade93f7 a little refactoring in preparation for yet another way to export the search lattice.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1889 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-23 19:39:56 +00:00
hieuhoang1972
8ba603c39e visual studio build
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1888 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-14 01:31:02 +00:00
nicolabertoldi
9cbde412e2 support for creating binary Phrase Tables including word-to-word alignments
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1887 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-12 18:19:41 +00:00
nicolabertoldi
dd6c36640b Support for printing out word-to-word alignments (besides phrase-to-phrase alignments)
as contained in the phrase table.
If PT contains word-to-word alignments between source and target phrases,
Moses can optionally output them in the nbest and in the log file (if verbose).
W2w alignments from source to target and from target to source can differ,
if they differ in the PT.

Detailed documentation will be added in the Moses webpages very soon.


git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1886 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-12 18:09:06 +00:00
nicolabertoldi
e376f9f994 mv some Timer functions into the .cpp file
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1885 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-12 15:31:46 +00:00
bhaddow
cd28f119c6 mert tests
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1881 1f5c12ca-751b-0410-a591-d2e778427230
2008-09-01 17:18:51 +00:00
jdschroeder
ea5ddd4d82 fixed nasty out-of-bounds array read in WordsBitmap, simplified (fixed?) lattice checks.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1879 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-26 17:04:43 +00:00
jdschroeder
78534c1518 made all zcat calls through ZCAT variable.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1875 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-15 16:15:51 +00:00
mfederico
842f5842a2 Integrated handling of oov penalty into irstlm library.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1874 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-05 11:56:32 +00:00
mfederico
7a0e5811ba Fixed bug with dub option.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1873 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-05 11:44:50 +00:00
hieuhoang1972
5dee5d04aa rename IOStream to IOWrapper.
move vs.net solution file to root folder

git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1872 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-05 00:24:45 +00:00
maurocettolo
90e3107ef4 just commented a print on stderr
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1871 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-04 16:55:21 +00:00
mfederico
9e61900fad Fixed bug concerning the handling of the oov penalty with IRSTLM
Now, the penalty for out-of-vocabulary words is specified 
by the parameter 

-lmodel-dub: dictionary upper bounds of language models

For instance, if you set it lmodel-dub to 1000000 (1M) and your actual 
vocabulary is let me say 200000 (200K), then the LM probabilty  of the
OOV word-class is divided by 800000 (800K), i.e. 1M-200K

You have to make sure that lmodel-dub is always larger than the LM 
dictionary.



git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1870 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-04 13:06:52 +00:00
jdschroeder
7a2ebedc20 minor bugfixes and error checking
-added -rootdir option to enhanced-mert
	-fixed float regex in score-nbest.py and mert-moses.pl
	-allow for extra weights in constructing ini in mert-moses.pl
	-additional NFS bug checks in mert-moses.pl



git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1869 1f5c12ca-751b-0410-a591-d2e778427230
2008-08-01 10:24:01 +00:00
hieuhoang1972
dd9691d28c abort if try to get substring of confusion network. returning emptyy string just screws things up
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1868 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-30 16:29:51 +00:00
hieuhoang1972
03bd63e312 get rid of md5
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1867 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-28 10:10:07 +00:00
bojar
6a087d59c4 removed SCRIPTS_ROOTDIR from this 'my' declaration, it was obscuring previous
declaration!
lines wrapped


git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1865 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-14 16:24:18 +00:00
bojar
2afe9e0357 avoid coredump files in parallel moses (usually just kills NFS for a while),
debug on a smaller scale, if needed


git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1864 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-14 13:58:42 +00:00
hieuhoang1972
620b0c34cc abort if internal LM asked to do more than trigram
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1863 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-11 14:55:06 +00:00
bojar
c20e682f18 Avoid NFS race condition:
explicitly remove old cmert output files (hoping that they will be correctly
  replaced by a 'mv' in the shell script submitted to SGE by qsubwrapper
  occasionally reveals a race condition in NFS => weights seem unchanged =>
  mert finishes too early)


git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1862 1f5c12ca-751b-0410-a591-d2e778427230
2008-07-10 11:47:55 +00:00
saintamh
9d106392e6 Bugfix
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1861 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-30 09:18:14 +00:00
bhaddow
83f234cf17 Implementation of Cer et al mert regularisation. Use with argument such
as --scconfig regtype:min,regwin:3 in extractor and mert. Only tested
on toy example so far.


git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1860 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-24 19:27:18 +00:00
hieuhoang1972
6ddde13dca fixed constraint format bug
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1859 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-20 20:24:16 +00:00
dowobeha
3e1c6c39ff Fixed constraint decoding - there may be a bug in Util::Tokenize
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1858 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-20 15:45:06 +00:00
hieuhoang1972
81c7e5118b must provide line no to constraint file
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1857 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-19 22:47:54 +00:00
dowobeha
67c8bdd328 Constraint decoding works, but not for cube pruning.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1856 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-19 22:40:51 +00:00
hieuhoang1972
52c2843e6c perl regexpr bug, submitted by German Sanchis Trilles
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1855 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-19 21:57:29 +00:00
dowobeha
7a4b1fb699 Added preliminary code for constraint decoding
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@1854 1f5c12ca-751b-0410-a591-d2e778427230
2008-06-18 23:14:09 +00:00