oliver-wilson
106c4e0fc2
Init thread specific data on RandLM before sentence processing. Needed
...
for multithreaded RandLM.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3910 1f5c12ca-751b-0410-a591-d2e778427230
2011-03-02 16:07:20 +00:00
pjwilliams
75709a6c87
Memory efficiency: make the reserveSize argument non-optional in
...
Moses::Phrase's constructor. The default used to be ARRAY_SIZE_INCR = 10,
which will be excessive in many cases. Where the default was used, I've
set the exact size where that was obvious and explicitly used ARRAY_SIZE_INCR
otherwise. If you know the code involved, it's probably worth reviewing.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3908 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-28 11:41:08 +00:00
leven101
a5a860ad1f
optimized some parameters
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3907 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-25 14:53:34 +00:00
hieuhoang1972
251011df9e
irstlm variable
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3906 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-25 02:26:05 +00:00
phkoehn
93ed0ca1a9
get rid of "unused variable" warnings
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3903 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-24 16:17:38 +00:00
hieuhoang1972
a0b6abdfd3
run beautify.perl. Consistent formatting for .h & .cpp files
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3901 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-24 13:14:42 +00:00
oliver-wilson
06df9d1770
Changed white space to test effect on svn blame.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3892 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-22 16:48:51 +00:00
bhaddow
e7a8ba3b7f
check for correct number of factors
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3891 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-22 13:06:16 +00:00
maurocettolo
4c6dfbddc3
minor changes to make Moses compliant with IRSTLM toolkit (release 5.60.01)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3878 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-11 11:32:35 +00:00
ales-t
e922c159b6
Alignment points are also created for unknown source words.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3876 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-08 18:04:09 +00:00
pjwilliams
d4359f9875
If Boost is available, use per-sentence object pools to allocate ProcessedRule
...
and WordConsumed objects (which are used to store rule table lookup state).
Large numbers of these objects are used during decoding and this can
significantly improve performance, especially for multithreaded decoding,
though at the cost of increased total memory use.
The ./configure option --disable-boost-pool can be used to disable this
feature if memory is tight. This currently only affects moses_chart with
in-memory rule tables.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3873 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-07 15:43:19 +00:00
bhaddow
6221d2a558
Patch to add covered to osgx from Dennis Mehay
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3872 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-04 16:21:57 +00:00
maurocettolo
8fcd76f2fc
made handling of chunk LM compatible with recent efficiency updates to IRSTLM toolkit by Nicola
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3870 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-04 07:42:56 +00:00
phkoehn
4e72cd91be
added decoding-graph-backoff, still experimenting with it
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3869 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-03 13:41:44 +00:00
bojar
0bc0ece594
Ales Tamchyna's printing of alignments (-print-alignment-info did nothing)
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3867 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-03 09:08:42 +00:00
hieuhoang1972
0eed5716b7
get rid of linked trans opt
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3864 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-02 11:24:19 +00:00
hieuhoang1972
e087e78df9
get rid of linked trans opt
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3863 1f5c12ca-751b-0410-a591-d2e778427230
2011-02-02 11:06:19 +00:00
hieuhoang1972
abacb9166a
xcode
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3857 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-28 14:57:55 +00:00
heafield
87f15593da
Remove vestigial len parameter from language model calls
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3856 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-27 19:01:45 +00:00
pjwilliams
967b7be213
Support for multithreading in moses_chart (-threads option). This hasn't
...
been thoroughly tested yet, so don't be surprised if it breaks. Verbose
output will be scrambled.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3853 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-26 13:15:13 +00:00
redpony
eddb28e0ce
facilitate programmatic creation of word lattices
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3848 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-25 20:08:29 +00:00
pjwilliams
8051c5ad35
Use TranslationTask objects to perform sentence decoding in moses-chart.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3846 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-25 17:15:30 +00:00
pjwilliams
99bbfe938b
Use OutputCollector to write moses-chart output.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3843 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-25 15:17:17 +00:00
pjwilliams
67b30ea0c7
Move sentence-specific rule lookup state out of PhraseDictionarySCFG and
...
PhraseDictionaryOnDisk and into ChartRuleLookupManager.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3842 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-24 19:14:19 +00:00
bhaddow
7b6503680a
Shortcut when trans opts cache is size 0. Avoids potential uninitialised read.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3825 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-21 21:25:14 +00:00
hieuhoang1972
96bd3a164d
vs.net
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3802 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-17 16:19:33 +00:00
bhaddow
a9cd71628a
Change of boost macros - please make sure you favourite configuration still works
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3799 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-13 23:38:48 +00:00
pjwilliams
d20667a46d
Faster lookup for rules with source and/or target syntax labels (in-memory rule
...
table only).
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3797 1f5c12ca-751b-0410-a591-d2e778427230
2011-01-13 00:25:10 +00:00
hieuhoang1972
41c5b3a1c2
xcode
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3769 1f5c12ca-751b-0410-a591-d2e778427230
2010-12-10 13:04:06 +00:00
heafield
5e9df58a3c
Respect -v 0
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3768 1f5c12ca-751b-0410-a591-d2e778427230
2010-12-09 22:13:09 +00:00
bhaddow
50f0e6c07d
Add a show-weights option. It prints out the moses features and exits. May
...
load tables as a side-effect.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3744 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-29 16:44:28 +00:00
bhaddow
6255216b6a
Remove gnu-specific typeof
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3725 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-22 10:05:17 +00:00
bhaddow
a7e0977eea
Fix compile error by using correct macro.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3720 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-18 10:27:30 +00:00
chardmeier
837a667a95
Cleaned up language modelling code by disentangling the decoder's LM feature
...
function from the LM toolkit abstraction layer. There are two different groups
of classes now:
- LanguageModel, which inherits from StatefulFeatureFunction and contains
the n-gram model feature function.
- LanguageModelImplementation, which is the base class of the individual
LM implementations (SRI, IRST, RandLM, KenLM) and provides methods to
query LM probabilities and states.
Each LanguageModel controls a LanguageModelImplementation. Implementations can
be shared by more than one LanguageModel.
This should make it easier to use the LM libraries as a backend for other
feature functions while retaining the flexibility to use different LM toolkits.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3719 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-17 14:06:21 +00:00
chardmeier
d18ff948f5
Bugfixes in srilm adaptor.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3718 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-17 13:23:44 +00:00
leven101
84d83480b6
function name changes
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3711 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-15 11:32:02 +00:00
leven101
5251a2823a
separated source and target vocab in suffixarrays to support unequal factors
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3710 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-15 11:28:27 +00:00
bojar
2ea140062b
don't warn about probs outside [0,1] in -verbose 0
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3698 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-10 11:51:26 +00:00
heafield
2784923899
Rename a bunch of kenlm files. A ./regenerate-makefiles.sh is required.
...
Make loading with MAP_POPULATE on Linux and read on other OSes the default.
Use LM #9 for lazy loading, as recommended by other devs.
Slightly faster trie.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3688 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-06 00:40:16 +00:00
leven101
34b45c0480
removed debug messages from BilingualDynSuffixArray.cpp
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3687 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-04 18:41:04 +00:00
bhaddow
3aee6fab5d
Use correct conditional compilation flag for threaded moses
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3686 1f5c12ca-751b-0410-a591-d2e778427230
2010-11-03 18:43:18 +00:00
heafield
c12c2c59d2
Autodetect model from binary format.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3675 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-28 01:05:04 +00:00
hieuhoang1972
735d5b682f
xcode
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3673 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-27 18:54:50 +00:00
heafield
614d6002a6
Integrate heafield-refactorlm. Faster kenlm with new binary format. Stateful language model
...
framework.
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3671 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-27 17:50:40 +00:00
nicolabertoldi
bb08dcb5b6
made code compliant with the enhanced IRSTLM library; IRSTLM release 5.50.01 is needed; back compatibility is not assured;
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3662 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-26 16:06:01 +00:00
nicolabertoldi
3d7f391ae5
remove inefficient function calls
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3638 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-22 13:30:50 +00:00
hieuhoang1972
564a9ad371
word alignment info in binary pt, code provided by Christof Pintaske
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3637 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-22 10:37:19 +00:00
hieuhoang1972
ac875ffc81
mem leak
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3617 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-12 14:47:58 +00:00
hieuhoang1972
620d92c68a
bug in on-disk pt due to refactoring
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3616 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-12 14:10:19 +00:00
rsennrich
6e032b6ab5
XML spans parameter now uses '-' as token segmentation everywhere, as documented on the website.
...
git-svn-id: https://mosesdecoder.svn.sourceforge.net/svnroot/mosesdecoder/trunk@3609 1f5c12ca-751b-0410-a591-d2e778427230
2010-10-07 08:35:08 +00:00