Ulrich Germann
b6c887b370
Minor bug fix in logging biased sampling for phrase lookup.
2015-04-03 20:18:55 +01:00
Ulrich Germann
93ce2423df
1. A context string for biased sampling in Mmsapt can now be provided on the
...
command line with --context-string. Not available in server mode yet.
2. Numerous bug fixes related to biased sampling.
3. Biased sampling now checks that the sampling sticks to the bias. If
the distribution of samples deviates too much from the bias, samples
whose selection would push the sample distribution even further from the bias
are not considered, even if that means that fewer samples are chosen in total.
2015-04-03 16:16:52 +01:00
Jeroen Vermeulen
c52d7be025
Make an include path explicit.
...
Better to see clearly where that header is supposed to come from, since
it lives halfway across the codebase.
I think this may also make the "classifier" dance in moses/Jamfile
unnecessary, which would be a nice simplification, but I'm not
confident enough to just make the change.
2015-04-03 17:47:48 +07:00
Jeroen Vermeulen
ebc0930500
Replace use of tmpnam with boost::filesystem.
...
Silences a few annoying warnings from gcc: "tmpnam is dangerous" (and
the suggestion to use mkstemp instead).
2015-04-02 10:42:06 +07:00
XapaJIaMnu
29a729c99b
Remove old obsolete probingPT tests
2015-04-01 16:58:21 +01:00
Hieu Hoang
1a61e1405f
delete External FF. FF framework changes too fast to be able to keep this up-to-date
2015-03-31 19:45:59 +04:00
Ulrich Germann
a9dbced81d
Bug fix.
2015-03-30 02:56:49 +01:00
Ulrich Germann
fcbfc5a535
Feature functions and the constructors of TranslationOptionCollections
...
now have access to the current translation task.
This was done to allow context-sensitive processing (if provided by the FF).
2015-03-30 01:20:17 +01:00
Ulrich Germann
79cd40d2c4
Disabled temporarily. Needs to be adapted to API changes in Mmsapt.
2015-03-29 23:58:17 +01:00
Ulrich Germann
2899645992
Cleanup.
2015-03-29 23:57:14 +01:00
Ulrich Germann
3541838a46
Included TargetPhraseCollectionCache.* in fakelib mmsapt.
2015-03-29 23:55:47 +01:00
Ulrich Germann
76b76e4db6
Bug fixes.
2015-03-29 23:51:51 +01:00
Ulrich Germann
18d2547497
Added member for storing annotations by feature functions.
2015-03-29 23:51:22 +01:00
Ulrich Germann
1525f1ea62
Cleanup.
2015-03-29 23:44:06 +01:00
Ulrich Germann
529a766da7
Initial check-in.
2015-03-29 23:43:50 +01:00
Jeroen Vermeulen
b124d99330
Use boost::filesystem for "rm -rf".
...
Replaces a system() call (which was a portability problem) and fixes,
en passant, a warning about its return value being ignored.
2015-03-29 18:33:58 +07:00
Jeroen Vermeulen
789a2e2bc3
Fix some compile warnings (gcc 4.9.2).
...
Mostly signed/unsigned comparisons and reordered member
initializations; also a few unused variables.
There are more, but if I chip away at them for a while, who knows, it
may catch on and warnings may eventually become socially stigmatizing.
:)
2015-03-29 18:10:51 +07:00
Ulrich Germann
a706569844
Thread-safe classes for storing context-specific information.
2015-03-28 14:44:40 +00:00
Ulrich Germann
1b23edf62f
Cache for the N most recently used TargetPhraseCollections. Refactored out of mmsapt.h.
2015-03-28 14:41:08 +00:00
Jeroen Vermeulen
a9c8f44896
Modernize "C" includes in moses.
...
This is one of those little chores in managing a long-lived C++
project: standard C headers like stdio.h and math.h now have their own
place in the C++ standard as resp. cstdio, cmath, and so on. In this
branch the #include names are updated for the moses/ subdirectory; more
branches to follow.
C++11 adds cstdint, but to support compilation with the previous
standard, that change is left for later.
2015-03-28 20:09:03 +07:00
Ulrich Germann
c7ad2571f5
Making GetSource() in Manager and Basemanager virtual breaks one regression tests.
2015-03-26 23:38:58 +00:00
Ulrich Germann
e827896027
Harmonized spelling of CleanUpAfterSentenceProcessing(...).
2015-03-26 23:37:29 +00:00
Ulrich Germann
9dc75bfd8a
Managers and feature functions now have access to the entire TranslationTask, not just the InputType.
2015-03-26 18:25:54 +00:00
Ulrich Germann
b3ff4ff13a
Managers have now access to the entire translation task.
2015-03-26 16:37:04 +00:00
Ulrich Germann
d364211d90
More bjam fiddling.
2015-03-26 12:21:55 +00:00
Hieu Hoang
1064aaacbe
delete typedefs for UINT32 and UINT64. MSVC now has uint32_t and uint64_t /Ken
2015-03-25 00:55:39 +00:00
Ulrich Germann
ddf7bc3e23
1. Added FeatureFunction::Setup(TranslationTask const&) const to allow FFs to set themselves up for specific input.
...
2. FeatureFunction::ChangeSource should not be allow to change the pointer to the source.
2015-03-21 17:09:41 +00:00
Ulrich Germann
8ca11d941d
1. Lifetime of tasks in ThreadPool is now managed via shared pointers.
...
2. Code cleanup in IOWrapper and a bit elsewhere.
2015-03-21 16:12:52 +00:00
Ulrich Germann
85d2567b57
Renamed MosesServer::TranslationTask to MosesServer::TranslationRequest and made that a subtype of Moses::TranslationTask to avoid name confusion.
2015-03-21 15:56:20 +00:00
Ulrich Germann
ee4e396a4d
Removed pointer to TranslationTask in InputTypes again. Not the right place to store this information.
2015-03-21 15:29:37 +00:00
Ulrich Germann
fd56ba6afb
Renamed moses/server/TranslationTask.* to moses/server/TranslationRequest.* to avoid naming conflicts.
2015-03-21 14:00:30 +00:00
Ulrich Germann
6c73693c49
Library path for xmlrpc-c given via --with-xmlprc-c was not included in the compile requirements.
2015-03-20 22:32:42 +00:00
Ulrich Germann
b1c9d8a528
Bjam is the most awful build system in the world.
2015-03-20 18:51:49 +00:00
Ulrich Germann
e40fbbc2cb
Added pragma to tell me if Moses is built with server functionality at compilation time.
2015-03-20 18:45:46 +00:00
Ulrich Germann
9c19ad25b0
Changed order of #includes because of compiler warnings.
2015-03-20 18:43:29 +00:00
Ulrich Germann
386db6529a
More fiddling with bjam.
2015-03-19 14:36:43 +00:00
Ulrich Germann
d1cb14edc0
Fixed Jamfiles.
2015-03-19 13:45:58 +00:00
Ulrich Germann
6f52340c2b
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-18 21:04:18 +00:00
Rico Sennrich
eab513b635
relational dependency language model
2015-03-18 17:39:45 +00:00
Rico Sennrich
1568afb737
on-the-fly unbinarization of internal tree structure (for translation models extracted from binarized treebanks)
2015-03-18 17:36:32 +00:00
Ulrich Germann
286800abba
Addresses issues with compiling server capabilities into the Moses main executable.
2015-03-18 13:50:30 +00:00
Ulrich Germann
058abc32c3
Fixes (?) numerous bugs discovered when the code was actually compiled (an error in the bjam configuration prevented this).
2015-03-18 13:46:15 +00:00
Hieu Hoang
63d8b390b4
Changes to RUleScope from private branch. More codelite projects
2015-03-17 11:50:33 +00:00
Hieu Hoang
25feb7e47b
option to change the estimated score only, not actuall score
2015-03-17 10:25:34 +00:00
Ulrich Germann
dcffbb5f4d
Made LRModel::ReorderingType an enumerated type.
2015-03-16 00:24:11 +00:00
Ulrich Germann
085c88cc7b
Eliminated sources of some compiler warnings (unused variables; signed/usigned comparisons).
2015-03-15 22:45:01 +00:00
Ulrich Germann
ad805c133b
Instances of InputType (and derived classes) now know which TranslationTask (if any) created them.
...
This is a first step towards providing phrase tables etc. access to context information etc.
associated with specific translation tasks.
2015-03-15 20:38:31 +00:00
Ulrich Germann
2a66a55c85
Added document map (maps from sentences to document ids) to Bitext class.
...
Minor overhaul to the bias regime, which allows to specify bias by document
name (as provided in the document map) rather than by sentence in the static
parallel corpus.
2015-03-15 13:32:09 +00:00
Ulrich Germann
bc91743820
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-11 23:32:12 +00:00
Ulrich Germann
a49b76be3f
Quick hack to make moses not stumble over double-dash parameter specifications.
2015-03-11 23:32:06 +00:00
Kenneth Heafield
54304fd473
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
...
Conflicts:
moses/ExportInterface.cpp
2015-03-11 17:43:48 -04:00
James Zhang
23704613de
added a simple translation interface
2015-03-11 17:33:17 -04:00
Phil Williams
e8a7163f0d
Add SyntaxInputWeight feature function
...
Currently only used for forest input.
2015-03-10 11:07:04 +00:00
Ulrich Germann
137b07a486
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-10 10:41:47 +00:00
Ulrich Germann
51824355f9
Sampling now keeps track of counts for hierarchical lexicalized reordering.
2015-03-10 10:41:41 +00:00
Matthias Huck
524ed4406e
pragma once
2015-03-09 21:44:54 +00:00
Ulrich Germann
ccf4cb838c
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-09 01:27:02 +00:00
Ulrich Germann
524376fad4
Code cleanup.
2015-03-09 00:34:47 +00:00
Ulrich Germann
c4885d87c0
Code cleanup
2015-03-09 00:34:18 +00:00
Ulrich Germann
c1d2313a66
Moved caching of lexical reordering scores from class TranslationOption to class TargetPhrase.
...
This was done so that phrase tables can add this information (if available) as extra annotation
to TargetPhrases, in preparation of providing lexical reordering models with sampling phrase tables.
2015-03-09 00:30:01 +00:00
Ulrich Germann
ddea89312e
Commented out an #include that triggered a 'deprecated header' warning.
2015-03-09 00:08:31 +00:00
Hieu Hoang
afe8b4162e
link with openmp for srilm
2015-03-08 21:25:24 +00:00
Hieu Hoang
c08ebc9dae
link with openmp for srilm
2015-03-08 21:23:13 +00:00
Ales Tamchyna
9266d65304
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2015-03-06 13:44:46 +01:00
Ales Tamchyna
9224779695
include scores of source-context features in topt collection verbose output
2015-03-06 13:44:38 +01:00
Rico Sennrich
b3d2b11a36
make BilingualLM play nice with new chart decoding score deltas
2015-03-06 08:52:13 +00:00
Ulrich Germann
612e014e79
Organized command line options thematically.
...
This is the first step in migration to the use of boost::program_options for GNU/Linux-style handling of command line options.
2015-03-05 00:34:02 +00:00
Ales Tamchyna
6058bfc61b
bugfix, BP only applies when candidate is shorter than ref :)
2015-03-04 18:02:50 +01:00
Ales Tamchyna
ff1cae919b
minor refactoring in VW feature
2015-03-04 17:40:05 +01:00
Ales Tamchyna
34649b74d3
implement BLEU2+1 as training loss for VW
2015-03-04 17:10:59 +01:00
Ales Tamchyna
7601618477
some refactoring of VW, towards customizable loss calculation in training
2015-03-04 14:26:26 +01:00
Ulrich Germann
4ad0db1d16
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-04 00:20:14 +00:00
Ulrich Germann
982286b38f
Folded contrib/server/mosesserver.cpp into the moses executable.
...
With immediate effect, mosesserver is deprecated, use moses --server instead.
2015-03-04 00:07:11 +00:00
Matthias Huck
2543c78b71
RulePairUnlexicalizedSource feature function
2015-03-02 14:32:18 +00:00
Ulrich Germann
9291bb80ca
Merge branch 'master' of https://github.com/moses-smt/mosesdecoder
2015-03-02 11:24:30 +00:00
Ulrich Germann
e1c02ee57c
Code reorganization.
2015-03-02 11:23:00 +00:00
Hieu Hoang
0829e5196d
path arg /Cheng Yong
2015-02-28 13:59:36 +00:00
Matthias Huck
0d4678577b
floor value, glue rules, some cleanup
2015-02-27 02:39:43 +00:00
Matthias Huck
0afc261251
SoftSourceSyntacticConstraintsFeature: Sparse label pair scores
2015-02-26 20:27:02 +00:00
Matthias Huck
342abcf8fc
Model1Feature: score caching for improved efficiency
2015-02-25 23:13:15 +00:00
Matthias Huck
0a6adcde3a
Model1Feature: special UNK token in MGIZA vocabularies
2015-02-25 01:27:08 +00:00
Matthias Huck
39c1ef52dc
Model1Feature: unnecessary includes & special UNK token in MGIZA vocabularies
2015-02-25 00:52:15 +00:00
Matthias Huck
3c8d48f8af
Model1Feature: a simple IBM Model 1 scorer,
...
source-to-target with global source-sentence context
2015-02-25 00:11:31 +00:00
Hieu Hoang
28fbf07c37
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2015-02-24 12:35:12 +00:00
Hieu Hoang
59986d8c81
change default mac-chart-span 10->20. /Arefeh Kazemi
2015-02-24 12:35:00 +00:00
Phil Williams
0bff50449e
Fix -search-algorithm 8
2015-02-23 11:12:00 +00:00
Matthias Huck
372d14a9e8
missing condition
2015-02-20 21:46:38 +00:00
Matthias Huck
f317a2cebd
PhraseOrientationFeature
...
better efficiency via precomputation of non-terminal orientation classes,
plus a bugfix (weights vector was not read correctly)
2015-02-20 20:53:50 +00:00
Matthias Huck
9f4ebd83e0
TargetPhrase: can store arbitrary data
2015-02-20 20:53:25 +00:00
Hieu Hoang
32de075022
beautify
2015-02-19 12:27:23 +00:00
Hieu Hoang
6d4bad0f78
Merge branch 'master' of github.com:moses-smt/mosesdecoder
2015-02-18 21:56:48 +00:00
Hieu Hoang
add14db330
use VERBOSE instead of cerr
2015-02-18 21:56:35 +00:00
Phil Williams
357f33b0d5
Fix -output-unknowns for -search-algorithm 3 (moses_chart)
2015-02-18 16:22:13 +00:00
Hieu Hoang
b223256040
park Joint LM. Doesn't work anymore
2015-02-18 15:11:03 +00:00
Matthias Huck
6028c7cf9c
tuneable-components config parameter for feature functions
2015-02-16 18:39:25 +00:00
Ales Tamchyna
f6884c55a1
restoring broken evaluation of translation option list
2015-02-16 12:32:00 +01:00
Ales Tamchyna
a46daa1938
target (internal) bigrams for VW
2015-02-16 11:36:16 +01:00
Ales Tamchyna
b05fc4c9bc
properly register bigram VW feature
2015-02-16 11:30:36 +01:00
Ales Tamchyna
a9926e7f49
source bigram feature for VW
2015-02-16 10:59:55 +01:00
Ulrich Germann
064b01c054
Code reorganization and cleanup around Lexical Reordering Round 2.
...
Hope it compiles this time.
2015-02-15 16:34:54 +00:00