Commit Graph

5 Commits

Author SHA1 Message Date
MosesAdmin
c3424ce541 daily automatic beautifier 2015-07-21 00:00:42 +01:00
Philipp Koehn
6d0f482361 extended phrase lookup: print sentences, document id 2015-07-20 11:41:48 -04:00
Jeroen Vermeulen
b2d821a141 Unify tokenize() into util, and unit-test it.
The duplicate definition works fine in environments where the inline
definition becomes a weak symbol in the object file, but if it gets
generated as a regular definition, the duplicate definition causes link
problems.

In most call sites the return value could easily be made const, which
gives both the reader and the compiler a bit more certainty about the code's
intentions.  In theory this may help performance, but it's mainly for clarity.

The comments are based on reverse-engineering, and the unit tests are based
on the comments.  It's possible that some of what's in there is not essential,
in which case, don't feel bad about changing it!

I left a third identical definition in place, though I updated it with my
changes to avoid creeping divergence, and noted the duplication in a comment.
It would be nice to get rid of this definition as well, but it'd introduce
headers from the main Moses tree into biconcor, which may be against policy.
2015-04-22 09:59:05 +07:00
Hieu Hoang
05ead45e71 beautify 2015-01-14 11:07:42 +00:00
Philipp Koehn
1ed54a6181 add tool for phrase lookup with biconcor 2014-09-21 06:03:51 +01:00