13 Commits

Author SHA1 Message Date
Phil Williams
fa51da28c5 moses/phrase-extract refactoring
Final commit in this round of refactoring (which started with commit
2f735998...).  The main changes are:

  - a general storage mechanism for attribute/value pairs in XML-style
    tree / lattice input.  E.g. the "pcfg-score" and "semantic-role"
    attributes in:

     <tree label="PRP" pcfg-score="1.0" semantic-role="AGENT"> I </tree>

  - consolidation of the various near-duplicate Tree / XmlTreeParser classes
    that have accumulated over the years (my fault)

  - miscellaneous de-crufting
2015-06-09 16:50:27 +01:00
Phil Williams
ed321791a7 Ongoing moses/phrase-extract refactoring 2015-06-03 11:10:45 +01:00
Phil Williams
5e09d3dc71 Ongoing moses/phrase-extract refactoring 2015-06-03 10:33:46 +01:00
Phil Williams
d3fb4a8002 Ongoing moses/phrase-extract refactoring 2015-06-02 10:16:42 +01:00
Phil Williams
985e7bbfc3 Ongoing moses/phrase-extract refactoring 2015-05-29 20:57:25 +01:00
Phil Williams
2f735998ca Rename MosesTraining::SyntaxTree to MosesTraining::SyntaxNodeCollection
This is the first step in a small-scale refactoring effort that will touch a
lot of the syntax-related code in moses/phrase-extract.  The end goals are:

  - a storage mechanism for general attribute/value pairs in XML-style
    tree / lattice input.  E.g. the "pcfg-score" and "semantic-role"
    attributes in:

     <tree label="PRP" pcfg-score="1.0" semantic-role="AGENT"> I </tree>

  - consolidation of the various near-duplicate Tree / XmlTreeParser classes
    that have accumulated over the years (my fault)

  - general de-crufting
2015-05-29 18:46:02 +01:00
Hieu Hoang
05ead45e71 beautify 2015-01-14 11:07:42 +00:00
Hieu Hoang
92089f9726 ignore 0 span. Don't bomb out 2014-06-11 04:35:09 +01:00
Rico Sennrich
bc0cac59be unescape "&#124;" (for compatibility with escape-special-chars scripts) 2014-01-18 12:23:21 +00:00
Hieu Hoang
6249432407 beautify 2013-05-29 18:16:15 +01:00
Phil Williams
139148bc8f extract-ghkm and friends: don't unescape special characters
Don't unescape special characters when reading XML parse trees in
extract-ghkm, extract-rules, and relax-parse.
2012-12-17 20:08:02 +00:00
Hieu Hoang
ef9db932aa add namespace to phrase-extract 2012-06-30 15:43:47 +01:00
Hieu Hoang
4eef94b121 move c++ code out of /script/ to / 2012-05-31 17:24:06 +01:00