mosesdecoder/phrase-extract/postprocess-egret-forests/SplitPoint.h
Phil Williams 9e88f794e6 Add phrase-extract/postprocess-egret-forests
This performs some minor transformations to Egret forests: escaping of
Moses special characters; removal of "^g" suffixes from constituent labels;
and marking of slash/hyphen split points (using @ characters).
2015-03-10 13:51:30 +00:00

28 lines
464 B
C++

#pragma once
#include <vector>
#include <string>
#include "Forest.h"
namespace MosesTraining
{
namespace Syntax
{
namespace PostprocessEgretForests
{
struct SplitPoint {
int tokenPos;
int charPos;
std::string connector;
};
void MarkSplitPoints(const std::vector<SplitPoint> &, Forest &);
void MarkSplitPoints(const std::vector<SplitPoint> &, std::string &);
} // namespace PostprocessEgretForests
} // namespace Syntax
} // namespace MosesTraining