Month abbreviations shouldn't be causing a sentence split.

Yes this will break existing tokenized data :-(.
This commit is contained in:
Kenneth Heafield 2014-12-05 03:41:01 -05:00
parent 824bd174f1
commit f97ed79a70

View File

@ -105,3 +105,17 @@ Nos
Art #NUMERIC_ONLY# Art #NUMERIC_ONLY#
Nr Nr
pp #NUMERIC_ONLY# pp #NUMERIC_ONLY#
#month abbreviations
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec