mirror of
https://github.com/moses-smt/mosesdecoder.git
synced 2024-12-28 06:22:14 +03:00
#Anything in this file, followed by a period (and an upper-case word), does NOT indicate an end-of-sentence marker.
#Special cases are included for prefixes that ONLY appear before 0-9 numbers.
#
#any single upper case letter followed by a period is not a sentence ender
#usually upper case letters are initials in a name
#no French words end in single lower-case letters, so we throw those in too?
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Y
Z
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
y
z

# Period-final abbreviation list for French
A.C.N
A.M
art
ann
apr
av
auj
lib
B.P
boul
ca
c.-à-d
cf
ch.-l
chap
contr
C.P.I
C.Q.F.D
C.N
C.N.S
C.S
dir
éd
e.g
env
al
etc
E.V
ex
fasc
fém
fig
fr
hab
ibid
id
i.e
inf
LL.AA
LL.AA.II
LL.AA.RR
LL.AA.SS
L.D
LL.EE
LL.MM
LL.MM.II.RR
loc.cit
masc
MM
ms
N.B
N.D.A
N.D.L.R
N.D.T
n/réf
NN.SS
N.S
N.D
N.P.A.I
p.c.c
pl
pp
p.ex
p.j
P.S
R.A.S
R.-V
R.P
R.I.P
SS
S.S
S.A
S.A.I
S.A.R
S.A.S
S.E
sec
sect
sing
S.M
S.M.I.R
sq
sqq
suiv
sup
suppl
tél
T.S.V.P
vb
vol
vs
X.O
Z.I