From a929b634581c318e25ff0be15ab5d3bb1e247612 Mon Sep 17 00:00:00 2001
From: Taku Kudo
Date: Sat, 9 Jun 2018 00:32:55 +0900
Subject: [PATCH] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index cf56c85..889ee48 100644
--- a/README.md
+++ b/README.md
@@ -268,7 +268,7 @@ The usage is basically the same as that of ```subword-nmt```. Assming that L1 an
 % spm_encode --model=spm.model --generate_vocabulary < {train_file}.L2 > {vocab_file}.L2
 ```
 
-```shuffle``` command is used just in case because ```spm_encode``` loads the first 10M lines of corpus by default.
+```shuffle``` command is used just in case because ```spm_train``` loads the first 10M lines of corpus by default.
 
 Then segment train/test corpus with ```--vocabulary``` option