From cefb97b59d867d5d6d9cbb83661f4dab42791b35 Mon Sep 17 00:00:00 2001 From: Ryo Takahashi Date: Fri, 2 Jul 2021 16:06:52 +0900 Subject: [PATCH] Fix link to nfkc.tsv in normalization.md --- doc/normalization.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/doc/normalization.md b/doc/normalization.md index af16b14..fd6a4dc 100644 --- a/doc/normalization.md +++ b/doc/normalization.md @@ -34,7 +34,7 @@ You can use custom normalization rule by preparing a TSV file formatted as follo ``` In this sample, UCS4 sequence [41 302 300] (hex) is converted into [1EA6] (hex). When there are ambiguities in the conversions, the longest rule is used. Note that the tab is used as a delimiter for source and target sequence and space is used as a delimiter for UCS4 characters. We can make the target sequence empty to remove some specific characters from the text. -See [data/nfkc.tsv](data/nfkc.tsv) as an example. Once a TSV file is prepared, you can specify it with `--normalization_rule_tsv` flag. +See [data/nfkc.tsv](../data/nfkc.tsv) as an example. Once a TSV file is prepared, you can specify it with `--normalization_rule_tsv` flag. ``` % spm_train --normalization_rule_tsv= --input= --model_prefix= --vocab_size=8000 ```