Mirror of https://github.com/google/sentencepiece.git (synced 2024-10-26 11:38:45 +03:00)
Fix dead links
parent df5f7fdfc6
commit f122fb3d57
@@ -36,7 +36,7 @@ For those unfamiliar with SentencePiece as a software/algorithm, one can read [a
 |:---|:---:|:---:|:---:|
 |Supported algorithm|BPE, unigram, char, word|BPE|BPE*|
 |OSS?|Yes|Yes|Google internal|
-|Subword regularization|[Yes](#subword-regularization)|No|No|
+|Subword regularization|[Yes](#subword-regularization-and-bpe-dropout)|No|No|
 |Python Library (pip)|[Yes](python/README.md)|No|N/A|
 |C++ Library|[Yes](doc/api.md)|No|N/A|
 |Pre-segmentation required?|[No](#whitespace-is-treated-as-a-basic-symbol)|Yes|Yes|
@@ -112,7 +112,7 @@ We have evaluated SentencePiece segmentation with the following configurations.
 * [KFTT](http://www.phontron.com/kftt/index.html)
 * [MultiUN](http://opus.lingfil.uu.se/MultiUN.php) (First 5M and next
 5k/5k sentences are used for training and development/testing respectively.)
-* [WMT16](http://www.statmt.org/WMT16/)
+* [WMT16](https://www.statmt.org/wmt16/)
 * In-house: (Used 5M parallel sentences for training)
 
 **NoPretok** and **WsPretok** do not use any language-dependent resources.