v0.1.96
Updates
- Improves the performance of unigram training
- Updated the nfkc normalization with the latest ICU module.
- Stop handling zero-width-joiner string as whitespace.
New features
- added new sampling algorithm without replacement.
- added API for new sampling and perplexity calculation.
- added
allow_whitespace_only_pieces
mode.