- dataset: opus
- model: transformer
- source language(s): pli san
- target language(s): pli san
- pre-processing: normalization + SentencePiece (spm4k,spm4k)
- a sentence initial language token is required in the form of `>>id<<` (id = valid target language ID)
- download: opus-2021-02-10.zip
- test set translations: opus-2021-02-10.test.txt
- test set scores: opus-2021-02-10.eval.txt

| testset | BLEU | chr-F |
|---|---|---|
| Tatoeba-test.eng-san.eng.san | 0.1 | 0.095 |
| Tatoeba-test.multi.multi | 0.1 | 0.097 |
| Tatoeba-test.san-eng.san.eng | 0.1 | 0.119 |
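
The sentence initial language token requirement noted above can be illustrated with a minimal sketch. The helper name `add_target_token` and the `VALID_TARGETS` set are hypothetical, introduced here for illustration only; the set simply mirrors the target languages listed in this card, and the single space between the token and the sentence is an assumption about the expected input format.

```python
# Hypothetical helper, not part of any released tooling for this model.
# Target language IDs as listed in this card.
VALID_TARGETS = {"pli", "san"}


def add_target_token(sentence: str, target_lang: str) -> str:
    """Prefix a source sentence with the >>id<< target-language token."""
    if target_lang not in VALID_TARGETS:
        raise ValueError(f"unsupported target language ID: {target_lang!r}")
    # Assumption: the token is separated from the text by one space.
    return f">>{target_lang}<< {sentence.strip()}"


if __name__ == "__main__":
    # The input text here is a placeholder, not a real source sentence.
    print(add_target_token("source sentence goes here", "san"))
```

The prefixed string is what would then be passed through normalization and SentencePiece segmentation before translation.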