Skip to content

Latest commit

 

History

History
21 lines (17 loc) · 918 Bytes

README.md

File metadata and controls

21 lines (17 loc) · 918 Bytes

opus-2021-02-10.zip

  • dataset: opus
  • model: transformer
  • source language(s): pli san
  • target language(s): pli san
  • model: transformer
  • pre-processing: normalization + SentencePiece (spm4k,spm4k)
  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)
  • download: opus-2021-02-10.zip
  • test set translations: opus-2021-02-10.test.txt
  • test set scores: opus-2021-02-10.eval.txt

Benchmarks

testset BLEU chr-F
Tatoeba-test.eng-san.eng.san 0.1 0.095
Tatoeba-test.multi.multi 0.1 0.097
Tatoeba-test.san-eng.san.eng 0.1 0.119