Code for a new machine translation benchmark, Tatoeba #15

Traubert · 2021-05-21T13:32:28Z

Hi, I'm proposing to integrate the Tatoeba machine translation dataset into sotabench-eval. I have included code for running the tests, modeled after WMT, and for downloading and configuring the data. I'm not 100% sure how the caching is supposed to work at the moment, I'll come back to that.

Currently you can:

import sotabencheval
from sotabencheval.machine_translation import TatoebaEvaluator, TatoebaDataset

# The test data will be downloaded and unpacked under the directory "tatoeba", this only needs to be done if the data isn't already present
sotabencheval.machine_translation.tatoeba.fetch_and_configure_data("tatoeba")
evaluator = TatoebaEvaluator(dataset=TatoebaDataset.v1, source_lang="eng", target_lang="deu", local_root="tatoeba", model_name="Some model", paper_arxiv_id="Some id")

evaluator.add({1: "Tom mag die italienische Küche.", 2: "Hier wirst du viel lernen."})
print(evaluator.get_results(ignore_missing = True))

You should be able to merge this without breaking anything, but please point me towards what else needs to be done...

Traubert added 4 commits May 20, 2021 15:47

Add Tatoeba modules

9034c66

Add tatoeba.py

4c001a8

Parse configuration from remaining directory structure

a79b17a

Data-downloading and configuring code

8970f86

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code for a new machine translation benchmark, Tatoeba #15

Code for a new machine translation benchmark, Tatoeba #15

Traubert commented May 21, 2021

Code for a new machine translation benchmark, Tatoeba #15

Are you sure you want to change the base?

Code for a new machine translation benchmark, Tatoeba #15

Conversation

Traubert commented May 21, 2021