Code for "Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies" by Tom Kocmi, Vilém Zouhar, Christian Federmann, and Matt Post.
@inproceedings{kocmi-etal-2024-navigating,
title = "Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies",
author = "Kocmi, Tom and Zouhar, Vil{\'e}m and Federmann, Christian and Post, Matt",
editor = "Ku, Lun-Wei and Martins, Andre and Srikumar, Vivek",
booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = aug,
year = "2024",
address = "Bangkok, Thailand",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.acl-long.110",
doi = "10.18653/v1/2024.acl-long.110",
pages = "1999--2014",
}
See the MT Thresholds tool, which converts a score difference in one metric into an estimated accuracy, and an accuracy back into the equivalent score difference in another metric.
pip3 install mt-thresholds
# a BLEU difference of 1.00 corresponds to 63.989% accuracy
mt-thresholds bleu 1.00
# ChrF needs a 0.710 difference for the same accuracy as a 1.00 BLEU difference
mt-thresholds chrf 0.63989 --delta
Or use it from Python:
import mt_thresholds
mt_thresholds.accuracy(1.0, "bleu") # 0.63989
mt_thresholds.delta(0.63989, "chfr") # 0.710
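As a sketch of how the two functions compose, the snippet below defines a small helper that converts a score difference in one metric into the difference in another metric with the same estimated accuracy. The helper name equivalent_delta is our own illustration and not part of the package; only mt_thresholds.accuracy and mt_thresholds.delta are the documented API.

import mt_thresholds

def equivalent_delta(delta, source_metric, target_metric):
    # Estimate the accuracy implied by a score difference in the source metric,
    # then find the difference in the target metric with the same accuracy.
    accuracy = mt_thresholds.accuracy(delta, source_metric)
    return mt_thresholds.delta(accuracy, target_metric)

# ChrF difference equivalent to a 1.00 BLEU difference
print(equivalent_delta(1.0, "bleu", "chrf"))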
We plan to release the code for replicating the WMT results in the coming months.