Create dataset loader for Okapi m-TruthfulQA #477

SamuelCahyawijaya · 2024-03-03T09:45:49Z

Dataloader name: okapi_m_truthfulqa/okapi_m_truthfulqa.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?okapi_m_truthfulqa

Dataset	okapi_m_truthfulqa
Description	m-TruthfulQA is a multi-lingual version of TruthfulQA, a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics.
Subsets	-
Languages	ind, vie
Tasks	Question Answering
License	Creative Commons Attribution Non Commercial 4.0 (cc-by-nc-4.0)
Homepage	http://nlp.uoregon.edu/download/okapi-eval/datasets/
HF URL	https://huggingface.co/datasets/jon-tow/okapi_truthfulqa
Paper URL	https://arxiv.org/abs/2307.16039

The text was updated successfully, but these errors were encountered:

tellarin · 2024-03-03T18:27:58Z

#self-assign

tellarin · 2024-03-19T05:37:46Z

Back working on this. Sorry for the delay.

SamuelCahyawijaya added this to SEACrowd Data Hub Mar 3, 2024

SamuelCahyawijaya converted this from a draft issue Mar 3, 2024

github-actions bot assigned tellarin Mar 3, 2024

github-actions bot added the staled-issue label Mar 18, 2024

github-actions bot removed the staled-issue label Mar 20, 2024

github-actions bot added the staled-issue label Apr 3, 2024

Provide feedback