Create dataset loader for Intercontinental Dictionary Series (IDS) #709

Open
SamuelCahyawijaya opened this issue Jul 30, 2024 · 0 comments

Dataloader name: intercontinental_dictionary_series/intercontinental_dictionary_series.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?intercontinental_dictionary_series

Dataset: intercontinental_dictionary_series
Description: The Intercontinental Dictionary Series (IDS) is a database that organizes lexical material from the world's languages so that it can be compared across languages. Every word list uses the same format and the same topical outline, which ensures cross-linguistic comparability, with up to 1310 entries per language.
Subsets: -
Languages: khb, pcb, prt, cbn, nut, tha, giq, nod, tyh, smu, san, shn, kkh, mra, blr, syo, tdd, pnx, cog, zng, thm, bgk, rbb, sou, vie
Tasks: Word lists
License: Creative Commons Attribution 4.0 (cc-by-4.0)
Homepage: https://ids.clld.org
HF URL: -
Paper URL: -
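
As a starting point for whoever picks this up, the metadata above could be captured in plain Python roughly as follows. This is only a sketch: the constant names, the `WordListEntry` fields, and the `subset_id` helper are hypothetical and are not taken from the SEACrowd codebase or from the IDS data format; only the values (dataset name, homepage, license, language codes, entry limit) come from this issue.

```python
from dataclasses import dataclass

# Values copied from the issue; all names here are illustrative assumptions.
DATASET_NAME = "intercontinental_dictionary_series"
HOMEPAGE = "https://ids.clld.org"
LICENSE = "cc-by-4.0"
MAX_ENTRIES_PER_LANGUAGE = 1310

# Language codes exactly as listed in the issue.
LANGUAGES = [
    "khb", "pcb", "prt", "cbn", "nut", "tha", "giq", "nod", "tyh", "smu",
    "san", "shn", "kkh", "mra", "blr", "syo", "tdd", "pnx", "cog", "zng",
    "thm", "bgk", "rbb", "sou", "vie",
]


@dataclass(frozen=True)
class WordListEntry:
    """One word-list row (hypothetical schema, not the actual IDS layout)."""

    language: str    # one of LANGUAGES
    concept_id: str  # position in the shared topical outline
    form: str        # the word form recorded for that concept


def subset_id(language: str) -> str:
    """Build a per-language subset name, rejecting codes not in the issue."""
    if language not in LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")
    return f"{DATASET_NAME}_{language}"
```

Keeping the language list in one place means a per-language subset name can be validated before any download is attempted; the real loader would still need to follow the project's dataloader template.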
@SamuelCahyawijaya SamuelCahyawijaya converted this from a draft issue Jul 30, 2024