Create dataset loader for M3LS #228
Comments
#self-assign
Hi, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.
Working on it. Will try to finish this week
No problem! Feel free to let us know anytime you would like to discuss.
Hi, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help.
#self-assign
and for this one, I'll try to do this immediately after this one is PR-ed:
Hi, it seems the dataset itself is exceedingly large (the zipped version is around 14GB; unsure about the actual size after unzipping, which I'm doing now). Also, there is a foreseeable blocker in bypassing the Google Drive download process by passing the GDrive URL to either [...] If none of them is possible, maybe the last resort is to change the format into a local-based dataset
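For reference, the Google Drive URL handling mentioned above could be sketched as below. This is only an illustration, not the approach taken in the dataloader: the helper names `gdrive_file_id` and `gdrive_direct_url` are hypothetical, and it assumes the commonly used `uc?export=download&id=...` direct-link form. For an archive of around 14GB, Google Drive may still show a confirmation page instead of serving the file, so this alone may not remove the blocker.

```python
import re
from urllib.parse import parse_qs, urlparse


def gdrive_file_id(share_url: str) -> str:
    """Extract the file ID from a common Google Drive share link (hypothetical helper)."""
    parsed = urlparse(share_url)
    # Form 1: https://drive.google.com/file/d/<ID>/view?usp=sharing
    match = re.search(r"/file/d/([A-Za-z0-9_-]+)", parsed.path)
    if match:
        return match.group(1)
    # Form 2: https://drive.google.com/open?id=<ID> or .../uc?id=<ID>
    ids = parse_qs(parsed.query).get("id")
    if ids:
        return ids[0]
    raise ValueError(f"No Google Drive file ID found in: {share_url}")


def gdrive_direct_url(share_url: str) -> str:
    """Build a direct-download style URL from a share link (assumed URL form)."""
    return f"https://drive.google.com/uc?export=download&id={gdrive_file_id(share_url)}"


if __name__ == "__main__":
    # Placeholder ID, not the real M3LS file.
    print(gdrive_direct_url("https://drive.google.com/file/d/FILE_ID_123/view?usp=sharing"))
```

If the direct link does not work for a file this large, the local-based fallback (the user downloads and unzips the archive manually, then points the loader at the directory) remains the simpler option.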
updates:
any thoughts or ideas? @holylovenia @SamuelCahyawijaya
Thanks for inspecting this dataset, @sabilmakbar! I think this dataset is a multimodal dataset, precisely a multilingual
* add m3ls
* Update seacrowd/sea_datasets/m3ls/m3ls.py
* Apply suggestions from code review
  update to comply w/ `black` formatter
  Co-authored-by: Frederikus Hudi <[email protected]>
* Update m3ls.py
* Update m3ls.py
* Update m3ls.py
  following `black` formatter
---------
Co-authored-by: Lj Miranda <[email protected]>
Co-authored-by: Frederikus Hudi <[email protected]>
Dataloader name: `m3ls/m3ls.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?m3ls