Cannot reproduce the model #7

Open
mina58 opened this issue Jun 26, 2024 · 0 comments

Comments

mina58 commented Jun 26, 2024

I downloaded the dataset and followed the cleaning steps in the paper (removing files with >= 20000 characters), but this yielded a dataset of 220MB. I am trying to reproduce the model using the exact steps in the finetuning repo, but I get very different results.
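
The cleaning step described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the dataset is taken to be an in-memory list of strings (one per file), and `filter_by_length` and `total_size_mb` are hypothetical helper names, not functions from the paper or the finetuning repo. It may help pin down whether the threshold is being applied as characters rather than bytes or tokens, and whether the boundary case (exactly 20000 characters) is dropped.

```python
from typing import Iterable, List

# Threshold quoted in the issue text; files with this many characters or more are removed.
MAX_CHARS = 20_000


def filter_by_length(files: Iterable[str], max_chars: int = MAX_CHARS) -> List[str]:
    """Keep only files strictly shorter than max_chars characters (hypothetical helper)."""
    return [text for text in files if len(text) < max_chars]


def total_size_mb(files: Iterable[str]) -> float:
    """Total UTF-8 encoded size in megabytes, using 1 MB = 10**6 bytes (an assumption)."""
    return sum(len(text.encode("utf-8")) for text in files) / 1e6


if __name__ == "__main__":
    # Toy data standing in for the real dataset.
    samples = ["a" * 19_999, "b" * 20_000, "c" * 25_000]
    kept = filter_by_length(samples)
    print(f"kept {len(kept)} of {len(samples)} files, {total_size_mb(kept):.3f} MB")
```

Comparing the file count and size before and after this filter against whatever the paper reports would show whether the cleaning step itself is the source of the mismatch.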

Is there something I am doing wrong?
