Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BUG: NotImplementedError #8

Open
oYoungCo opened this issue Feb 29, 2024 · 3 comments
Open

BUG: NotImplementedError #8

oYoungCo opened this issue Feb 29, 2024 · 3 comments

Comments

@oYoungCo
Copy link

When I run the script of Training.embedding_model, a bug is occurred. How can I fix it?
File /gritlm/training/run.py", line 166, in main
else: raise NotImplementedError
NotImplementedError

@Muennighoff
Copy link
Collaborator

if you check line 166 you see that it's cuz your training data is not in the right format, what data are you using & whats ur script?

@oYoungCo
Copy link
Author

if you check line 166 you see that it's cuz your training data is not in the right format, what data are you using & whats ur script?
I run this script, and data is toy_data provided in code.
torchrun --nproc_per_node 1
-m training.run
--output_dir test_path
--model_name_or_path openaccess-ai-collective/tiny-mistral
--train_data training/toy_data
--learning_rate 1e-5
--num_train_epochs 5
--per_device_train_batch_size 2
--dataloader_drop_last True
--normalized True
--temperature 0.02
--query_max_len 32
--passage_max_len 128
--train_group_size 2
--negatives_cross_device
--mode unified

@Muennighoff
Copy link
Collaborator

It should have been fixed now (by just removing the raise)! Thanks for raising this!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants