Pretrained model #10
Comments
Oh, sorry, I just saw that you actually use the mini dataset for training and the full one for a full-scale evaluation. Closing the issue |
Thanks. Yes, actually the training part is the same. Training in Colab: About sharing a trained model, yes, I can. The plan is to write a one-page Colab demo that loads it for the next update. I really welcome feedback from Colab users. I feel this is the way for this open project to go. |
I am wondering if it is possible to install
|
I was able to run the training process in Colab with Miniconda, but just installing requirements without Miniconda leads to an error. #12 should fix it. Restoring from that checkpoint doesn't work for some reason. It outputs a long list of messages like
|
@Mihonarium Thanks for the report. Yes, it seems we don't need conda for Colab. Just About your checkpoint loading issue, let me ask:
BTW, just try |
Yes, I did specify the config. What's even more strange, the issue with a lot of warnings appears only with The notebook: https://gist.github.com/Mihonarium/e3fd355cb560b82373fd2186139f1bc2 (the last cells show that |
@Mihonarium Oh, it is expected behavior, as I wrote above. The checkpoint file contains the optimizer's state info, which is GPU-device dependent. So, if you want to continue training using my checkpoint as the initial parameters, it's possible, but I didn't consider such a use. It requires loading the model without connecting the optimizer first (as in |
@Mihonarium About training from scratch error: First, for P100 GPU, I recommend
You didn't get
It depends on how many threads the system can handle. |
Got it, makes sense. Thanks! Training from scratch didn't give any errors, I interrupted it. I included it to show that errors are from the checkpoints load (I didn't know it was the expected behavior) and not from something else. You're right though, I would probably get an |
Got |
@Mihonarium Solved by removing pathlib for argin. Also fixed the same issue for the --output option. |
@mimbres @Mihonarium Is it possible for you to share the trained model? It's quite hard to train with a batch size of 320. 🤞 |
@TheMightyRaider the trained model is available here |
Thanks! @Mihonarium |
I used the pretrained model and the same database (Dataset-mini) for the evaluation step, but I got very poor results. I want to know why. This is my code
Top1 exact 3.75 5.90 6.45 7.25 7.25 7.80
|
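For readers unfamiliar with the metric reported in the comment above, here is a minimal sketch of how a "Top1 exact" figure could be computed. It assumes the value is the percentage of queries whose top-ranked retrieved ID exactly matches the ground-truth ID. The function name `top1_exact_rate` and the sample IDs are hypothetical, and this is not the repository's evaluation code.

```python
def top1_exact_rate(predicted_ids, true_ids):
    """Return the percentage of queries whose top-1 retrieved ID
    equals the ground-truth ID (assumed meaning of "Top1 exact")."""
    if len(predicted_ids) != len(true_ids):
        raise ValueError("predicted_ids and true_ids must have the same length")
    if not true_ids:
        raise ValueError("need at least one query")
    # Count exact matches between the top-ranked result and the ground truth.
    hits = sum(p == t for p, t in zip(predicted_ids, true_ids))
    return 100.0 * hits / len(true_ids)


# Hypothetical data: 4 queries, 2 of which retrieve the correct ID.
predicted = [10, 42, 7, 99]
truth = [10, 41, 7, 98]
print(f"Top1 exact: {top1_exact_rate(predicted, truth):.2f}")  # prints "Top1 exact: 50.00"
```

Under this reading, values in the single digits would mean that fewer than one query in ten retrieved the exact segment, which is consistent with the commenter describing the result as very poor.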
While it's relatively easy to train the model on the Dataset-mini (even Colab allows that), it's not as easy to reproduce the paper's results with the Dataset-full. It would be great if you could publish a model trained on the full dataset.
(By the way, congratulations on the paper, and thanks for publishing the work, it's really cool!)