Knowledge distillation for vision guide #25619
This sentence is not actually true: ResNet and MobileNet each have their own image processors.
They do return the same thing, because each processor just applies the same preprocessing at the same resolution. Check this out.
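One way to settle this kind of question is to compare the two processors' preprocessing settings directly. The sketch below is hypothetical and not part of the PR: the helper name `config_differences` and the list of keys are assumptions, and the dictionaries stand in for what `image_processor.to_dict()` would return for each checkpoint. The values shown are illustrative only, not read from any real model.

```python
# Keys assumed to determine the preprocessed output; real processors may expose more.
PREPROCESSING_KEYS = ("do_resize", "size", "do_normalize", "image_mean", "image_std")


def config_differences(config_a, config_b, keys=PREPROCESSING_KEYS):
    """Return {key: (value_a, value_b)} for every key whose values differ."""
    diffs = {}
    for key in keys:
        value_a, value_b = config_a.get(key), config_b.get(key)
        if value_a != value_b:
            diffs[key] = (value_a, value_b)
    return diffs


# Illustrative configs (made-up values), e.g. teacher_processor.to_dict()
# and student_processor.to_dict() in a real session.
teacher_config = {
    "do_resize": True,
    "size": {"height": 224, "width": 224},
    "do_normalize": True,
    "image_mean": [0.5, 0.5, 0.5],
    "image_std": [0.5, 0.5, 0.5],
}
student_config = dict(teacher_config)  # identical settings

# An empty dict means both processors apply the same preprocessing settings.
print(config_differences(teacher_config, student_config))
```

If the result is empty, a single processor can reasonably be shared between teacher and student; any reported key marks a setting where the two would produce different inputs.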
Maybe also push the final model to the Hub? `trainer.push_to_hub()`?
I think the final model is already pushed when we set `push_to_hub` to `True` (I also have the save strategy set to every epoch, so a push is triggered every epoch as well), no?
AFAIK, `trainer.push_to_hub()` also creates a basic model card, e.g. with metrics and some training results.
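The two mechanisms discussed in this thread are not mutually exclusive: `push_to_hub=True` in the training arguments uploads on each save, and one explicit `trainer.push_to_hub()` call after training makes a final upload that, as noted in this thread, also generates the model card. A minimal sketch, assuming `trainer` is an already-configured `Trainer` built elsewhere in the guide. The helper name `train_and_publish` is hypothetical.

```python
def train_and_publish(trainer, commit_message="End of training"):
    """Train, then make one explicit final push to the Hub.

    `trainer` is assumed to be a transformers `Trainer` (or any object with
    `train()` and `push_to_hub(commit_message=...)` methods).
    """
    trainer.train()
    # Explicit final push; per the review discussion this also writes a
    # basic model card with metrics and training results.
    return trainer.push_to_hub(commit_message=commit_message)
```

Calling `train_and_publish(trainer)` at the end of the guide would replace a bare `trainer.train()` call.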