Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about the hyperparameter #4

Open
EmiliaKKK opened this issue Apr 23, 2022 · 0 comments
Open

Question about the hyperparameter #4

EmiliaKKK opened this issue Apr 23, 2022 · 0 comments

Comments

@EmiliaKKK
Copy link

Thanks for your amazing work!
I have two question about hyperparameter in the experiment setting.

1.

image

In paper 4.3, the initial lr is 7e4. Is that a typo? If not ,I'm really confuse why the lr is so large.

2.

image

The κ there is on the order of e4, which would lead the

image

about 1, since C(xi) may on the order of e-1. I wonder why κ is set to be so big. If wi is all around 1.0, it looks like "Loss weighting" is almost the same as the normal method. Am I right or I've just missed something?

Thank you in advance for your reply! BTW, I'm one of your fans in Bilibili. Your explanation of the paper was very clear and helpful to me. Thank you for your great work!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant