-
Notifications
You must be signed in to change notification settings - Fork 467
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix validation loss calculation #5
base: master
Are you sure you want to change the base?
Conversation
Hey @sthalles I saw you updated Here the problem is not only the Let me show you an example with numbers:
If at the end you divide That's why in this PR I changed the divisor to Now, I see that |
Hi Alessia,
Thanks for the message.
About the zero division, that is true and easy to solve, I will fix it.
Regarding the validation loss calculation, I did not push the code
because actually dividing by (counter + 1) would be wrong.
That follows a simple code with the example you described above.
Regards
counter = 0 # counter needs to start at zero
valid_loss = 0.0
for i in range(5):
print("Batch {}".format(i))
print("----")
valid_loss += 2
counter += 1
print("valid_loss:", valid_loss)
print("counter:", counter)
print("Final valid loss:", valid_loss / counter)
…------
Batch 0 ---- Batch 1 ---- Batch 2 ---- Batch 3 ---- Batch 4 ----
valid_loss: 10.0 counter: 5 Final valid loss: 2.0
--
Thalles Santos Silva
Computer Vision and Deep Learning engineer.
+55 (73) 98222-8989
On Sat, Apr 18, 2020 at 12:21 PM Alessia Marcolini ***@***.***> wrote:
Hey @sthalles <https://github.com/sthalles> I saw you updated simcrl.py
in the last commit
<c057471>,
but the behavior of the code is the same as before: counter starts from 0.
Here the problem is not only the ZeroDivisionError (in case you don't
have enough validation samples to fill a validation batch - so the for loop
never runs), but also that at the end you get a wrong valid_loss.
Let me show you an example with numbers:
- Say that you have 100 samples in your validation set and a batch
size of 20, then you'll perform 5 iterations on the validation set.
- counter will range from 0 to 4 (included).
- Suppose that at each iteration you get a loss of 2 (silly example).
- You're accumulating the losses in valid_loss, so at the end of the 5
iterations valid_loss will be equal to 10.
If at the end you divide valid_loss by counter, you'll get 10/4,
resulting 2.5, and not 10/5 resulting 2, as expected.
That's why in this PR I changed the divisor to counter + 1.
Now, I see that simclr.py conflicts with my changes: do you want me to
resolve the conflict and push again?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#5 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AA2V5PFEEUVVKDBOYY4GBTTRNHAQDANCNFSM4MJNIBLQ>
.
|
Hi Talles, thank you for your quick response. Actually I was referring to Regards |
Fixes issue #4