Train and Validation losses behavior #1822
-
I'm working with my students in a final graduation project that aims to extract building footprints using the Inria dataset, but not the one available through TorchGeo, but the one available in Kaggle. We are using a CSV logger. Our hyper parameters to process for 10 epochs are (we went until 30 epochs):
We are using a CustomGeoDataModule:
To calculate the train losses, we are determining the average of the steps for each epoch in the CSV log generated. Making the almost the same process with RasterVision give us a "well behaved loss graphic". What could be the reason for that? Thanks in advance. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 13 replies
-
Not an answer to your question, but I'm really curious how your course went and how you structured it. If you have time, I would love to discuss this in more detail over slack (see the invite on our README) or email (see my email on https://github.com/adamjstewart). |
Beta Was this translation helpful? Give feedback.
-
Hey @lcoandrade, It looks like the loss values are super high so I would check really low level things:
|
Beta Was this translation helpful? Give feedback.
Yes, the above code without normalization looks correct. Maybe try the other steps @calebrob6 suggested, including plotting the image/mask to make sure they look right.