Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training fixes #21

Merged
merged 5 commits into from
Jun 20, 2024
Merged

Training fixes #21

merged 5 commits into from
Jun 20, 2024

Conversation

dfulu
Copy link
Member

@dfulu dfulu commented Jun 20, 2024

This pull request includes:

  1. A slight refactor of file names to leave name-space for new future models
  2. Fix the shuffling from datapipes+DataLoader similar to Fix shuffling and minor tweaks PVNet#118
  3. Rescale inside the default model. Previously, when using the relative_scale_pvnet_outputs option, out input values to the network were of order 1E-4

Note that this will break the compatibility with already trained summation models

@dfulu dfulu marked this pull request as ready for review June 20, 2024 10:56
@dfulu dfulu requested a review from Sukh-P June 20, 2024 10:57
Copy link
Member

@Sukh-P Sukh-P left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, nice work!

@@ -164,7 +163,7 @@ def _get_premade_batches_datapipe(self, subdir, shuffle=False, add_filename=Fals
file_pipeline = FileLister(f"{self.batch_dir}/{subdir}", masks="*.pt", recursive=False)

if shuffle:
file_pipeline = file_pipeline.shuffle(buffer_size=1000)
file_pipeline = file_pipeline.shuffle(buffer_size=10_000)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Out of interest, what's the reason for the buffer size change here?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No particular reason. Just that at this stage all we'll being doing is loading up to 1000 or 10,000 file names before shuffling them. I thought that since the filenames don't take up any amount of space why not use more and have a more complete shuffle.

@dfulu dfulu merged commit 55599d1 into main Jun 20, 2024
3 checks passed
@dfulu dfulu deleted the training_fixes branch June 20, 2024 15:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants