Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Difference in Dataset Format #8

Open
zhiminc opened this issue Feb 15, 2024 · 1 comment
Open

Difference in Dataset Format #8

zhiminc opened this issue Feb 15, 2024 · 1 comment

Comments

@zhiminc
Copy link

zhiminc commented Feb 15, 2024

Thanks for the great work! I find the the format of dataset download from hugging face is different from the format using in the training stage (npy in hugging face and npz in the dataloader). The shape of data in the npy file are N, N, and (B+1)x512x256x32, which is totally different from the npz format:

input: (B+1)x512x256x32 matrix # B is the number of previous frames, default is 10
label: (B+1)x512x256x32 matrix # B is the number of future frames, default is 10
invalid: (B+1)x512x256x32 matrix # same as label

May you tell me what's the meaning of the data in the npy files and the difference to the npz file? Thanks a lot!

@zhiminc
Copy link
Author

zhiminc commented Feb 16, 2024

I find that in the code of data preprocessing, only the npy format is stored in the voxel folder. I cannot find any code to generate npz voxel file that is utilized in the following dataloader code:

for i in self.split:
sample_path = glob(os.path.join(self.root_dir, 'voxel', '{:04d}'.format(i), '*.npz'))
self.data_path.extend(sample_path)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant