A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
-
Updated
Oct 9, 2024 - Python
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
[CVPR 2021: Oral] In this work, we show that high frequency Fourier spectrum decay discrepancies are not inherent characteristics for existing CNN-based generative models.
DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.
High-level API for tar-based dataset
This repo is the official released code of FoPro (AAAI-2023)
A sample subset of the NIH Chest X-ray Dataset. At only 2.4% of the size of the original dataset, it allows creating an accurate classifier using the Augmented Chest X-Ray repository.
Scripts to collect data from CARLA and save them as Webdataset
Web project, using SparQL on dbpedia and wikidata for mental disorder search feature
Add a description, image, and links to the webdataset topic page so that developers can more easily learn about it.
To associate your repository with the webdataset topic, visit your repo's landing page and select "manage topics."