Add support for downloading publicly available datasets #340

niksirbi · 2024-11-06T08:15:01Z

This came up in discussions with @gchindemi.

There are several publicly available pose datasets - e.g. CalMS21 and Rat7M.

We could add some functionality for easily fetching these data in movement, by following a similar strategy to our sample_data module, i.e. download with pooch, cache locally, and (optionally) load into a movement dataset.

Something like this would be useful for:

us, because it would help us write tutorials/examples, and potentially to test and profile our code
the community, especially developers of new methods, who could benefit from easy access to the various datasets and benchmarks during the prototyping phase

That said, we would have to think about somehow differentiating between publicly available datasets, and our own test data hosted on GIN (see also #237).

The text was updated successfully, but these errors were encountered:

niksirbi · 2024-11-06T08:17:28Z

Clarifications:

this would concern not only pose datasets, but also perhaps data with bounding boxes or segmentations masks (pending Add support for tracked segmentation masks? #301).
we needn't do it all at once, we should start with one dataset, to figure out the interface, and then proceed from there.

gchindemi · 2024-11-11T18:27:32Z

I think this could be very useful. I implemented this functionality for the CalMS21 dataset in our social behavior analysis package LISBET. I would be happy to contribute it to movement, after you decide how to expose public datasets.

niksirbi · 2024-11-12T09:41:31Z

Thanks @gchindemi, I'll keep you in the loop about this.

SkepticRaven · 2024-11-15T15:28:40Z

Keep me in the loop, too. The group I'm in has released a handful of big pose datasets including the MABe22 mouse triplets and our open field strain survey, with more on the horizon.

niksirbi added the enhancement New optional feature label Nov 6, 2024

github-project-automation bot added this to movement progress tracker Nov 6, 2024

github-project-automation bot moved this to 🤔 Triage in movement progress tracker Nov 6, 2024

niksirbi changed the title ~~Add support for downloading publicly available pose datasets~~ Add support for downloading publicly available datasets Nov 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for downloading publicly available datasets #340

Add support for downloading publicly available datasets #340

niksirbi commented Nov 6, 2024

niksirbi commented Nov 6, 2024

gchindemi commented Nov 11, 2024

niksirbi commented Nov 12, 2024

SkepticRaven commented Nov 15, 2024

Add support for downloading publicly available datasets #340

Add support for downloading publicly available datasets #340

Comments

niksirbi commented Nov 6, 2024

niksirbi commented Nov 6, 2024

gchindemi commented Nov 11, 2024

niksirbi commented Nov 12, 2024

SkepticRaven commented Nov 15, 2024