Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for downloading publicly available datasets #340

Open
niksirbi opened this issue Nov 6, 2024 · 4 comments
Open

Add support for downloading publicly available datasets #340

niksirbi opened this issue Nov 6, 2024 · 4 comments
Labels
enhancement New optional feature

Comments

@niksirbi
Copy link
Member

niksirbi commented Nov 6, 2024

This came up in discussions with @gchindemi.

There are several publicly available pose datasets - e.g. CalMS21 and Rat7M.

We could add some functionality for easily fetching these data in movement, by following a similar strategy to our sample_data module, i.e. download with pooch, cache locally, and (optionally) load into a movement dataset.

Something like this would be useful for:

  • us, because it would help us write tutorials/examples, and potentially to test and profile our code
  • the community, especially developers of new methods, who could benefit from easy access to the various datasets and benchmarks during the prototyping phase

That said, we would have to think about somehow differentiating between publicly available datasets, and our own test data hosted on GIN (see also #237).

@niksirbi niksirbi added the enhancement New optional feature label Nov 6, 2024
@niksirbi niksirbi changed the title Add support for downloading publicly available pose datasets Add support for downloading publicly available datasets Nov 6, 2024
@niksirbi
Copy link
Member Author

niksirbi commented Nov 6, 2024

Clarifications:

  • this would concern not only pose datasets, but also perhaps data with bounding boxes or segmentations masks (pending Add support for tracked segmentation masks? #301).
  • we needn't do it all at once, we should start with one dataset, to figure out the interface, and then proceed from there.

@gchindemi
Copy link

I think this could be very useful. I implemented this functionality for the CalMS21 dataset in our social behavior analysis package LISBET. I would be happy to contribute it to movement, after you decide how to expose public datasets.

@niksirbi
Copy link
Member Author

Thanks @gchindemi, I'll keep you in the loop about this.

@SkepticRaven
Copy link

Keep me in the loop, too. The group I'm in has released a handful of big pose datasets including the MABe22 mouse triplets and our open field strain survey, with more on the horizon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New optional feature
Projects
Status: 🤔 Triage
Development

No branches or pull requests

3 participants