Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finish I-JEPA #1320

Open
3 of 10 tasks
guarin opened this issue Jul 14, 2023 · 5 comments
Open
3 of 10 tasks

Finish I-JEPA #1320

guarin opened this issue Jul 14, 2023 · 5 comments
Labels

Comments

@guarin
Copy link
Contributor

guarin commented Jul 14, 2023

Experimental support for I-JEPA was added in #1273

We should do some refactoring and testing before fully releasing the model.

Todo

  • Refactor I-JEPA to use timm ViT #1367
  • Verify that weights are correctly initialized and follow this
  • Check if we can refactor IJEPAMaskCollator into a transform, comment.
  • Move positional embedding functions (_get_1d_sincos_pos_embed_from_grid etc.) to lightly/models/utils and add reference to source
  • Add missing docstrings or bring into correct format, check diff from I-JEPA #1273
  • Add unit tests
  • Finish pytorch lightning and distributed examples
  • Add docs
  • Add imagenet benchmark to lightly/benchmarks/imagenet/vit/ijepa.py
  • Fix broken IJEPA Example #1712
@Natyren
Copy link
Contributor

Natyren commented Jul 14, 2023

Hello @guarin . I can work on it. Or on part of issues.

@guarin
Copy link
Contributor Author

guarin commented Jul 14, 2023

That would great! Feel free to pick anything up that you are interested in. Let me know if you need help or some clarification.

@Natyren
Copy link
Contributor

Natyren commented Jul 15, 2023

@guarin
I will take issues

  • Try if we can get it working with timm ViT backbone. I would like to do this because torchvision ViT doesn't support stochastic path dropout, see comment.
  • Verify that weights are correctly initialized and follow this
  • Check if we can refactor IJEPAMaskCollator into a transform, comment.
  • Move positional embedding functions (_get_1d_sincos_pos_embed_from_grid etc.) to lightly/models/utils and add reference to source
  • Finish pytorch lightning and distributed examples
  • Add docs
  • Add imagenet benchmark to lightly/benchmarks/imagenet/vit/ijepa.py

Will work on them in next prs

@Natyren
Copy link
Contributor

Natyren commented Jul 15, 2023

#1322 reference point to this issue

  • Move positional embedding functions (_get_1d_sincos_pos_embed_from_grid etc.) to lightly/models/utils and add reference to source

@Natyren
Copy link
Contributor

Natyren commented Jul 17, 2023

Next, i will work on implementation of this

  • Check if we can refactor IJEPAMaskCollator into a transform, comment.

UPDATE: on pause

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: No status
Development

No branches or pull requests

2 participants