Add Florence2 support #31506
Conversation
Command to check the diff from the original implementation:

```shell
FILE=configuration_florence2.py
git diff --no-index <(curl -s https://huggingface.co/microsoft/Florence-2-large/raw/main/$FILE) src/transformers/models/florence/$FILE
```
@D4ve-R Let us know when it's ready for review!
hey @amyeroberts, I'm almost done; I just have to finish writing the test cases.
Can you take a look at the code in `src/transformers/models/florence`?
I'm stuck in one place, see the comment in this review; maybe you have some tips or suggestions on how to fix it. Thank you!
Edit: I resolved all my questions
I left some comments since I'm very interested in this PR! Overall it seems to be going in the right direction :)
We can also add some `# Copied from` logic for function and class definitions, e.g.:

`# Copied from transformers.models.deformable_detr.modeling_deformable_detr.MultiScaleDeformableAttentionFunction`
I linked GroundingDino since it handles a similar task.
I'm not quite sure what you mean. Could you point out where you want to include this in the code?
Some function and class definitions can be copied directly from another model in the repo. For example:

`# Copied from transformers.models.beit.modeling_beit.BeitDropPath with Beit->GroundingDino`

We can use DropPath the same way in this file with the `# Copied from` logic. This annotation lets `make fix-copies` reuse the definition of an existing method.
BTW, all the functions are named independently of Florence2. I'm not sure this is okay? (e.g. DropPath -> Florence2DropPath)
> We can use DropPath the same way in this file with the `# Copied from` logic. This annotation lets `make fix-copies` reuse the definition of an existing method.

Ok, now I know what you mean. Thank you for the tip! I changed it for now, but looking at the code in timm, it's slightly different, so I don't know if this is the right thing to do. It might introduce (or persist) already-fixed bugs: https://github.com/huggingface/pytorch-image-models/blob/main/timm/layers/drop.py#L150

> BTW, all the functions are named independently of Florence2. I'm not sure this is okay? (e.g. DropPath -> Florence2DropPath)

Reading a bit more of the transformers library, I think the naming conventions only apply to classes that are meant to be imported and/or when they are copied. Imo prefixing everything with Florence2 is not necessary.
> Ok, now I know what you mean. Thank you for the tip! I changed it for now, but looking at the code in timm, it's slightly different, so I don't know if this is the right thing to do. It might introduce (or persist) already-fixed bugs: https://github.com/huggingface/pytorch-image-models/blob/main/timm/layers/drop.py#L150

The best way is probably to use `is_timm_available` and import DropPath from timm, same as in the original implementation.
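A guarded import along those lines might look like the sketch below. This is not the PR's actual code: `is_timm_available` is a real `transformers` utility, but the fallback stub is a hypothetical placeholder so the module still imports when timm is missing.

```python
import torch.nn as nn

from transformers.utils import is_timm_available

if is_timm_available():
    # Reuse timm's implementation, as the original Florence-2 repo does.
    # (Recent timm versions expose DropPath under timm.layers.)
    from timm.layers import DropPath
else:
    class DropPath(nn.Module):
        # Hypothetical minimal fallback: acts as the identity, so code paths
        # that construct a DropPath still run without timm installed.
        def __init__(self, drop_prob: float = 0.0):
            super().__init__()
            self.drop_prob = drop_prob

        def forward(self, hidden_states):
            return hidden_states
```

Note the maintainers' later comment in this thread argues against this approach in favour of an in-repo class.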
> BTW, all the functions are named independently of Florence2. I'm not sure this is okay? (e.g. DropPath -> Florence2DropPath)

Maybe @amyeroberts or some other maintainers can give us some hints on how to handle this.
Yeah, I was just explaining one of the patterns the repo tries to keep! You don't always have to use the `# Copied from` method if it's not applicable. :)
A few things to unpack here :)
> I think the naming conventions only apply to classes that are meant to be imported and/or when they are copied. Imo prefixing everything with Florence2 is not necessary.
This isn't quite right.
There are certain objects which we import and don't add a prefix or reimplement. If we're implementing any custom class/layer/submodule within the modeling file, then it should be prefixed with the model's name. For example in CLIP, BaseModelOutputWithPooling is used directly as it's a common class. Whereas there are also model-specific outputs which are defined and prefixed with CLIP in their name.
In the case of drop path, we want to define our own class with the Florence2 prefix, instead of importing from timm (we should try to avoid dependencies on other libraries as much as possible).
If the functionality is the same, we should copy from another model like Beit. If it's different we should implement from scratch in this modeling file.
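Concretely, the `# Copied from` route described here could look like the following sketch. The body mirrors Beit's stochastic-depth implementation in transformers; the `Florence2`-prefixed names are the hypothetical renames, since this PR was never merged.

```python
import torch
from torch import nn


# Copied from transformers.models.beit.modeling_beit.drop_path
def drop_path(input: torch.Tensor, drop_prob: float = 0.0, training: bool = False) -> torch.Tensor:
    """Drop paths (stochastic depth) per sample, applied in the main path of residual blocks."""
    if drop_prob == 0.0 or not training:
        return input
    keep_prob = 1 - drop_prob
    # Broadcastable shape (batch, 1, 1, ...) so this works for tensors of any rank.
    shape = (input.shape[0],) + (1,) * (input.ndim - 1)
    random_tensor = keep_prob + torch.rand(shape, dtype=input.dtype, device=input.device)
    random_tensor.floor_()  # binarize: keep (1) or drop (0) each sample's path
    return input.div(keep_prob) * random_tensor


# Copied from transformers.models.beit.modeling_beit.BeitDropPath with Beit->Florence2
class Florence2DropPath(nn.Module):
    """Drop paths (stochastic depth) per sample (when applied in main path of residual blocks)."""

    def __init__(self, drop_prob: float = 0.0) -> None:
        super().__init__()
        self.drop_prob = drop_prob

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        return drop_path(hidden_states, self.drop_prob, self.training)

    def extra_repr(self) -> str:
        return "p={}".format(self.drop_prob)
```

With the `with Beit->Florence2` suffix on the annotation, `make fix-copies` can regenerate the class body from Beit's and keep the two in sync.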
With regards to the `# Copied from` pattern, this is the classical way to utilise modules from other parts of the library.
We've recently introduced a new, improved way of adding models which are very similar to other models in the library, using the "diff converter", which replaces `# Copied from`. This involves defining a `diff_model_name.py` file, which defines the modules that are copied, overridden and newly implemented. The modeling file is then automatically generated.
- PR introducing this, with examples: Diff converter v2 #30868
- Example PR using this: Add LLaVa NeXT Video #31252
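As a rough illustration of what such a diff file might contain (hypothetical: the `diff_florence2.py` name and the choice of `BartEncoderLayer` as a parent are assumptions for this sketch; see the linked PRs for the real mechanics):

```python
# Hypothetical diff_florence2.py sketch. In the diff-converter workflow,
# classes inherit from an existing model's implementation, and a converter
# script expands them into a full standalone modeling file.
from transformers.models.bart.modeling_bart import BartEncoderLayer


class Florence2EncoderLayer(BartEncoderLayer):
    # Empty body: the generated modeling file would contain a copy of
    # BartEncoderLayer's code with the class renamed for Florence2.
    pass
```

Overridden or newly implemented modules are written out in full in the same file; only unchanged ones are left as bare subclasses.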
Thanks for the detailed feedback, this was really helpful!
I'll look into the "diff converter" method. Appreciate the guidance!
It's pretty new, so let us know if it's unclear or there's any unexpected behaviour!
@D4ve-R I'm sharing a script for fine-tuning Florence-2. It might not belong in this PR since it's unofficial, but I'm sharing it for your interest.
@D4ve-R Thanks for all your work on this so far! I think we can close this PR. The Florence 2 model is available directly on the hub to be used within transformers, e.g. https://huggingface.co/microsoft/Florence-2-large. Apologies for not noticing this sooner!
@amyeroberts no worries! It was really fun and a great experience to learn something.
@D4ve-R Hi, even though the PR is closed, I would recommend keeping this branch alive in your repo. It might be valuable when we also consider fine-tuning the Florence2 model! Nice work :)
What does this PR do?
Add support for microsoft/Florence2
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.