Sam2.1 #4

adam-encord · 2024-12-16T15:27:01Z

No description provided.

sam2/build_sam.py:81:14: TOR102 [*] `torch.load` without `weights_only` parameter is unsafe. Explicitly set `weights_only` to False only if you trust the data you load and full pickle functionality is needed, otherwise set `weights_only=True`. Found with https://github.com/pytorch-labs/torchfix/
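The risk behind this lint is that unrestricted unpickling can call arbitrary functions. The sketch below illustrates the general idea of a restricted loader using only the standard-library `pickle` module. It is an analogy, not PyTorch's implementation of `weights_only=True`; the allowlist and the names `RestrictedUnpickler`, `restricted_loads`, and `Malicious` are hypothetical.

```python
import io
import pickle

# Hypothetical allowlist of globals the loader may resolve (illustration only).
_ALLOWED_GLOBALS = {("collections", "OrderedDict")}


class RestrictedUnpickler(pickle.Unpickler):
    """Unpickler that refuses to resolve any global outside the allowlist."""

    def find_class(self, module, name):
        if (module, name) in _ALLOWED_GLOBALS:
            return super().find_class(module, name)
        raise pickle.UnpicklingError(f"global '{module}.{name}' is forbidden")


def restricted_loads(data: bytes):
    """Load pickled bytes while rejecting non-allowlisted globals."""
    return RestrictedUnpickler(io.BytesIO(data)).load()


class Malicious:
    """Object whose pickle payload asks the loader to call eval()."""

    def __reduce__(self):
        return (eval, ("1 + 1",))


safe_blob = pickle.dumps({"weights": [0.1, 0.2]})  # plain containers only
evil_blob = pickle.dumps(Malicious())              # references builtins.eval
```

With this sketch, `restricted_loads(safe_blob)` returns the dictionary, while `restricted_loads(evil_blob)` raises `pickle.UnpicklingError` before `eval` is ever called.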

Configure a workflow for format checking

…all available kernels if Flash Attention fails (#155)

In this PR, we make building the SAM 2 CUDA extension optional, having observed that many users encounter difficulties with the CUDA compilation step.

1. During installation, we catch build errors and print a warning message. We also allow explicitly turning off the CUDA extension build with `SAM2_BUILD_CUDA=0`.
2. At runtime, we catch CUDA kernel errors from connected components and print a warning that the post-processing step is skipped. We also fall back to all available kernels if the Flash Attention kernel fails.
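The install-time behavior described above can be sketched roughly as follows. This is a minimal illustration, not the repository's `setup.py`: the helper names `cuda_build_enabled` and `build_or_skip` are hypothetical, and treating any value other than `"0"` as "enabled" is an assumption.

```python
import os
import warnings


def cuda_build_enabled(environ=None):
    """Return False only when SAM2_BUILD_CUDA is explicitly set to "0" (assumed parsing)."""
    env = os.environ if environ is None else environ
    return env.get("SAM2_BUILD_CUDA", "1") != "0"


def build_or_skip(build_fn, environ=None):
    """Run build_fn unless disabled; on failure, warn and continue without the extension."""
    if not cuda_build_enabled(environ):
        return None
    try:
        return build_fn()
    except Exception as exc:  # a build failure should not abort the installation
        warnings.warn(f"Failed to build the CUDA extension; continuing without it: {exc}")
        return None
```

Here `build_fn` stands in for whatever actually constructs the extension; a failed build degrades to a warning rather than a hard installation error.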

This PR suggests, in INSTALL.md, a way to resolve the `unsupported Microsoft Visual Studio version!` error. Adding the `-allow-unsupported-compiler` argument for `nvcc` worked. This requires editing [setup.py](https://github.com/facebookresearch/segment-anything-2/blob/main/setup.py) as follows:

```python
from torch.utils.cpp_extension import CUDAExtension  # import needed by this excerpt


def get_extensions():
    srcs = ["sam2/csrc/connected_components.cu"]
    compile_args = {
        "cxx": [],
        "nvcc": [
            "-DCUDA_HAS_FP16=1",
            "-D__CUDA_NO_HALF_OPERATORS__",
            "-D__CUDA_NO_HALF_CONVERSIONS__",
            "-D__CUDA_NO_HALF2_OPERATORS__",
            "-allow-unsupported-compiler",  # Add this argument
        ],
    }
    ext_modules = [CUDAExtension("sam2._C", srcs, extra_compile_args=compile_args)]
    return ext_modules
```

This refers to facebookresearch/sam2#137, which in turn refers to a common installation problem mentioned in facebookresearch/sam2#19.

Some users may encounter significant trouble installing the project, running into the error `OSError: CUDA_HOME environment variable is not set. Please set it to your CUDA install root.`. Simply adding the `--no-build-isolation` flag to the pip install, e.g. `pip install --no-build-isolation -e .`, usually solves this problem. However, this fix is not mentioned anywhere in the READMEs or the installation troubleshooting docs.

This PR adds this recommendation to the INSTALL.md file under the "My installation failed with `CUDA_HOME environment variable is not set`" section, so that more users are aware of this potential fix.

Examples of users experiencing related difficulties when installing:

- facebookresearch/sam2#19
- facebookresearch/sam2#41
- facebookresearch/sam2#99
- facebookresearch/sam2#133

Not used:

`head_dim = dim_out // num_heads`
`self.scale = head_dim**-0.5`

`F.scaled_dot_product_attention` takes care of this automatically.
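To see why the explicit `scale` attribute is redundant, recall that scaled dot-product attention multiplies the query-key scores by `1 / sqrt(head_dim)` by default. The pure-Python sketch below (a hypothetical helper `sdpa_single_query`, not PyTorch code) computes attention for a single query vector and shows that passing `head_dim**-0.5` explicitly gives the same result as the default.

```python
import math


def sdpa_single_query(q, keys, values, scale=None):
    """Attention output for one query; scale defaults to 1/sqrt(len(q))."""
    if scale is None:
        scale = 1.0 / math.sqrt(len(q))
    # Scaled dot products between the query and every key.
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    # Numerically stable softmax over the scores.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Weighted sum of the value vectors.
    return [sum(w * v[j] for w, v in zip(weights, values)) for j in range(len(values[0]))]


head_dim = 4
q = [1.0, 0.0, 2.0, 0.0]
keys = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 1.0, 0.0]]
values = [[1.0, 2.0], [3.0, 4.0]]

default_out = sdpa_single_query(q, keys, values)
explicit_out = sdpa_single_query(q, keys, values, scale=head_dim**-0.5)
```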

Integrate with Hugging Face

This PR adds an example of providing a box prompt as input to SAM 2 through the `add_new_points_or_box` API (renamed from `add_new_points`, which is kept for backward compatibility). If `box` is provided, we add it as the first two points with labels 2 and 3, followed by the user-provided points (consistent with how SAM 2 is trained). The video predictor notebook `notebooks/video_predictor_example.ipynb` is updated to include segmenting from a box prompt as an example.
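The box-to-points convention described above can be sketched in plain Python. This is an illustration only, not the predictor's code: the helper name `merge_box_and_points` and the `(x0, y0, x1, y1)` box layout are assumptions.

```python
def merge_box_and_points(box, points=None, labels=None):
    """Prepend the two box corners (labels 2 and 3) to the user-provided points.

    box is assumed to be (x0, y0, x1, y1): top-left and bottom-right corners.
    """
    x0, y0, x1, y1 = box
    box_points = [(x0, y0), (x1, y1)]
    box_labels = [2, 3]  # corner labels as described in the PR text
    all_points = box_points + list(points or [])
    all_labels = box_labels + list(labels or [])
    return all_points, all_labels


# One positive click (label 1) combined with a box prompt.
pts, lbls = merge_box_and_points((10, 20, 110, 220), points=[(50, 60)], labels=[1])
```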

…s` package during installation (#176)

ufmt formatting fixed.

remove unused attributes from hieradet.py

…204)

…edictor Fix HF image predictor

…rious typos (#218) (close #217, #66, #67, #69, #91, #126, #127, #145)

SAM2.1 checkpoints + training code + Demo

SAM 2.1

….1 (#338)

…m2` (#341) This PR updates the repo links after we renamed the repo from `segment-anything-2` to `sam2`. It also changes `NAME` in setup.py to `SAM-2` (which is already the name used in pip setup, since Python package names don't allow whitespace).

Use `weights_only` for loading

This small PR adds GPU specification in `docker-compose.yaml` for the SAM 2 interactive webdemo, following https://docs.docker.com/compose/how-tos/gpu-support/#example-of-a-compose-file-for-running-a-service-with-access-to-1-gpu-device. It fixes a GPU access error as reported in facebookresearch/sam2#354. (close facebookresearch/sam2#354)

… parent directory of the sam2 repo (#359) If the user has "sam2/sam2" in their path, they are likely importing the repo itself as "sam2" rather than importing the "sam2" Python package (i.e. the "sam2/sam2" directory). This typically happens because the user is running Python from the parent directory that contains the sam2 repo they cloned. In general, the user should not run Python from the parent directory into which the repo is cloned (the same is true for e.g. the NumPy repo, which contains paths like `numpy/numpy` where the module and the repo have the same name), as the user encountered in facebookresearch/sam2#346. (close facebookresearch/sam2#346)
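A check of this kind can be sketched with the standard library alone. The helper below is hypothetical (not the repository's actual check): it assumes that if the directory resolved as the `sam2` package itself contains a nested `sam2` directory, then the cloned repo, rather than the inner package, was imported.

```python
import os


def looks_like_repo_shadowing(package_dir, name="sam2"):
    """Return True if package_dir contains a nested directory with the package's name.

    A nested "<name>/<name>" layout suggests the repo root was imported as the package.
    """
    return os.path.isdir(os.path.join(package_dir, name))
```

For a real import, `package_dir` would be the first entry of the imported module's `__path__`; when the check returns True, running Python from a different working directory avoids the shadowing.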

[sam2][demo][1/x] Fix file upload

Summary: The Strawberry GraphQL library recently disabled multipart requests by default. This resulted in a video upload request returning "Unsupported content type" instead of uploading the video, processing it, and returning the video path. This issue was raised in #361. A forward fix is to add `multipart_uploads_enabled=True` to the endpoint view.

Test Plan: Tested locally with cURL and upload succeeds

*Request*
```
curl http://localhost:7263/graphql \
  -F operations='{ "query": "mutation($file: Upload!){ uploadVideo(file: $file) { path } }", "variables": { "file": null } }' \
  -F map='{ "file": ["variables.file"] }' \
  -F [email protected]
```

*Response*
```
{"data": {"uploadVideo": {"path": "uploads/<HASH>.mp4"}}}
```
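For readers who prefer to build the request programmatically, the two JSON form fields from the cURL test plan can be assembled as sketched below. The helper name `build_upload_form_fields` is hypothetical, and the file part itself (sent under the `file` key named in `map`) is deliberately left out, so this only shows the shape of the `operations` and `map` fields.

```python
import json


def build_upload_form_fields():
    """Return the 'operations' and 'map' form fields for the uploadVideo mutation."""
    query = "mutation($file: Upload!){ uploadVideo(file: $file) { path } }"
    # The file variable is null here; the multipart 'map' field tells the server
    # which uploaded part should be substituted into it.
    operations = {"query": query, "variables": {"file": None}}
    file_map = {"file": ["variables.file"]}
    return {"operations": json.dumps(operations), "map": json.dumps(file_map)}


fields = build_upload_form_fields()
```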

…edup and a new SAM2VideoPredictor to better handle multi-object tracking (#486)

This PR provides new features and updates for SAM 2:

- We now support `torch.compile` of the entire SAM 2 model on videos, which can be turned on by setting `vos_optimized=True` in `build_sam2_video_predictor` (it uses the new `SAM2VideoPredictorVOS` predictor class in `sam2/sam2_video_predictor.py`).
  * Compared to the previous setting (which only compiles the image encoder backbone), the new full model compilation gives a major speedup in inference FPS.
  * In the VOS prediction script `tools/vos_inference.py`, you can specify this option via the `--use_vos_optimized_video_predictor` flag.
  * Note that turning on this flag might introduce a small variance in the predictions due to numerical differences caused by `torch.compile` of the full model.
  * **PyTorch 2.5.1 is the minimum version for full support of this feature**. (Earlier PyTorch versions might run into compilation errors in some cases.) Therefore, we have updated the minimum PyTorch version to 2.5.1 accordingly in the installation scripts.
- We also update the implementation of the `SAM2VideoPredictor` class for the SAM 2 video prediction in `sam2/sam2_video_predictor.py`, which allows for independent per-object inference. Specifically, in the new `SAM2VideoPredictor`:
  * Now **we handle the inference of each object independently** (as if we are opening a separate session for each object) while sharing their backbone features.
  * This change allows us to relax the assumption of prompting for multi-object tracking. Previously (due to the batching behavior in inference), if a video frame received clicks for only a subset of objects, the rest of the (non-prompted) objects were assumed to be non-existent in this frame (i.e., in such frames, the user was telling SAM 2 that the rest of the objects don't appear). Now, if a frame receives clicks for only a subset of objects, we do not make any assumptions about the remaining (non-prompted) objects (i.e., each object is now handled independently and is not affected by how other objects are prompted). As a result, **we allow adding new objects after tracking starts** after this change (which was previously a restriction on usage).
  * We believe that the new version is a more natural inference behavior and therefore switched to it as the default behavior. The previous implementation of `SAM2VideoPredictor` is backed up in `sam2/sam2_video_predictor_legacy.py`. All the VOS inference results using `tools/vos_inference.py` should remain the same after this change to the `SAM2VideoPredictor` class.
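The change in prompting semantics can be made concrete with a small sketch. This is only a model of the behavior described above, not the predictor's code; the function name `objects_assumed_absent` and its arguments are hypothetical.

```python
def objects_assumed_absent(all_obj_ids, prompted_obj_ids, legacy):
    """Return the objects treated as absent on a frame that received prompts.

    legacy=True models the previous batched behavior: every tracked object without
    a prompt on this frame is assumed not to appear in it.
    legacy=False models the new per-object behavior: no assumption is made.
    """
    if legacy:
        return set(all_obj_ids) - set(prompted_obj_ids)
    return set()


tracked = [1, 2, 3]
clicked = [2]  # only object 2 receives clicks on this frame
old_behavior = objects_assumed_absent(tracked, clicked, legacy=True)
new_behavior = objects_assumed_absent(tracked, clicked, legacy=False)
```

Under the legacy model, objects 1 and 3 are treated as absent on that frame; under the new model, nothing is assumed about them.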

…eoPredictor` (#490) This PR adds a patch for the case of `offload_state_to_cpu=True` where `pred_masks` might have been offloaded to the CPU device (close facebookresearch/sam2#489)

…r in MPS (#495) In this PR, we remove `.pin_memory()` in `obj_pos` of `SAM2Base` to resolve an error in MPS. Investigations show that `.pin_memory()` causes an error of `Attempted to set the storage of a tensor on device "cpu" to a storage on different device "mps:0"`, as originally reported in facebookresearch/sam2#487. (close facebookresearch/sam2#487)

# Conflicts:
#	sam2/modeling/sam2_base.py
#	sam2/sam2_video_predictor.py
#	setup.py

kit1980 and others added 30 commits July 29, 2024 16:54

First draft

b72a8a9

Use classmethod

17b7450

Add model_id_to_filenames

3af4e82

Do not load config from the hub

0c28c63

Make huggingface_hub soft dependency

6aeee34

Update links

cb48213

Update README

e93be7f

Update docstring

841cc1f

Add workflow

acd3939

Update workflow

3b0fd9e

Merge pull request #1 from haithamkhedr/CI

5e3d6ca

Merge pull request #152 from haithamkhedr/main

0230c5f

Include original code snippet

c3393d8

Move HF to separate section

e9503c9

Add link

fbf7e3a

Address comment

e815f70

Clean up

a36edf1

Update README

27a167c

Format using ufmt

8f15c62

Revert code snippet

322aa3e

Update docstrings

43c385c

Update hieradet.py

6ec8560

Address comment

9b58611

Merge pull request #128 from NielsRogge/add_hf

6ba4c65

Merge branch 'main' into patch-1

086daf0

ronghanghu and others added 29 commits August 8, 2024 11:03

add Colab support to the notebooks; pack config files in `sam2_config…

d421e0b

Update hieradet.py

46945a2

Merge branch 'main' into patch-1

8f607e2

Merge pull request #167 from arun477/patch-1

778e112

better support for non-CUDA devices (CPU, MPS) (#192)

1034ee2

improving warning message and adding further tips for installation (#…

dce7b54

Fix HF image predictor

1191677

accept kwargs in auto_mask_generator

fd5125b

Merge pull request #205 from facebookresearch/haitham/fix_hf_image_pr…

0db838b

open README.md with unicode (to support Hugging Face emoji); fix va…

7e1596c

Merge branch 'main' into patch-1

0f6515a

SAM2.1

aa9b872

Merge pull request #335 from facebookresearch/sam2.1

3a7889d

minor update README.md

429a2c7

[docs] add a release note and new installation instructions for SAM 2…

05d9e57

Merge pull request #2 from kit1980/patch-1

52198ea

Add MANIFEST.in (#353)

8bf0920

remove unused paths (#384)

c2ec8e1

patch for the case of offload_state_to_cpu=True in the new `SAM2Vid…

722d1d1

In memory operations + adjustments

7bd604e

Merge branch 'main' into aj/sam2.1

90c4473

Formatting

37767a3

adam-encord closed this Dec 16, 2024
