Update code and README #122

yanbing-j · 2024-05-31T02:07:12Z

This PR is to update README of PyTorch nightly version, and update some codes related to CPU support.

We have optimized attn_mask in SDPA with block-wise algorithm in pytorch/pytorch#126961. It can reduce memory peak usage, and have some performance improvements as below.

Performance

Single socket in Intel (R) Xeon (R) CPU Max 9480
Dtype: bfloat16, models: vit_b and vit_h, test in SDPA and Triton commit https://github.com/pytorch-labs/segment-anything-fast/blob/main/experiments/run_experiments.py#L199-L200, select the time of 20th iteration.

	vit_b		vit_h
	attn_mask w/o block-wise	attn_mask w/ block-wise	attn_mask w/o block-wise	attn_mask w/ block-wise
SDPA	10.95s/it	6.59s/it	19.93s/it	12.33s/it
Triton	10.66s/it	7.12s/it	19.87s/it	12.26s/it

yanbing-j · 2024-05-31T02:08:59Z

Hi @cpuhrsch , could you please take a look? Thanks!

cpuhrsch · 2024-05-31T22:55:42Z

experiments/eval_combo.py

+        total_memory = psutil.virtual_memory().total
+        max_memory_allocated_bytes = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
+        max_memory_allocated_percentage = int(100 * (max_memory_allocated_bytes / (total_memory >> 10)))
+        max_memory_allocated_bytes = max_memory_allocated_bytes >> 10

    if print_header:
        print(",".join(["sam_model_type", "batch_size", "memory(MiB)", "memory(%)", "img_s(avg)", "batch_ms(avg)/batch_size", "mIoU", "use_compile",


Is there something you could add to indicate the device? Seems like this needs a new column to track the device used.

Yes, of course. I just add device at the front.

cpuhrsch

Please see comment

Update code and README

6d3dab4

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 31, 2024

cpuhrsch reviewed May 31, 2024

View reviewed changes

cpuhrsch approved these changes May 31, 2024

View reviewed changes

Add the new column to track the device

9e2e345

cpuhrsch merged commit 3e9c47d into pytorch-labs:main Jun 3, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update code and README #122

Update code and README #122

yanbing-j commented May 31, 2024

yanbing-j commented May 31, 2024

cpuhrsch May 31, 2024

yanbing-j Jun 3, 2024

cpuhrsch left a comment

Update code and README #122

Update code and README #122

Conversation

yanbing-j commented May 31, 2024

Performance

yanbing-j commented May 31, 2024

cpuhrsch May 31, 2024

Choose a reason for hiding this comment

yanbing-j Jun 3, 2024

Choose a reason for hiding this comment

cpuhrsch left a comment

Choose a reason for hiding this comment