
update test_run_compressed #970

Draft · wants to merge 4 commits into base: main

Conversation

@horheynm (Collaborator) commented Dec 11, 2024

Contingent on merge of huggingface/transformers#34719

SUMMARY:
Update the run_compressed tests from decompression tests to run_compressed tests: verify that models loaded with run_compressed=True and run_compressed=False generate the same output.

Add decompress tests that copy attrs from the source dir path's model to the target model.
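
For context, here is a minimal sketch of that comparison, assuming the `CompressedTensorsConfig(run_compressed=...)` support added by huggingface/transformers#34719; the model stub, prompt, and generation settings are placeholders rather than the actual test values.

```python
# Hypothetical sketch: load the same quantized checkpoint twice, once keeping
# weights compressed (decompressed on the forward pass via compressed linear)
# and once decompressed up front, then compare greedy generations.
from transformers import AutoModelForCausalLM, AutoTokenizer, CompressedTensorsConfig

MODEL_STUB = "org/some-compressed-model"  # placeholder checkpoint path

tokenizer = AutoTokenizer.from_pretrained(MODEL_STUB)
inputs = tokenizer("Why did the sandwich cross the road?", return_tensors="pt")

compressed_model = AutoModelForCausalLM.from_pretrained(
    MODEL_STUB,
    torch_dtype="auto",
    quantization_config=CompressedTensorsConfig(run_compressed=True),
)
decompressed_model = AutoModelForCausalLM.from_pretrained(
    MODEL_STUB,
    torch_dtype="auto",
    quantization_config=CompressedTensorsConfig(run_compressed=False),
)

out_compressed = compressed_model.generate(**inputs, max_new_tokens=32, do_sample=False)
out_decompressed = decompressed_model.generate(**inputs, max_new_tokens=32, do_sample=False)

# Same checkpoint either way, so the decoded outputs are expected to match
# (the real test may compare within a tolerance instead of exactly).
assert tokenizer.decode(out_compressed[0]) == tokenizer.decode(out_decompressed[0])
```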


👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review.

@dsikka (Collaborator) commented Dec 11, 2024

The generations compared are decompressed up front vs. decompressed on the forward pass (i.e. compressed linear), so they should be close within a tolerance.
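
To make "close within a tolerance" concrete, a hedged follow-on to the sketch above (reusing its hypothetical `compressed_model`, `decompressed_model`, and `inputs`) could compare logits instead of decoded text:

```python
import torch

# Decompress-on-forward (compressed linear) and decompress-up-front should
# agree up to small numerical noise, so compare logits with a tolerance.
with torch.no_grad():
    logits_compressed = compressed_model(**inputs).logits
    logits_decompressed = decompressed_model(**inputs).logits

assert torch.allclose(logits_compressed, logits_decompressed, atol=1e-2, rtol=1e-2)
```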

@dsikka (Collaborator) left a comment


I think you've changed the goal of the test, which is compressed linear vs. decompressing the whole model. We still want run_compressed to be True for one of the models.

@horheynm (Collaborator, Author) commented

> I think you've changed the goal of the test, which is compressed linear vs. decompressing the whole model. We still want run_compressed to be True for one of the models.

Honestly, I don't know what the original test was doing. If you want to add a compressed-linear, decompressed model and its model.generate, that's cool.

The original test compares a quantized model against an untouched base model; that will always fail.

@dsikka (Collaborator) commented Dec 11, 2024

> I think you've changed the goal of the test, which is compressed linear vs. decompressing the whole model. We still want run_compressed to be True for one of the models.

> Honestly, I don't know what the original test was doing. If you want to add a compressed-linear, decompressed model and its model.generate, that's cool.

> The original test compares a quantized model against an untouched base model; that will always fail.

1. We load a compressed model:

   `cls.compressed_model = AutoModelForCausalLM.from_pretrained(`

2. We create an empty model to hold the decompressed weights:

   `cls.uncompressed_model = AutoModelForCausalLM.from_pretrained(`

3. We then use lines 42-50 to decompress the whole model, using the empty model as the skeleton. This loads the same checkpoint as the one used by the compressed model.

   `config = AutoConfig.from_pretrained(cls.model_stub)`
We then run generations and compare the two cases. You're comparing decompressed weights in both cases; one is just decompressing on the forward pass. It's the same checkpoint in both cases, which is why this test passes.

We just want to update the test so that the uncompressed model can be decompressed using your new transformers changes.
The goal of the test is that compressed-linear decompression gives the same result as decompressing the entire model up front.
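
A rough sketch of that flow, assuming the compressed-tensors `ModelCompressor` API (the stub name is a placeholder and the exact calls in the test may differ):

```python
# Hypothetical outline of the decompression path described above.
from compressed_tensors.compressors import ModelCompressor
from transformers import AutoConfig, AutoModelForCausalLM

model_stub = "org/some-compressed-model"  # placeholder checkpoint path

# 1. Load the compressed model (weights stay compressed, decompressed on forward).
compressed_model = AutoModelForCausalLM.from_pretrained(model_stub, torch_dtype="auto")

# 2. Create an empty skeleton model with the same architecture.
config = AutoConfig.from_pretrained(model_stub)
uncompressed_model = AutoModelForCausalLM.from_config(config)

# 3. Decompress the same checkpoint into the skeleton up front.
compressor = ModelCompressor.from_pretrained(model_stub)
if compressor is not None:
    compressor.decompress(model_stub, uncompressed_model)

# 4. Generate with both models and compare; both read the same checkpoint,
#    so the outputs should agree within tolerance.
```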

@horheynm (Collaborator, Author) commented Dec 11, 2024

Ok, I see. We should update the test name; run_compressed_configs is confusing.

@dsikka (Collaborator) commented Dec 11, 2024

> Ok, I see. We should update the test name; test_run_config is confusing.

Yeah, because of the flag/arg name. We can call it test_compressed_linear_decompress or something.

@horheynm (Collaborator, Author) commented

/ready

@dsikka dsikka marked this pull request as draft December 12, 2024 17:00