[WIP] AWQ initial implementation #92

rahul-tuli · 2024-08-15T17:43:27Z

SUMMARY:
"please provide a brief summary"

TEST PLAN:
"please outline how the changes were tested"

qingquansong · 2024-09-12T20:43:36Z

Hey @rahul-tuli , any timeline about when AWQ will be shipped and merged? Thank you!

kylesayrs · 2024-09-15T14:31:11Z

src/llmcompressor/modifiers/quantization/awq/base.py

+        :param state: state to run AWQ on
+        :return: True on a successful run, False otherwise
+        """
+        if self.end and self.end != -1:


Currently user can pass self.end == 0

Suggested change

if self.end and self.end != -1:

if not (self.end is None or self.end == -1):

kylesayrs · 2024-09-15T14:43:52Z

src/llmcompressor/modifiers/quantization/awq/base.py

+            balance_layers = mapping.balance_layers
+
+            activations = self.scales_[mapping.smooth_name].inps
+            module2inspect = smooth_layer


Not sure why there are two variable names

kylesayrs · 2024-09-15T14:52:04Z

src/llmcompressor/modifiers/quantization/awq/base.py

+            w_mean = w_scale.mean(0)
+
+            # [STEP 2]: Compute per-channel mean of the input activation with chunking
+            # move inp to cpu to avoid memory leak


What operation causes the memory leak? Should the tensor be moved back to gpu after?

rahul-tuli · 2024-09-18T14:19:26Z

Closing as all the individual pieces; are now broken down into separate stacked diffs in #181

initial implementation

ae29d15

kylesayrs reviewed Sep 15, 2024

View reviewed changes

rahul-tuli closed this Sep 18, 2024

markmc pushed a commit to markmc/llm-compressor that referenced this pull request Nov 13, 2024

update default symmetry to True on presets (vllm-project#92)

60e4562

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] AWQ initial implementation #92

[WIP] AWQ initial implementation #92

rahul-tuli commented Aug 15, 2024

qingquansong commented Sep 12, 2024

kylesayrs Sep 15, 2024 •

edited

Loading

kylesayrs Sep 15, 2024

kylesayrs Sep 15, 2024

rahul-tuli commented Sep 18, 2024

	if self.end and self.end != -1:
	if not (self.end is None or self.end == -1):

[WIP] AWQ initial implementation #92

[WIP] AWQ initial implementation #92

Conversation

rahul-tuli commented Aug 15, 2024

qingquansong commented Sep 12, 2024

kylesayrs Sep 15, 2024 • edited Loading

Choose a reason for hiding this comment

kylesayrs Sep 15, 2024

Choose a reason for hiding this comment

kylesayrs Sep 15, 2024

Choose a reason for hiding this comment

rahul-tuli commented Sep 18, 2024

kylesayrs Sep 15, 2024 •

edited

Loading