Fix quantized cache output (huggingface#31143)
SunMarc authored and zucchini-nlp committed Jun 11, 2024
1 parent d1dafb9 commit 03fa725
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tests/quantization/quanto_integration/test_quanto.py
@@ -448,7 +448,7 @@ class QuantoKVCacheQuantizationTest(unittest.TestCase):
     def test_quantized_cache(self):
         EXPECTED_TEXT_COMPLETION = [
             "Simply put, the theory of relativity states that 1) the speed of light is the same for all observers, and 2) the laws of physics are the same for all observers.\nThe first part of the theory of relativity",
-            "My favorite all time favorite condiment is ketchup. I love it on everything. I love it on my eggs, my fries, my burgers, my hot dogs, my sandwiches, my salads, my chicken, my fish",
+            "My favorite all time favorite condiment is ketchup. I love it on everything. I love it on my eggs, my fries, my chicken, my burgers, my hot dogs, my sandwiches, my salads, my p",
         ]
 
         prompts = [