Fix ORT CI #1875
Conversation
One test is still failing during generation with an old model.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
ORT tests are passing locally; the issue with broadcasting was solved.
Do you know where this could come from?
@@ -356,62 +356,45 @@ def quantize(
)

quantizer_factory = QDQQuantizer if use_qdq else ONNXQuantizer
# TODO: maybe this logic can be moved to a method in the configuration class (get_ort_quantizer_kwargs())
But the config should not be aware of the ORTQuantizer class, right?
Yes, the quant config already contains everything and can infer which quantizer will use its kwargs (from format and is_static).
optimum/optimum/onnxruntime/quantization.py
Line 309 in f300865
use_qdq = quantization_config.is_static and quantization_config.format == QuantFormat.QDQ
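For illustration, a minimal sketch of the idea discussed above: the config itself selects the quantizer class from format and is_static, using the same condition as the use_qdq line referenced here. The class shape and method name below are assumptions for illustration only; the TODO's get_ort_quantizer_kwargs() would presumably build the quantizer kwargs along the same lines. Not the merged implementation.

# Hypothetical sketch, not the merged implementation: the quantization
# config infers the ORT quantizer class itself, so ORTQuantizer.quantize()
# no longer needs to know about QDQQuantizer / ONNXQuantizer directly.
from onnxruntime.quantization import QuantFormat
from onnxruntime.quantization.onnx_quantizer import ONNXQuantizer
from onnxruntime.quantization.qdq_quantizer import QDQQuantizer

class QuantizationConfig:
    def __init__(self, is_static: bool, format: QuantFormat):
        self.is_static = is_static
        self.format = format

    def get_ort_quantizer_class(self):
        # Static QDQ-format quantization uses the QDQ quantizer; anything
        # else falls back to the generic ONNX quantizer (the same condition
        # as the use_qdq line above).
        use_qdq = self.is_static and self.format == QuantFormat.QDQ
        return QDQQuantizer if use_qdq else ONNXQuantizer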
Co-authored-by: Michael Benayoun <[email protected]>
…nto fix-ort-ci
LGTM
Co-authored-by: Ella Charlaix <[email protected]>
pytest -n auto -m "not run_in_series" --durations=0 -vs onnxruntime
pytest -m "run_in_series" --durations=0 onnxruntime
I changed the order here and started seeing new errors on Windows related to input dtype that I hadn't seen before.
I just noticed that, depending on the OS, errors propagate into the workflow differently:
- on Linux-based runners (Ubuntu), this runs the first command and exits with a non-zero code if it fails;
- on Windows-based runners, this runs the first command and then the second, whether the first succeeds or fails, and only checks the exit code of the last one.
Instances:
- this is a Windows run that failed on the first command, ran the second anyway, and reported a success
- this is an Ubuntu run (from the same event) that failed on the first and exited directly
This is probably due to a difference between bash and PowerShell (see the workflow sketch after this comment).
@echarlaix @michaelbenayoun @JingyaHuang @mht-sharma @regisss @fxmarty
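One way to get the same fail-fast behavior on both runners, sketched below under the assumption that forcing bash is acceptable on the Windows runners: GitHub Actions invokes bash steps with the -e (errexit) flag, so the first failing command aborts the step, while the default PowerShell wrapper only surfaces the exit code of the last command. The step name is illustrative.

- name: Run ONNX Runtime tests
  # Sketch: force bash on every OS so the first failing pytest command
  # fails the step. GitHub Actions runs bash steps with -e (errexit),
  # whereas the default Windows shell (PowerShell) only propagates the
  # exit code of the last command in the script.
  shell: bash
  run: |
    pytest -n auto -m "not run_in_series" --durations=0 -vs onnxruntime
    pytest -m "run_in_series" --durations=0 onnxruntime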
- name: Free Disk Space (Ubuntu)
  if: matrix.os == 'ubuntu-20.04'
  uses: jlumbroso/free-disk-space@main
  with:
    tool-cache: false
    swap-storage: false
    large-packages: false
Why this change?
I asked in #1875.
It takes half the time (2-3 min -> 1 min) and shows how much space was freed at each step. Either way, we won't need it once we start using intel-cpu runners.
nice
)
outputs = model.generate(**tokens, num_beams=1, do_sample=False, min_new_tokens=30, max_new_tokens=30)
self.assertTrue(torch.allclose(outputs_onnx, outputs))
outputs = model.generate(**tokens, num_beams=1, do_sample=False, min_new_tokens=10, max_new_tokens=10)
Why reduce the number of new tokens?
I was trying to figure out where the failure was coming from and forgot to reset it to 30. Will do that in the Windows PR.
What does this PR do?
Fixes ORT CI failures caused by the onnxruntime 1.18 release.
Before submitting
Who can review?