Expose alpha and theta type for parametrized activations #1069
Conversation
Note: this PR is needed to fix

```python
def _infer_par_act_precision(self, node, types_to_infer):
    inferred_types = []

    # for now, only set if for threshold relu
```
Why not ELU while we're at it?
I left them at the default values, though you can configure them manually. Is there a better choice?
I agree with that; my comment was mostly that the comment in the code suggests this is only applicable to thresholded relu, while it actually works for three layers (leaky, thresholded, and elu).
I can modify the comment to say that the others are left at the default precision.
I think the idea of a configurable threshold is good in principle. However, this way we risk an inconsistency between parametrized activations: thresholded relu gets the same type as the input (which is the current behavior anyway), but leaky and elu get the default (which may differ from the input to the current layer), and prelu is not considered at all, so it gets the default that way. Why not handle all of them so the behavior is consistent (whatever that is: input precision for continuity, or default precision)? Also, why use the input precision when we could be smart and, since we have access to the activ_param, figure out exactly how many bits are needed to represent that number/array?
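A minimal sketch of that idea in plain Python (the helper name and the fractional-bit cap are made up for illustration, not hls4ml API): given a scalar parameter such as a LeakyReLU alpha or a ThresholdedReLU theta, find the smallest fixed-point split that stores it exactly.

```python
import math

def bits_for_scalar(value, max_frac_bits=32):
    """Smallest <int_bits, frac_bits> fixed-point split that stores `value`
    exactly, capped at `max_frac_bits` (hypothetical helper, not hls4ml API)."""
    if value == 0:
        return 1, 0
    # integer bits: enough to cover the magnitude (sign handled separately)
    int_bits = max(1, math.floor(math.log2(abs(value))) + 1)
    # fractional bits: grow until scaling by 2**frac_bits yields an integer
    frac_bits = 0
    while frac_bits < max_frac_bits and (value * (1 << frac_bits)) % 1 != 0:
        frac_bits += 1
    return int_bits, frac_bits

print(bits_for_scalar(0.25))  # (1, 2): a LeakyReLU alpha of 0.25 needs 2 fractional bits
print(bits_for_scalar(1.0))   # (1, 0): a ThresholdedReLU theta of 1.0 needs none
print(bits_for_scalar(0.3))   # hits the cap, since 0.3 is not exact in binary
```

A value like 0.3 never becomes exact in binary, which is why some cap on the fractional bits (at most matching the float mantissa) is needed.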
They have a fundamentally different function. The threshold is compared against the input values, so it makes sense for it to have the same type as the input. One can tune it if preferred, but there is an implicit connection with the input type: if the input type logically had units, the threshold would have the same units. The others are scaling factors, so they have no connection to the input type at all, and making them the same as the input type just doesn't make sense. I did consider adding prelu to the match on line 87, but then _infer_par_act_precision would just ignore it and set the default anyway, so I didn't add it to the match. But I am not fundamentally opposed to that.
I think that for threshold comparisons the same type logically makes sense, so it seems like a good setting, and I believe it is better than the default. For scaling factors we have no guidance, so the default precision makes sense.
But the suggestion about looking at the activ_param is interesting. I have to think about it. One can set the range, but not necessarily the number of bits.
For a scale (and threshold) that is a scalar, the guidance is simple: it's a number for which we can figure out the best precision to store (without using too many fractional bits to match the original stored in float). It does get trickier for PReLU, where we would need a smarter way to decide what precision suits most of the array. I understand your logic for the current behavior. I'm fine with merging this as-is and doing a "smarter" way as a follow-up.
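For the PReLU array case, one rough heuristic (again a sketch with made-up names and knobs, not hls4ml code) could pick a single format that reproduces most of the learned alphas within some tolerance:

```python
import numpy as np

def bits_for_prelu_alphas(alphas, coverage=0.95, rel_tol=1e-3, max_frac_bits=16):
    """One <int_bits, frac_bits> split for a whole alpha array: integer bits
    cover the largest magnitude, fractional bits are the fewest for which
    `coverage` of the elements are reproduced within `rel_tol`."""
    a = np.asarray(alphas, dtype=np.float64)
    largest = max(float(np.max(np.abs(a))), 1e-12)
    int_bits = max(1, int(np.floor(np.log2(largest))) + 1)
    for frac_bits in range(max_frac_bits + 1):
        scale = float(1 << frac_bits)
        q = np.round(a * scale) / scale
        close = np.abs(q - a) <= rel_tol * np.maximum(np.abs(a), 1e-12)
        if np.mean(close) >= coverage:
            return int_bits, frac_bits
    return int_bits, max_frac_bits

print(bits_for_prelu_alphas(np.full(8, 0.25)))  # (1, 2): exactly representable alphas
```

The coverage and tolerance knobs are where the "what suits most of the array" judgment would live.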
Actually, I think all activations can be updated. The output type can be unsigned for relu (but otherwise matching the input type), restricted in range for sigmoid and tanh, etc. It may be good to have another pull request that does precision propagation for all activations.
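A sketch of the rules such a propagation pass could encode (hypothetical function and conventions, with integer bits counted without the sign bit; not an existing hls4ml optimizer):

```python
def propagate_activation_type(activation, in_signed, in_int_bits, in_frac_bits):
    """Output-type rules suggested above: relu drops the sign but keeps the
    magnitude, sigmoid is bounded to (0, 1), tanh to (-1, 1); anything else
    keeps the input type."""
    if activation == 'relu':
        return False, in_int_bits, in_frac_bits   # non-negative, same range
    if activation == 'sigmoid':
        return False, 0, in_frac_bits             # values in (0, 1): no integer bits
    if activation == 'tanh':
        return True, 0, in_frac_bits              # values in (-1, 1): sign bit only
    return in_signed, in_int_bits, in_frac_bits
```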
example-models (outdated)
Technically, this got in by accident from the QONNX PR
It was not supposed to include that; I thought I was being careful. I had updated example-models in my work area, and I am not sure if they are the final ones, either. I can try to recreate this without the example-models change if you prefer.
I think it is fine to leave it in. We'll update that pointer anyway.
Description
The types of the parameters for parametrized activations were not configurable before, being set the same as either the input or the result. Generally there is no reason for that requirement, so this makes the types explicitly configurable.
Type of change
Tests
The standard activation tests (slightly modified) provide the test coverage.
Checklist
I have run `pre-commit` on the files I edited or added.