hls4ml Optimization API [Part 2] #809

Merged

Conversation

bo3z
Contributor

@bo3z bo3z commented Jun 13, 2023

Description

  • Second part of the hls4ml Optimization API: hls4ml Optimization API [Part 1] #768
  • Introduces Dense Unrolled layers, optimising away multiplications by zero in the Resource strategy with RF > 1 (see the sketch after this list)
  • Introduces additional TCL scripts to optimise away all-zero BRAM blocks.
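
For illustration, a minimal Python sketch of the idea behind the unrolled kernel: with the weights fixed at synthesis time, multiplications by zero weights can be skipped outright. This is a conceptual sketch only (names are hypothetical), not the generated HLS code:

import numpy as np

def dense_unrolled(x, weights, bias):
    # Conceptual model of the unrolled kernel: visit only non-zero
    # weights, so multiplications by zero never happen.
    y = bias.copy()
    for (i, j), w in np.ndenumerate(weights):
        if w != 0:  # in the HLS version this check is resolved at code-generation time
            y[j] += x[i] * w
    return y

# Example: a pruned 4x2 weight matrix with most entries zero
x = np.array([1.0, 2.0, 3.0, 4.0])
W = np.array([[0.5, 0.0], [0.0, -1.0], [0.0, 0.0], [2.0, 0.0]])
b = np.zeros(2)
print(dense_unrolled(x, W, b))  # [ 8.5 -2. ]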

Type of change

Tests

  • Added a new test, test_dense_unrolled, that verifies the dense resource layers that avoid zero multiplications produce correct results (a sketch of such a test follows this list)
  • A comparison with the "standard" Dense Resource will shortly be available in the (updated) PR hls4ml Optimization API [Part 1] #768.
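
As a rough illustration, a test along these lines could use the public hls4ml Keras conversion API. This is a hedged sketch: the strategy name 'Unrolled' and the tolerance are assumptions, not the exact contents of test_dense_unrolled:

import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense
import hls4ml

# Small dense model with some weights zeroed out, as after pruning
model = Sequential([Dense(8, input_shape=(16,), activation='relu')])
w, b = model.layers[0].get_weights()
w[np.abs(w) < 0.1] = 0  # crude sparsification for the example
model.layers[0].set_weights([w, b])

config = hls4ml.utils.config_from_keras_model(model, granularity='model')
config['Model']['Strategy'] = 'Unrolled'  # assumed strategy name
config['Model']['ReuseFactor'] = 4        # RF > 1 exercises the new kernel

hls_model = hls4ml.converters.convert_from_keras_model(
    model, hls_config=config, output_dir='test_dense_unrolled_prj'
)
hls_model.compile()

X = np.random.rand(100, 16)
np.testing.assert_allclose(hls_model.predict(X), model.predict(X), atol=0.05)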

Checklist

  • I have read the guidelines for contributing.
  • I have commented my code, particularly in hard-to-understand areas.
  • I have made corresponding changes to the documentation.
  • My changes generate no new warnings.
  • I have installed and run pre-commit on the files I edited or added.
  • I have added tests that prove my fix is effective or that my feature works.

@bo3z
Contributor Author

bo3z commented Jun 13, 2023

I will add pre-commit separately; the last time I ran it, some tests were broken, so I will add it in a subsequent commit.

@bo3z bo3z requested a review from vloncar June 13, 2023 20:37
@jmduarte jmduarte added this to the v0.8.0 milestone Jun 15, 2023
@bo3z
Contributor Author

bo3z commented Jun 16, 2023

This is ready for review. It seems that pre-commit can re-arrange the order of includes in C++ header files, which could cause a compilation error.

@bo3z bo3z force-pushed the hls4ml-optimization-api-part-2 branch from 20ed996 to 0f0adc4 on June 16, 2023 11:36
@bo3z bo3z mentioned this pull request Jun 16, 2023
@jmitrevs jmitrevs modified the milestones: v0.8.0, v1.0.0 Oct 20, 2023
@jmitrevs jmitrevs added the please test Trigger testing by creating local PR branch label Jan 8, 2024
@jmitrevs
Contributor

jmitrevs commented Feb 7, 2024

We merged part 1. Should we merge part 2?

@vloncar
Contributor

vloncar commented Feb 7, 2024

I'm reviewing it. Slowly 😃. But it's next in line, then HGQ.

@jmitrevs jmitrevs added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels May 3, 2024
@jmitrevs
Contributor

jmitrevs commented May 3, 2024

The pytest error is unrelated to the PR, so from my side this can be merged. I'll let Vladimir give the final OK.

Contributor

Why switch the default implementation to Verilog here? What's the impact of the added optimization on firmware performance and synthesis time?

Contributor Author

@bo3z bo3z Sep 4, 2024

There shouldn't be any noticeable impact. There are two reasons this change was made:

  1. When you use VHDL with the Resource strategy, the synthesis report gives 0 BRAM (which cannot be the case, especially since Verilog shows the correct BRAM usage). I suspect this has something to do with out-of-context synthesis in Vivado, but I haven't figured out how to enable / disable it. This is explained further in: Vivado synthesis report - zero BRAM utilisation (OOC) #798

  2. It's "functionally" incorrect to do co-simulation / validation with Verilog and synthesis with VHDL (since HLS always generates both formats anyway). Yes, if one is correct and the other is wrong, then it's an issue with the HLS compiler and we can't do much about it. But my reasoning is that we should test and "deploy" the same set of end files.
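
For context, this same-set-of-files reasoning maps onto hls4ml's standard build flags. A minimal sketch, assuming hls_model is an already-converted model (these flags exist on the Vivado backend's build call; the exact combination is just an illustration):

# Synthesize, co-simulate and export in one project, so the RTL that is
# validated (cosim) is the same RTL that is delivered (export / vsynth).
hls_model.build(csim=False, synth=True, cosim=True, export=True, vsynth=True)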

Contributor

Then this should probably go into a separate PR, I guess, as it changes the behavior of the library even when DSP-aware pruning is not used. How about isolating bo3z@e53ec9c into another PR?

newline += indent + '#pragma HLS DATAFLOW \n'

model_cfg = model.config.get_config_value('HLSConfig')['Model']
if (
Contributor

It seems that if unrolled resource conv layers are used, dataflow will not be used. Also, if n_partition is not 1, won't the global II then be larger than the internal dense II (reuse_factor)?
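
To spell out the II concern with a tiny worked example (this relation is one reading of the question, not something stated in the PR):

# If the conv layer is split into n_partition sequential partition calls
# and each internal dense kernel has II = reuse_factor, the top-level II
# is bounded below by their product rather than by reuse_factor alone:
reuse_factor = 4
n_partition = 2
global_ii = n_partition * reuse_factor
print(global_ii)  # 8, i.e. larger than reuse_factor = 4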

@vloncar
Contributor

vloncar commented Aug 25, 2024

This PR was refactored to introduce the new unrolled implementation as a "strategy", an alternative to the existing latency and resource strategies. This allowed the matrix-vector multiplication kernel to be used as a function, simplifying the integration with the rest of the code. The PR also changes the top-level pipeline style pragma, so the config now includes a new "auto" option (the default) which allows the optimizer to choose the best one. All pipeline-style decisions are now made in the new optimizer, instead of being scattered around the HLSConfig class and the backend.

One more minor change may come. Since we will have multiple new strategies and optimization options, it was suggested to give this optimization technique a name and move it to a submodule of that name. Discussion on this is welcome.
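
As a usage sketch of the refactored interface described above (the key names 'Unrolled' and 'PipelineStyle' are illustrative of the new strategy and "auto" option, not authoritative):

import hls4ml
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense

model = Sequential([Dense(8, input_shape=(16,))])  # any Keras model

config = hls4ml.utils.config_from_keras_model(model, granularity='model')
config['Model']['Strategy'] = 'Unrolled'    # the new strategy, alongside Latency / Resource
config['Model']['PipelineStyle'] = 'auto'   # default: let the optimizer choose the pragma
hls_model = hls4ml.converters.convert_from_keras_model(model, hls_config=config)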

@vloncar vloncar added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels Aug 25, 2024
@vloncar
Contributor

vloncar commented Oct 7, 2024

All the comments from the last discussion were addressed (the strategy was renamed and the files moved to a new submodule). It's now ready to be merged.

@JanFSchulte JanFSchulte merged commit 352c124 into fastmachinelearning:main Oct 22, 2024
4 of 5 checks passed