Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge devel into main #22

Merged
merged 47 commits into from
Dec 25, 2023
Merged
Changes from 1 commit
Commits
Show all changes
47 commits
Select commit Hold shift + click to select a range
691c01a
Simplify AArch64 model
hanno-becker Dec 3, 2023
518e81a
Don't rename locked registers in SSA transform
hanno-becker Dec 8, 2023
cb99841
Modularize application of parsing callbacks during DFG construction
hanno-becker Dec 8, 2023
38ba498
Add support for fusion callbacks, implement eor+eor->eor3 fusion
hanno-becker Dec 8, 2023
2c31555
Fix div-by-0 issue
hanno-becker Dec 8, 2023
864ac28
Handle low-iteration count when preamble+postamble are >1 iterations
hanno-becker Dec 8, 2023
1be6340
Disable split heuristic during fusion
hanno-becker Dec 8, 2023
7627713
Remove clutter in split heuristic
hanno-becker Dec 8, 2023
23eb8b7
Add selfcheck after split heuristic
hanno-becker Dec 8, 2023
5a04ae0
Move selfcheck and preamble/postamble fixup to result class
hanno-becker Dec 8, 2023
7cb52fa
Introduce class for source lines
hanno-becker Dec 8, 2023
169872d
Add vector ldp/stp to AArch64 model
hanno-becker Dec 10, 2023
98c4cd5
Add `transpose` parent class for trn1 and trn2
hanno-becker Dec 10, 2023
f1f5f35
Simplify preprecessing by naive interleaving
hanno-becker Dec 11, 2023
0712ca6
Use tag for no-unfold
hanno-becker Dec 11, 2023
9273e83
Adust use of `is_virtual` which is now a property
hanno-becker Dec 18, 2023
9421154
Add configuration option controlling address fixup
hanno-becker Dec 18, 2023
dd75877
Add support for `after_last` source annotation
hanno-becker Dec 18, 2023
d267623
Keep line metadata during optimization
hanno-becker Dec 11, 2023
2da51bd
Smaller cleanup in helper.py
hanno-becker Dec 11, 2023
9359311
Drop source line tags by default upon optimization
hanno-becker Dec 12, 2023
c16c459
Some more cleanup
hanno-becker Dec 12, 2023
3fcc154
Further smaller improvements
hanno-becker Dec 13, 2023
8154e95
Cleanup imports and directory structure
hanno-becker Dec 15, 2023
b3339d4
Fix example.py
hanno-becker Dec 15, 2023
9131e4a
Some pylint'ing
hanno-becker Dec 16, 2023
3fc7bb3
More pylint'ing
hanno-becker Dec 17, 2023
b571978
Adjust AArch64 parsing callbacks to addition of source line info
hanno-becker Dec 18, 2023
7699dca
Add some experimental batched AES 'virtual' instructions to AArch64
hanno-becker Dec 8, 2023
4361e87
Add FAQ
hanno-becker Dec 18, 2023
17ab097
Fix parsing bug in AArch64 model for instructions affecting flags
hanno-becker Dec 18, 2023
2e567b0
Adjust x25519-aarch64-simple.s to tag and parsing changes
hanno-becker Dec 18, 2023
564aae3
Merge escaped lines during source code parsing
hanno-becker Dec 18, 2023
66a8caf
Print and keep tags by default
hanno-becker Dec 18, 2023
4b25416
Fix link in FAQ
hanno-becker Dec 19, 2023
cb502a9
Fix init.sh
hanno-becker Dec 19, 2023
dc6c6bb
Work around bug https://github.com/google/or-tools/issues/4027
hanno-becker Dec 19, 2023
4543a87
pages: Add back-pointer from FAQ to index
hanno-becker Dec 19, 2023
10d22be
Add or-tools patch working around build issue
hanno-becker Dec 23, 2023
f5baf3c
Experiment: Simplify x25519 optimization script
hanno-becker Dec 23, 2023
62a7d57
More pylint
hanno-becker Dec 23, 2023
95b52f9
Minor changes to github pages
hanno-becker Dec 23, 2023
40af412
Fix slothy imports in ntt helium script
hanno-becker Dec 22, 2023
dfe347c
Add logo to README
hanno-becker Dec 24, 2023
8d7dce8
Update logo
hanno-becker Dec 24, 2023
896757a
Add OR-Tools dependencies to README and setup-ortools.sh
hanno-becker Dec 25, 2023
5819b7a
Revert "Experiment: Simplify x25519 optimization script"
hanno-becker Dec 25, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Add transpose parent class for trn1 and trn2
  • Loading branch information
hanno-becker committed Dec 16, 2023
commit 98c4cd5032e1f98ebd5a5c794c6884e15d8151f8
7 changes: 5 additions & 2 deletions targets/aarch64/aarch64_neon.py
Original file line number Diff line number Diff line change
Expand Up @@ -2334,12 +2334,15 @@ class vushr(AArch64Instruction):
inputs = ["Va"]
outputs = ["Vd"]

class trn1(AArch64Instruction):
class Transpose(AArch64Instruction):
pass

class trn1(Transpose):
pattern = "trn1 <Vd>.<dt0>, <Va>.<dt1>, <Vb>.<dt2>"
inputs = ["Va", "Vb"]
outputs = ["Vd"]

class trn2(AArch64Instruction):
class trn2(Transpose):
pattern = "trn2 <Vd>.<dt0>, <Va>.<dt1>, <Vb>.<dt2>"
inputs = ["Va", "Vb"]
outputs = ["Vd"]
Expand Down