-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Replace SYCL backend reduce_by_segment
implementation with reduce-then-scan call
#1915
base: main
Are you sure you want to change the base?
Commits on Nov 6, 2024
-
Initial commit of reduce_by_segment with the reduce-then-scan path
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6c11271 - Browse repository at this point
Copy the full SHA 6c11271View commit details -
Revert change to ranges and use zip_view over segments / values instead
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b7328e8 - Browse repository at this point
Copy the full SHA b7328e8View commit details -
Implement correct return for reduce_by_segment
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for f017ffd - Browse repository at this point
Copy the full SHA f017ffdView commit details -
Add support for flag predicates
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5cfa661 - Browse repository at this point
Copy the full SHA 5cfa661View commit details -
Revert "Add support for flag predicates"
This reverts commit 0e0d50e.
Configuration menu - View commit details
-
Copy full SHA for 4f8059a - Browse repository at this point
Copy the full SHA 4f8059aView commit details -
Re-implement support for flag predicates in a more performant manner
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d56bc5a - Browse repository at this point
Copy the full SHA d56bc5aView commit details -
Add fallback and remove old SYCL implementation
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 543e82f - Browse repository at this point
Copy the full SHA 543e82fView commit details -
Switch from using lambdas to functors
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7c92238 - Browse repository at this point
Copy the full SHA 7c92238View commit details -
Add device copyable specializations for red-by-seg functors and updat…
…e testing Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 6c4267b - Browse repository at this point
Copy the full SHA 6c4267bView commit details -
Fix typo in error message in device_copyable.pass.cpp
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3eee242 - Browse repository at this point
Copy the full SHA 3eee242View commit details -
Introduce separate input generation for scan phase and update tests
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for b334fb0 - Browse repository at this point
Copy the full SHA b334fb0View commit details -
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d564b33 - Browse repository at this point
Copy the full SHA d564b33View commit details -
Add optional first key field to scan input and remove input range in …
…write operations Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 358ec3b - Browse repository at this point
Copy the full SHA 358ec3bView commit details -
Update __write_op in reduce-then-scan
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for db0bc25 - Browse repository at this point
Copy the full SHA db0bc25View commit details -
Remove now unneeded ONEDPL_WORKAROUND_FOR_IGPU_64BIT_REDUCTION macro
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 7abbff8 - Browse repository at this point
Copy the full SHA 7abbff8View commit details -
Alternate testing between usm shared and device to prevent excessive …
…binary size Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 92cf29a - Browse repository at this point
Copy the full SHA 92cf29aView commit details -
Performance tuning within scan input functor
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d292e07 - Browse repository at this point
Copy the full SHA d292e07View commit details -
Handle n=0, n=1 first in reduce_by_segment
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e41298d - Browse repository at this point
Copy the full SHA e41298dView commit details -
Configuration menu - View commit details
-
Copy full SHA for fb9a306 - Browse repository at this point
Copy the full SHA fb9a306View commit details -
Improve comments and mark relevant variables as const
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for d32ea4d - Browse repository at this point
Copy the full SHA d32ea4dView commit details -
Add condition to ensure value type is trivially copyable to call redu…
…ce-then-scan Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for aa4baaf - Browse repository at this point
Copy the full SHA aa4baafView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8cc59df - Browse repository at this point
Copy the full SHA 8cc59dfView commit details -
Introduce iterator based __pattern_reduce_by_segment
* An iterator based __pattern_reduce_by_segment is added * Due to compiler issues prior to icpx 2025.0, the reduce-then-scan path is disabled and the previous handcrafted SYCL implementation is restored to prevent performance regressions with older compilers * The previous range-based fallback implementation has been moved to the SYCL backend along with the handcrafted SYCL version Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 80ceacd - Browse repository at this point
Copy the full SHA 80ceacdView commit details -
Revert "Remove now unneeded ONEDPL_WORKAROUND_FOR_IGPU_64BIT_REDUCTIO…
…N macro" This reverts commit a4c7835.
Configuration menu - View commit details
-
Copy full SHA for 74143b2 - Browse repository at this point
Copy the full SHA 74143b2View commit details -
Fix test bug where device allocation is always used for testing
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for fdf6a39 - Browse repository at this point
Copy the full SHA fdf6a39View commit details
Commits on Nov 13, 2024
-
Separate each reduce_by_segment fallback path into their own functions
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 5987df9 - Browse repository at this point
Copy the full SHA 5987df9View commit details -
Configuration menu - View commit details
-
Copy full SHA for d80377e - Browse repository at this point
Copy the full SHA d80377eView commit details
Commits on Nov 21, 2024
-
Address comments in reduce-then-scan based implementation
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3c8154e - Browse repository at this point
Copy the full SHA 3c8154eView commit details
Commits on Nov 22, 2024
-
Improve explanations of reduce-by-segment approach
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for db63d45 - Browse repository at this point
Copy the full SHA db63d45View commit details -
Use binary_op[_non]_device_copyable where appropriate
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3deed76 - Browse repository at this point
Copy the full SHA 3deed76View commit details -
Address comments in fallback implementation
Signed-off-by: Matthew Michel <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for c641bd3 - Browse repository at this point
Copy the full SHA c641bd3View commit details