Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Promotion from AMD internal branch for 2024.Q4.1 #2908

Closed
wants to merge 4 commits into from
Closed

Conversation

qiaojbao
Copy link
Collaborator

Set SLC=0 for ATM
LowerGpuRt: fix a type confusion
Honor NonUniform decorations on OpAccessChain operands Force WGP when NGG is in passthrough mode
Use idxen modifier for (RW)StructuredBuffers
Compatible with LLVM upstream change that moved Utility funcs to Utils Include Line Correlation Info for Tools via LLPC
[Continuations] Cleanup Complete op lowering
[CompilerUtils] Improve handling of freeze in ValueOriginTracker [compilerutils] Add GetGlobalInModule to CrossModuleInliner amdllpc: some usability improvements
[Continuations] Add SpecializeDriverShadersPass
Split up gl_in array type.
[Continuations] Remove stack lowering define
Use inbounds gep to index row major matrix
llvmraytracing: Remove support for _AmdContinuationsGetFlags Properly encapsulate state that affects GPURT specialization llvmraytracing: only support a waitmask of -1
llvmraytracing: remove remaining traces of EnqueueCall compilerutils: fix a warning
[LowerGlobals] Mark globals with buffer.index users as readonly Implement structural GEP dialect for in/out
llvmraytracing: add LLPC/LLVM scratch/global address spaces llpc/ProcessGpuRtLibrary: use earlyGpurtTransform
[Continuations] Make use of llvm::zip in CleanupContinuationsPass::updateCpsFunctionArgs Adjust the llvm-tblgen memory limit
lgc: split lowering of GroupMemcpyOp for mesh/task shaders to MeshTaskShader PatchResourceCollect: stop cleaning undef output value in some cases [Continuations] Freeze poison that is stored to payload Fix wrong pipeline dump message
Fix issues with coherent
Update SPIR-V header to latest version
Simplify load of attribute ring buffer descriptor
[Continuations] Enable Traversal specialization test Adjust tests to take into account upstream overload of rsrc and samp arguments [Continuations] Fix signed/unsigned comparison warning LowerRaytracingPipeline: tease apart return handling and any-hit exit handling LowerRaytracingPipeline: unify function end handling lgc: support int4 in cooperative matrix
Add rounding mode for PackHalf2x16
[Continuations] Fix unused variable warning
lgc: Add MsgPackScanner
[Continuations] Remove LegacyCleanupContinuations pass LowerCooperativeMatrix: fix compile warning due to missing break statement lgc: New RegStackUsage to propagate reg/stack usage [LGC] Add helper lane state for subgroups
Add client name to SPIR-V frontend
Use more flexible readfirstlane
Gate argument definition for GEP::collectOffset. [Continuations] Introduce dummy csp argument for lgc.cps.jump [Continuations] Tolerate non-waiting AwaitTraversal Add AmdExtD3DShaderIntrinsics_LoadDwordAtAddrx3
Add more BuiltIn which has primitive index in mesh shader Rewrite the helper getShaderModuleUsageInfo
[Continuations] Replace isLgcRtOp helper with isDialectOp helper from dialects Downgrade scope Device to Workgroup if permitted
Optimize performance for PrimSetup
Update submodule llvm-dialects
Remove inactive built-ins for last vertex processing stage Fix internal hash for color export shader
Fix call to lookupIntrinsicID
[Continuations] Move Simplifying GEP helpers to CompilerUtils [PatchBufferOp] Generate struct buffer cmpxchg intrinsics [LGC] Refactor default wave size setting
[Continuations] Derive DispatchSystemData type from _cont_DispatchRaysIndex3 [Continuations] Replace "lgc.rt" starts_with check Rename some classes and files
Optimize PointSize write when the value is 1.0
[Continuations] Handle _AmdGetShaderRecordIndex calls [LowerBufferOperations] Check for uniform buffer pointers for s_buffer_load lgc: Mark applicable LgcDialect and Builder ops as NoDivergenceSource

Set SLC=0 for ATM
LowerGpuRt: fix a type confusion
Honor NonUniform decorations on OpAccessChain operands
Force WGP when NGG is in passthrough mode
Use idxen modifier for (RW)StructuredBuffers
Compatible with LLVM upstream change that moved Utility funcs to Utils
Include Line Correlation Info for Tools via LLPC
[Continuations] Cleanup Complete op lowering
[CompilerUtils] Improve handling of freeze in ValueOriginTracker
[compilerutils] Add GetGlobalInModule to CrossModuleInliner
amdllpc: some usability improvements
[Continuations] Add SpecializeDriverShadersPass
Split up gl_in array type.
[Continuations] Remove stack lowering define
Use inbounds gep to index row major matrix
llvmraytracing: Remove support for _AmdContinuationsGetFlags
Properly encapsulate state that affects GPURT specialization
llvmraytracing: only support a waitmask of -1
llvmraytracing: remove remaining traces of EnqueueCall
compilerutils: fix a warning
[LowerGlobals] Mark globals with `buffer.index` users as readonly
Implement structural GEP dialect for in/out
llvmraytracing: add LLPC/LLVM scratch/global address spaces
llpc/ProcessGpuRtLibrary: use earlyGpurtTransform
[Continuations] Make use of `llvm::zip` in `CleanupContinuationsPass::updateCpsFunctionArgs`
Adjust the llvm-tblgen memory limit
lgc: split lowering of GroupMemcpyOp for mesh/task shaders to MeshTaskShader
PatchResourceCollect: stop cleaning undef output value in some cases
[Continuations] Freeze poison that is stored to payload
Fix wrong pipeline dump message
Fix issues with coherent
Update SPIR-V header to latest version
Simplify load of attribute ring buffer descriptor
[Continuations] Enable Traversal specialization test
Adjust tests to take into account upstream overload of rsrc and samp arguments
[Continuations] Fix signed/unsigned comparison warning
LowerRaytracingPipeline: tease apart return handling and any-hit exit handling
LowerRaytracingPipeline: unify function end handling
lgc: support int4 in cooperative matrix
Add rounding mode for PackHalf2x16
[Continuations] Fix unused variable warning
lgc: Add MsgPackScanner
[Continuations] Remove `LegacyCleanupContinuations` pass
LowerCooperativeMatrix: fix compile warning due to missing break statement
lgc: New RegStackUsage to propagate reg/stack usage
[LGC] Add helper lane state for subgroups
Add client name to SPIR-V frontend
Use more flexible readfirstlane
Gate argument definition for `GEP::collectOffset`.
[Continuations] Introduce dummy `csp` argument for `lgc.cps.jump`
[Continuations] Tolerate non-waiting AwaitTraversal
Add AmdExtD3DShaderIntrinsics_LoadDwordAtAddrx3
Add more BuiltIn which has primitive index in mesh shader
Rewrite the helper getShaderModuleUsageInfo
[Continuations] Replace isLgcRtOp helper with isDialectOp helper from dialects
Downgrade scope Device to Workgroup if permitted
Optimize performance for PrimSetup
Update submodule llvm-dialects
Remove inactive built-ins for last vertex processing stage
Fix internal hash for color export shader
Fix call to lookupIntrinsicID
[Continuations] Move Simplifying GEP helpers to CompilerUtils
[PatchBufferOp] Generate struct buffer cmpxchg intrinsics
[LGC] Refactor default wave size setting
[Continuations] Derive `DispatchSystemData` type from `_cont_DispatchRaysIndex3`
[Continuations] Replace "lgc.rt" starts_with check
Rename some classes and files
Optimize PointSize write when the value is 1.0
[Continuations] Handle `_AmdGetShaderRecordIndex` calls
[LowerBufferOperations] Check for uniform buffer pointers for s_buffer_load
lgc: Mark applicable LgcDialect and Builder ops as NoDivergenceSource
@qiaojbao qiaojbao requested a review from a team as a code owner October 30, 2024 08:48
@qiaojbao qiaojbao changed the title Update llpc from commit bdb23f95 Promotion from AMD internal branch for 2024.Q4.1 Oct 30, 2024
@amdvlk-admin
Copy link

Test summary for commit b86d36b

CTS tests (Failed: 2/148731)
  • Built with version 1.3.5.2.RC
  • Ubuntu navi3x, Srdcvk
    • Passed: 35806/75073 (47.7%)
    • Failed: 1/75073 (0.0%)

      Failures:

      FAILURE: dEQP-VK.api.device_init.create_instance_device_intentional_alloc_fail.basic
      Stack trace: Timeout
      

    • Not Supported: 39266/75073 (52.3%)
    • Warnings: 0/75073 (0.0%)
    Ubuntu navi2x, Srdcvk
    • Passed: 34909/73658 (47.4%)
    • Failed: 1/73658 (0.0%)

      Failures:

      FAILURE: dEQP-VK.api.device_init.create_instance_device_intentional_alloc_fail.basic
      Stack trace: Timeout
      

    • Not Supported: 38748/73658 (52.6%)
    • Warnings: 0/73658 (0.0%)

@amdvlk-admin
Copy link

Test summary for commit bc32393

CTS tests (Failed: 2/148731)
  • Built with version 1.3.5.2.RC
  • Ubuntu navi3x, Srdcvk
    • Passed: 35806/75073 (47.7%)
    • Failed: 1/75073 (0.0%)

      Failures:

      FAILURE: dEQP-VK.api.device_init.create_instance_device_intentional_alloc_fail.basic
      Stack trace: Timeout
      

    • Not Supported: 39266/75073 (52.3%)
    • Warnings: 0/75073 (0.0%)
    Ubuntu navi2x, Srdcvk
    • Passed: 34909/73658 (47.4%)
    • Failed: 1/73658 (0.0%)

      Failures:

      FAILURE: dEQP-VK.api.device_init.create_instance_device_intentional_alloc_fail.basic
      Stack trace: Timeout
      

    • Not Supported: 38748/73658 (52.6%)
    • Warnings: 0/73658 (0.0%)

@qiaojbao qiaojbao closed this Oct 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants