-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Increase private segment limit for dynamic scratch kernels #80
base: develop
Are you sure you want to change the base?
Commits on Mar 18, 2024
-
SWDEV-449558 - Update barrier's logic
PAL optimized the logic for the barriers, which caused failures with CP DMA on Navi4x. Change barrier's code to match the most recent PAL optimizations. Change-Id: I55eeab20f51eb8e920bcbb4b55fbe3c7f77fd3fa
Configuration menu - View commit details
-
Copy full SHA for 1239309 - Browse repository at this point
Copy the full SHA 1239309View commit details -
SWDEV-430437,SWDEV-434702 - Split the streamset per device
Change-Id: If1bcca45825c9899462bb95ed6f637f5af806cc8
Configuration menu - View commit details
-
Copy full SHA for be2bdab - Browse repository at this point
Copy the full SHA be2bdabView commit details
Commits on Mar 20, 2024
-
SWDEV-1 - Do not pass daz_opt to comgr link options
This no longer does anything. Change-Id: I0643198a46a534a76454a5b461d010ed1776a89a
Configuration menu - View commit details
-
Copy full SHA for 67473ba - Browse repository at this point
Copy the full SHA 67473baView commit details -
SWDEV-449146 - Call hipDeviceSynchronize before removeFatBinary
Added call to hipDeviceSynchronize in __hipUnregisterFatBinary to ensure that all HSA async signal handlers complete before removeFatBinary Change-Id: I756fecca1c2a5eae092613d8079de266399e5685
Configuration menu - View commit details
-
Copy full SHA for ad32e60 - Browse repository at this point
Copy the full SHA ad32e60View commit details
Commits on Mar 21, 2024
-
SWDEV-452299 - Pass dst pitch while capturing hipMemcpyParam2DAsync &…
… elementSize should be 1 as width is in bytes while capturing hipMemset2DAsync. Change-Id: I8f9122a30cba0a07c097dfd7609432090caab142
Configuration menu - View commit details
-
Copy full SHA for 0be92b8 - Browse repository at this point
Copy the full SHA 0be92b8View commit details
Commits on Mar 22, 2024
-
Revert "SWDEV-444098 - remove rocm-ocl-icd packaging"
This reverts commit 5f68a45. Reason for revert: due to windows staging branch using Opencl-icd-loader master Change-Id: I9cca7564a21de1733665a34da6f0322aa3b886e7
Configuration menu - View commit details
-
Copy full SHA for f44ca70 - Browse repository at this point
Copy the full SHA f44ca70View commit details
Commits on Mar 23, 2024
-
SWDEV-451736 - Revert "SWDEV-444670 - Register the Runtime::tearDown …
…function to be called at exit" This reverts commit 5e294f8. Change-Id: Ib9cb1cc0c3903bfba56c9a5d05ae8afe96be583a
Configuration menu - View commit details
-
Copy full SHA for 51926b6 - Browse repository at this point
Copy the full SHA 51926b6View commit details
Commits on Mar 25, 2024
-
SWDEV-353281 - Change pool type for graphs
Under ROCr physical allocations don't have initial VA and require extra flag in ROCclr. Add an option to have a mempool of physical allocations. Change-Id: I4d062fe0dd8113d4eaf6e8b51749ed56d8701d1e
Configuration menu - View commit details
-
Copy full SHA for f296159 - Browse repository at this point
Copy the full SHA f296159View commit details
Commits on Mar 26, 2024
-
SWDEV-301667 - Disable HostBlit copy for HIP correct if check
Change-Id: I33d1359d5e4c871f63350d8300f726e039664d86
Anusha GodavarthySurya authored and Anusha Godavarthy Surya committedMar 26, 2024 Configuration menu - View commit details
-
Copy full SHA for 7f84df9 - Browse repository at this point
Copy the full SHA 7f84df9View commit details -
SWDEV-448288 - Remove references to deprecated llvm references.
Change-Id: I7b58177c41dc0c6c59813977cb90e65a6cb3be72
Configuration menu - View commit details
-
Copy full SHA for 5b28e38 - Browse repository at this point
Copy the full SHA 5b28e38View commit details -
SWDEV-413997 - Save hsa_handle as ptr in hipMemCreate path.
Change-Id: Ica32017ef7b00326dfb6d1f604e126d40ad5b786
Configuration menu - View commit details
-
Copy full SHA for 5cbd74b - Browse repository at this point
Copy the full SHA 5cbd74bView commit details -
SWDEV-446992 - Request can be for bytes OR dptr OR both.
Change-Id: Ib479c744b90125b74d99cbf18b7f4b8cf765bf1c
Configuration menu - View commit details
-
Copy full SHA for 09328f4 - Browse repository at this point
Copy the full SHA 09328f4View commit details -
SWDEV-445096 - Fix -O0 crash in OpenCL tests
- With https://gerrit-git.amd.com/c/lightning/ec/llvm-project/+/1002628 applied, at -O0 Kernel::dynamicParallelism() returns true but virtual queue isn't created - This causes segfault inside VirtualGPU::submitKernelInternal() when getVQVirtualAddress() is called Change-Id: Ia7af042adad2329e870c142caaac3e8fa886f8b8
Configuration menu - View commit details
-
Copy full SHA for d1fff7c - Browse repository at this point
Copy the full SHA d1fff7cView commit details -
SWDEV-301667 - Create TS for each node recorded in graph
- Create a vector to allow multiple TS to be stored in Command. - This would mean we dont wait for entire batch in Accumulate command to finish when we exhaust signals. - Reduce the number of signals created at init to 64. This min value may still need to be tuned but the KFD allows max of 4094 interrupt signals per device. - Store kernel names whenever they are available and not just when profiling. If we dynamically enable profiling like for Torch, a crash can happen if hipGraphInstantiate wasnt included in Torch profile scope beacuse we previously entered kernel names only when profiler is attached. Change-Id: I34e7881a25bbc763f82fdeb3408a8ea58e1ec006
Configuration menu - View commit details
-
Copy full SHA for c157bfb - Browse repository at this point
Copy the full SHA c157bfbView commit details
Commits on Mar 27, 2024
-
SWDEV-450636 - Hostcall Listen thread cause seg fault due to main thr…
…ead is being destroied by app/test's unload libamdhip64.so call. Change-Id: I8d4a8d8b6801d9f6eb745c45adf831597def0cb5
Configuration menu - View commit details
-
Copy full SHA for 4761ecb - Browse repository at this point
Copy the full SHA 4761ecbView commit details
Commits on Mar 28, 2024
-
SWDEV-449922 - Remove per queue residency update
alwaysResident setting doesn't require per queue residency tracking. Thus, the logic can be skipped to avoid the lock of queues. Change-Id: Ib5cff5b79d3ecb8c2f2eb2565cf069f9a69438b0
Configuration menu - View commit details
-
Copy full SHA for 95e3958 - Browse repository at this point
Copy the full SHA 95e3958View commit details -
SWDEV-449922 - Remove a global lock for queues on wait
The new logic has a lock for PAL call and doesn't require the lock for queues. Change-Id: I61b67c3c4abd2ede44809de1d6beed756766032e
Configuration menu - View commit details
-
Copy full SHA for 2f3ad43 - Browse repository at this point
Copy the full SHA 2f3ad43View commit details
Commits on Mar 29, 2024
-
SWDEV-443567 - SWDEV-436126 - Fix Prohibited and Unhandled Operations…
… during capture => hipDeviceSynchronize is not allowed during capture. => hipEventSynchronize during capture should return hipErrorCapturedEvent error => hipEventQuery during capture should return hipErrorCapturedEvent error hipStreamSynchronize, hipEventSynchronize, hipStreamWaitEvent, hipStreamQuery For Side Stream(Stream that is not currently under capture): => If current thread is capturing in relaxed mode, calls are allowed => If any stream in current/concurrent thread is capturing in global mode, calls are not allowed => If any stream in current thread is capturing in ThreadLocal mode, calls are not allowed For Stream that is currently under capture => calls are not allowed => Any call that is not allowed during capture invalidates the capture sequence => It is invalid to call synchronous APIs during capture. Synchronous APIs, such as hipMemcpy(), enqueue work to the legacy stream and synchronize it before returning. Change-Id: I201c6e63e1a5d93fd416a3b520264c0fdbe31237
Anusha GodavarthySurya authored and Anusha Godavarthy Surya committedMar 29, 2024 Configuration menu - View commit details
-
Copy full SHA for 19b4660 - Browse repository at this point
Copy the full SHA 19b4660View commit details -
SWDEV-452787 - correct hipDrvGraphAddMemcpyNode check
Change-Id: Id58f982edd4f17d675f7a0f61a9b4dea0baebd9b
Anusha GodavarthySurya authored and Anusha Godavarthy Surya committedMar 29, 2024 Configuration menu - View commit details
-
Copy full SHA for ea4f09e - Browse repository at this point
Copy the full SHA ea4f09eView commit details
Commits on Apr 1, 2024
-
SWDEV-451687 - Fix alloc message values in AMD_LOG_LEVEL for 32 bit
Change-Id: Icbe67024297c92bf59139b6a2ccd2ba3674f60b1
Configuration menu - View commit details
-
Copy full SHA for 411960a - Browse repository at this point
Copy the full SHA 411960aView commit details
Commits on Apr 2, 2024
-
SWDEV-443537 - fix make build warning message
Add cltrace compile definition for CL_TARGET_OPENCL_VERSION to OpenCL 2.2 Change-Id: Ie868ab0a6e86951afc6d07da58be942c3b736d15
Configuration menu - View commit details
-
Copy full SHA for f7b1398 - Browse repository at this point
Copy the full SHA f7b1398View commit details
Commits on Apr 3, 2024
-
SWDEV-446726 - Disable large bar for 32 bit windows
When large bar is enabled, persistent memory leads to overallocation for 32 bit architecture. Change-Id: Iae39359d8128588de02e42d77fe58e868b8e71fd
Configuration menu - View commit details
-
Copy full SHA for bc80802 - Browse repository at this point
Copy the full SHA bc80802View commit details
Commits on Apr 4, 2024
-
SWDEV-450361 - Add nullptr validation for waitStream
- Application is passing null for parameter stream in API hipStreamWaitEvent - When event stream isn't capturing and event is not recorded, causes segfault because we are accessing deviceId() from waitStream Change-Id: I8b87ffd6f234677f68b66dcb7ef44b2ff04a7c91
Configuration menu - View commit details
-
Copy full SHA for 880f1f0 - Browse repository at this point
Copy the full SHA 880f1f0View commit details
Commits on Apr 5, 2024
-
SWDEV-453301 - Remove the option to write multiple packets in dispatc…
…hGenericAqlPacket Dispatching multiple packets with ring the doorbell once is not supported by the lower layers Change-Id: I7665a2dcdd4ef9e47dadfe410180fed64c5a4ee0
Configuration menu - View commit details
-
Copy full SHA for d7f352d - Browse repository at this point
Copy the full SHA d7f352dView commit details -
SWDEV-451964 - Limit gpu single allocation percentage for gfx940 only
Change-Id: Iadcdadd734e7aeeb23742e426353defa972d3ad5
Configuration menu - View commit details
-
Copy full SHA for dbac297 - Browse repository at this point
Copy the full SHA dbac297View commit details
Commits on Apr 8, 2024
-
SWDEV-301667 - Fix SDMA mask reuse
If we are using the mask returned by getLastUsedSdmaEngine() then we need to apply the SDMA Read/Write mask to it before using with HSA copy_on_engine API. Change-Id: I6e5dc6c187eeb3c61ee159e9d2a0fa7b4737c06e
Configuration menu - View commit details
-
Copy full SHA for 3f0bcf7 - Browse repository at this point
Copy the full SHA 3f0bcf7View commit details
Commits on Apr 9, 2024
-
SWDEV-379007 - add __hip_bfloat16_raw types
This also brings bfloat16 implementation closer to CUDA's. Change-Id: I23f381141faacd6537923ae9b88ada4d661db496
Configuration menu - View commit details
-
Copy full SHA for 481912a - Browse repository at this point
Copy the full SHA 481912aView commit details
Commits on Apr 10, 2024
-
SWDEV-446298 - Adding error code to the logs on p2p hsa api failure.
Change-Id: Ic41b1ad1b64cca0e31986337a83a5146d52a7328
Configuration menu - View commit details
-
Copy full SHA for 2b8634b - Browse repository at this point
Copy the full SHA 2b8634bView commit details -
SWDEV-455254 - Add kernel arg optimization
Add kernel arguments optimization into blit path. Enabled by default on MI300. Change-Id: I2694a81b90d48ad07d86dfe4c0c64fe187bada8e
Configuration menu - View commit details
-
Copy full SHA for f0c7ecf - Browse repository at this point
Copy the full SHA f0c7ecfView commit details
Commits on Apr 12, 2024
-
SWDEV-436754 - Use glbctx instead so that ref count increments for mu…
…lti devices and chunk decommit gets delayed. Change-Id: Ia4b0d5fbfa8f198776e52d14de8b22c6942f740d
Configuration menu - View commit details
-
Copy full SHA for d52168b - Browse repository at this point
Copy the full SHA d52168bView commit details -
SWDEV-424956 - Fix half vector printf issue
Refactor PrintfDbg::outputArgument() to remove potential risk. Fix half vector printf issue on all devices. Fix FEAT-56794 as well. Change-Id: Iae39359d2128588def2e43d77fe58e868b8e71ff
Configuration menu - View commit details
-
Copy full SHA for 35c80dd - Browse repository at this point
Copy the full SHA 35c80ddView commit details -
SWDEV-455254 - Reduce blit kernels signature
Remove offset from blit kernels, since it can be applied in setup. Change-Id: I06b585068d68a0ee8e125ddf46a36fccb372f30d
Configuration menu - View commit details
-
Copy full SHA for 7de7da4 - Browse repository at this point
Copy the full SHA 7de7da4View commit details
Commits on Apr 13, 2024
-
SWDEV-413997 - Fixing multiple device cases.
Change-Id: I10ad3fbfca887e92cd81f68392fa1acf753cbd2b
Configuration menu - View commit details
-
Copy full SHA for d52d16c - Browse repository at this point
Copy the full SHA d52d16cView commit details
Commits on Apr 14, 2024
-
SWDEV-453180 - Add UUID support for HIP_VISIBLE_DEVICES on Linux
- UUID is Ascii string with a maximum of 21 chars which uniquely identifies a GPU - Convert set UUID in HIP_VISIBLE_DEVICES to device index internally - Then use existing device index logic for HIP_VISIBLE_DEVICES Change-Id: I8cab4fe42459f8209b97f909300789e6e687b9ac
Configuration menu - View commit details
-
Copy full SHA for 52db98e - Browse repository at this point
Copy the full SHA 52db98eView commit details
Commits on Apr 15, 2024
-
SWDEV-413997 - Read Access can be valid now that ROCr takes care of a…
…ccess. Change-Id: Iecda74ca0207c95d3fbed8b4e15c8c4c5895d939
Configuration menu - View commit details
-
Copy full SHA for 815e450 - Browse repository at this point
Copy the full SHA 815e450View commit details -
SWDEV-413997 - Fixing alignment validation check for power of 2 inste…
…ad of granularity factor. Change-Id: I1e0db6e0628c09d26850e5a0339e2a4660442db8
Configuration menu - View commit details
-
Copy full SHA for 00ddc3e - Browse repository at this point
Copy the full SHA 00ddc3eView commit details -
SWDEV-453577 - Fixes to account for right CU count based on WGP or CU…
… mode Change-Id: Ib9739f9917bc6ff69cc76f444d909311922ebc1e
Sourabh Betigeri authored and Sourabh Betigeri committedApr 15, 2024 Configuration menu - View commit details
-
Copy full SHA for fcfe2ec - Browse repository at this point
Copy the full SHA fcfe2ecView commit details
Commits on Apr 16, 2024
-
SWDEV-379007 initial implementation of fp8 header
Change-Id: Id9a5a85641882961e4d860a815217c641e6f3387
Configuration menu - View commit details
-
Copy full SHA for ca07f59 - Browse repository at this point
Copy the full SHA ca07f59View commit details -
SWDEV-422580 - Adding back the pcie.function to PCI address string in…
… hipGetDevicePCIBusId. Change-Id: I932724cc872d7ae2643ce6ac2924901cb49cd7ad
Configuration menu - View commit details
-
Copy full SHA for a1e0970 - Browse repository at this point
Copy the full SHA a1e0970View commit details -
SWDEV-444670 - Enable teardown class
Force implicit runtime teardown with a global destructor. Change-Id: Iabe63dedf5b94fefc98668585c45a61607120669
Configuration menu - View commit details
-
Copy full SHA for c95a75a - Browse repository at this point
Copy the full SHA c95a75aView commit details -
SWDEV-379007 - use avx instruction for bf16 cvt
AMD CPUs have had avx512_bf16 support for quite some time now (from consumer Ryzen 7000 series to enterprise grade CPUs). This patch should allow users to use the hardware bf16 unit when running the __host__ variants of the function. This can be enabled via `hipcc ... -mavx512vl -mavx512bf16`. Change-Id: I67c377afc95ddfe8d45a048dce078a247d4a1878
Configuration menu - View commit details
-
Copy full SHA for 49349f1 - Browse repository at this point
Copy the full SHA 49349f1View commit details -
SWDEV-455699 - removing HW_REG_TRAPSTS for gfx12
Change-Id: I5f8b030eefdb37d3d51da3e135e5aa0f18ad9018
Configuration menu - View commit details
-
Copy full SHA for 5ddca58 - Browse repository at this point
Copy the full SHA 5ddca58View commit details
Commits on Apr 17, 2024
-
SWDEV-454959 - ignore the upper half of the mask in wave32 mode
Change-Id: If027dd8cbe5cbe142fff353cb72c16f08e9aea8e
Configuration menu - View commit details
-
Copy full SHA for 03562a2 - Browse repository at this point
Copy the full SHA 03562a2View commit details -
SWDEV-379007 - fix bool check for fp8_fnuz
for fnuz nums zero val is 0x00, -0 i.e. 0x80 would be a NaN Change-Id: Ibdc4fb4b9fb307b5952434f08d45a8ddd6262db8
Configuration menu - View commit details
-
Copy full SHA for d7b0d78 - Browse repository at this point
Copy the full SHA d7b0d78View commit details -
SWDEV-441603 - Correct dst device
Signed-off-by: sdashmiz <[email protected]> Change-Id: Ie60aa598dd73df66cdf02c1d96daf2dfccba7a59
Configuration menu - View commit details
-
Copy full SHA for d511e57 - Browse repository at this point
Copy the full SHA d511e57View commit details -
SWDEV-440746 - Fix the hostcall buffer creation
Avoid a deadlock on the host call buffer creation. Since the buffer will be allocated in the queue thread, then use direct device memory allocation skipping the global context lock. Change-Id: I09b55ee03bb42ab5d320c152b52a8c842c5fdcc1
Configuration menu - View commit details
-
Copy full SHA for 62559a6 - Browse repository at this point
Copy the full SHA 62559a6View commit details
Commits on Apr 18, 2024
-
SWDEV-455346 - End wait if HostcallListener terminates.
Change-Id: I21ec8eadb189147c579ec65acf68de40d604686b
Configuration menu - View commit details
-
Copy full SHA for 8942939 - Browse repository at this point
Copy the full SHA 8942939View commit details -
SWDEV-453535 - Capture hipMemset3DAsync.
Change-Id: I517c2557573db258b3e3e353f02f6a56652b0fde
Configuration menu - View commit details
-
Copy full SHA for 12e0bdc - Browse repository at this point
Copy the full SHA 12e0bdcView commit details -
SWDEV-450053 - Handle MemcpyNodeSetParamsTo/FromSymbol negative param…
…eters For all windows allocation on SVM memory tagged with flag ROCCLR_MEM_INTERPROCESS. hipHostMalloc validation is based on flag. So remove ROCCLR_MEM_INTERPROCESS before check. Change-Id: I823bbf228d9a4a9acb4abffc01ac6b3f544c6e12
Anusha GodavarthySurya authored and Anusha Godavarthy Surya committedApr 18, 2024 Configuration menu - View commit details
-
Copy full SHA for 8179fa9 - Browse repository at this point
Copy the full SHA 8179fa9View commit details -
SWDEV-413997 - VMM IPC implementation for Linux.
Change-Id: Icfeb83ca51e96be35abb67a94d6e3e1a1ca5a934
1Configuration menu - View commit details
-
Copy full SHA for 56ebf51 - Browse repository at this point
Copy the full SHA 56ebf51View commit details
Commits on Apr 19, 2024
-
SWDEV-455869 - Revert "SWDEV-410751 - Consider null amd::memory is in…
…valid." This reverts commit a9ff2c5. Change-Id: I26c4b3c74b2861afc17f979492d025b59d4388ab
Anusha GodavarthySurya authored and Anusha Godavarthy Surya committedApr 19, 2024 Configuration menu - View commit details
-
Copy full SHA for e829ef6 - Browse repository at this point
Copy the full SHA e829ef6View commit details -
SWDEV-451594 - Implement Readback and Avoid HDP Flush workaround for …
…device kernel args Change-Id: I6d41a089a17f55306e7ff402588a1e831b20a7a7
Configuration menu - View commit details
-
Copy full SHA for bf74ef4 - Browse repository at this point
Copy the full SHA bf74ef4View commit details -
SWDEV-446610 - Attribute HIP_POINTER_ATTRIBUTE_SYNC_MEMOPS should ret…
…urn the correct value Change-Id: Ieced2ee61bba28f2d1df96893a661287b0a5c7b7
Configuration menu - View commit details
-
Copy full SHA for 8f7acbd - Browse repository at this point
Copy the full SHA 8f7acbdView commit details -
SWDEV-440746 - Don't set CL_SUBMITTED twice
Change-Id: I9ba34454f7487d6bc0d398b322a147cbac6c6443
Configuration menu - View commit details
-
Copy full SHA for fd81490 - Browse repository at this point
Copy the full SHA fd81490View commit details -
SWDEV-353281 - Reuse timestamp on memory reuse
Mempool may reuse memory without a wait. Hence, the timestamp has to be preserved and can't be destroyed. Change-Id: I6f095f44afa69887a4b7aeb3b329804aedd96f3e
Configuration menu - View commit details
-
Copy full SHA for ffb516d - Browse repository at this point
Copy the full SHA ffb516dView commit details -
SWDEV-440746 - Wait for signal before release
Change-Id: I9e2aefdbcbba153c7f1080d80aab7a345eaf1eb4
Configuration menu - View commit details
-
Copy full SHA for 329ba27 - Browse repository at this point
Copy the full SHA 329ba27View commit details
Commits on Apr 20, 2024
-
SWDEV-453180 - Add UUID support for HIP_VISIBLE_DEVICES on Windows
- UUID needs to be specified in the format GPU-<body>, <body> encodes UUID as a 16 chars - Convert set UUID in HIP_VISIBLE_DEVICES to device index internally - Then use existing device index logic for HIP_VISIBLE_DEVICES Change-Id: I654f492a49cd4d7a9b7339360ab558165240caa5
Configuration menu - View commit details
-
Copy full SHA for fb217fa - Browse repository at this point
Copy the full SHA fb217faView commit details -
SWDEV-455041 - Continue processing fat binary even if other code obje…
…ct bundle processing fail. Change-Id: Iea553ab0265c08341f915644075ce2b6ed9b3200
Configuration menu - View commit details
-
Copy full SHA for 863c562 - Browse repository at this point
Copy the full SHA 863c562View commit details
Commits on Apr 22, 2024
-
SWDEV-440746 - Remove obsolete code
The "optimized" version of memcpy is outdated and was used in win32 only. Change-Id: I7f2e0e9051e37cec95438266824b5b0025c324c6
Configuration menu - View commit details
-
Copy full SHA for 7448113 - Browse repository at this point
Copy the full SHA 7448113View commit details -
SWDEV-440746 - Release last command on terminate
Change-Id: Ib6a9b8fc9a8692eb17b39b854cefd92c6b59733f
Configuration menu - View commit details
-
Copy full SHA for 0ccdb3e - Browse repository at this point
Copy the full SHA 0ccdb3eView commit details -
SWDEV-413997 - Cleanup fixes for Virtual Memory Management.
Change-Id: I9a4a4d9087b5daf15e3ba31e786d34db431212a1
Configuration menu - View commit details
-
Copy full SHA for 74ffc5f - Browse repository at this point
Copy the full SHA 74ffc5fView commit details
Commits on Apr 23, 2024
-
Switch luxmark to lightning compiler for all ASICs
Change-Id: Idcd37628a2167f0bd2db2a83132a1862cbd051b0
Configuration menu - View commit details
-
Copy full SHA for 5a715ed - Browse repository at this point
Copy the full SHA 5a715edView commit details -
[SWDEV-454661][SWDEV-454653] - GraphExecMemcpyNodeSetParam to return …
…error on memcpy direction change Change-Id: I2c8f5ea394caeaaa6895003e63cd62a052c491f8
Configuration menu - View commit details
-
Copy full SHA for 8809633 - Browse repository at this point
Copy the full SHA 8809633View commit details
Commits on Apr 24, 2024
-
457619 - Fixed the broken link to build HIP instructions
Change-Id: Ica87b4ab511d26e0372502f069afc0e3baaa3256
Configuration menu - View commit details
-
Copy full SHA for 1761f1b - Browse repository at this point
Copy the full SHA 1761f1bView commit details -
Lower layer ignores alignment Change-Id: If16df951ecefddc804a6effe013058afc595d30f
Configuration menu - View commit details
-
Copy full SHA for 5c23440 - Browse repository at this point
Copy the full SHA 5c23440View commit details -
SWDEV-451099 - Added include for __half type definitions for non-HIP …
…code on windows Change-Id: Id80cef5a36db8707276de052cbaf73b6826d222f
Configuration menu - View commit details
-
Copy full SHA for 2841aab - Browse repository at this point
Copy the full SHA 2841aabView commit details
Commits on Apr 25, 2024
-
SWDEV-447691 - Correct handle type for DX12 semaphore
Change-Id: Id23882286cb2a0d0472964ffc501ab27b7dc7f00
Configuration menu - View commit details
-
Copy full SHA for 9fdddb7 - Browse repository at this point
Copy the full SHA 9fdddb7View commit details -
SWDEV-420016 - Add more driver side graph APIs
Signed-off-by: shadi <[email protected]> Change-Id: Iff3ee7dcbcd24836f227fdc9bd5ff4b554ac914f
Configuration menu - View commit details
-
Copy full SHA for f2b0178 - Browse repository at this point
Copy the full SHA f2b0178View commit details -
SWDEV-457859 - Initialize isWGPMode_ in WorkGroupInfo
Change-Id: Ie3f3c0bcea84368c1b0607fd52b4bc7cae41c512
Configuration menu - View commit details
-
Copy full SHA for cffff4e - Browse repository at this point
Copy the full SHA cffff4eView commit details
Commits on Apr 26, 2024
-
SWDEV-353281 - Corret VA unmap
Make sure graph mempool unmaps VA on release Change-Id: Id3f1bd8d0115b533ae60aa5ba3676b8bf7e5b961
Configuration menu - View commit details
-
Copy full SHA for 5c1804a - Browse repository at this point
Copy the full SHA 5c1804aView commit details
Commits on Apr 28, 2024
-
SWDEV-413997 - Handling cases where multiple phys_mem is mapped into …
…a single virtual address. Change-Id: Ie678e607a64f2e5c35a10b9083185f041c5527ac
Configuration menu - View commit details
-
Copy full SHA for 49b4aef - Browse repository at this point
Copy the full SHA 49b4aefView commit details
Commits on Apr 29, 2024
-
SWDEV-451004 - Remove .bat files from hip-dev/devel package
The .bat file will not be removed from windows pacakge. Also used cmake install(PROGRAMS …), which will set the correct permission rather than hard coding the permission Change-Id: I8b57778b59f70e01de949be2ea353b67eec70d2f
Configuration menu - View commit details
-
Copy full SHA for 9e74f6d - Browse repository at this point
Copy the full SHA 9e74f6dView commit details -
SWDEV-458516 - Add support for external CI builds using Azure Pipelines
Change-Id: Iedd6290a813d6e43a4350709484f78e05b08adc8
Configuration menu - View commit details
-
Copy full SHA for d7d1e01 - Browse repository at this point
Copy the full SHA d7d1e01View commit details -
SWDEV-413997 - Changes to use GlobalContext in views.
Change-Id: I1f8411eae9ed49632667e244a25f223fed92c720
Configuration menu - View commit details
-
Copy full SHA for 0e1a057 - Browse repository at this point
Copy the full SHA 0e1a057View commit details -
SWDEV-311271 - Enable mempools under Linux
Change-Id: I7fda94e61121f9d3a30f4ad185b8a97712922f3c
Configuration menu - View commit details
-
Copy full SHA for 7a37150 - Browse repository at this point
Copy the full SHA 7a37150View commit details -
SWDEV-459583 - Update codeownder in clr repos
Change-Id: Ief73138faed0af70b90186db5bde6689e0a83f88
Configuration menu - View commit details
-
Copy full SHA for 08889f4 - Browse repository at this point
Copy the full SHA 08889f4View commit details
Commits on Apr 30, 2024
-
SWDEV-456279 - Adding new hip flag to access contiguous memory and pa…
…ss the flag to HSA API. Change-Id: I1bafeaa3096395c729723af958d609bc41e7845c
Configuration menu - View commit details
-
Copy full SHA for 1d48f2a - Browse repository at this point
Copy the full SHA 1d48f2aView commit details -
SWDEV-301667 - Add LOG_TS mask
- Add LOG_TS mask for printing signal times - Read raw ticks from signals Change-Id: Ibdd0bf06c790729f6c65083a4784c97a3c3219e0
Configuration menu - View commit details
-
Copy full SHA for 948ca5a - Browse repository at this point
Copy the full SHA 948ca5aView commit details -
SWDEV 301667 - Fix Debug dot print for graphs
- Fix possible buffer overflow for long kernel names Change-Id: I3c51669de7ff242d03f9210ee045b6d5e7ac274a
Configuration menu - View commit details
-
Copy full SHA for f1ef05b - Browse repository at this point
Copy the full SHA f1ef05bView commit details -
SWDEV-455586 - create ocltst ASAN test instrumented pkg
Change-Id: I820cacd75a36363d1387e2e881c96937281bc265
Configuration menu - View commit details
-
Copy full SHA for feaef7f - Browse repository at this point
Copy the full SHA feaef7fView commit details
Commits on May 1, 2024
-
[SWDEV-442583] - Fix hipLaunchCooperativeKernel API error codes
Change-Id: Id3cb4b58b9efd3aceec4377d4d1d3a053c50333e
Configuration menu - View commit details
-
Copy full SHA for 59051ed - Browse repository at this point
Copy the full SHA 59051edView commit details -
SWDEV-449327 - Adding CONTRIBUTING.md in clr repos
Change-Id: I3601373f680aa4bff0075f4b3b9e885e54b4600d
Configuration menu - View commit details
-
Copy full SHA for 996c16a - Browse repository at this point
Copy the full SHA 996c16aView commit details -
SWDEV-433371 - use comgr to unbundle code objects
1.Make runtime use comgr to unbundle code objects 2.Support compressed/uncompressed modes 3.Remove HIP_USE_RUNTIME_UNBUNDLER and HIPRTC_USE_RUNTIME_UNBUNDLER to simplify logics 4.Add comgr wrapper for amd_comgr_action_info_set_bundle_entry_ids() Change-Id: Ic41b1ad1b64cca1e31986437983a5146d52a7329
Configuration menu - View commit details
-
Copy full SHA for e53df57 - Browse repository at this point
Copy the full SHA e53df57View commit details
Commits on May 2, 2024
-
SWDEV-451594 - Change device kernel args to use HDP flush by default
The Readback and Avoid HDP Flush memory ordering workaround is used as a fallback solution only when HDP flush register is invalid Change-Id: Ic284eba1f95ed22b0270d3abeb904fb902015b1a
Configuration menu - View commit details
-
Copy full SHA for 6cb7b6e - Browse repository at this point
Copy the full SHA 6cb7b6eView commit details
Commits on May 3, 2024
-
SWDEV-433371 - Build needs comgr >= 2.8
Unbundle compressed code objects needs comgr 2.8 or higher Change-Id: I23942d2038e19b02c3ea5d3c9c1fe5367db87136
Configuration menu - View commit details
-
Copy full SHA for 1bd8b4e - Browse repository at this point
Copy the full SHA 1bd8b4eView commit details -
SWDEV-460091 - Handle cases where inline variables are included in mu…
…ltiple translation unit. Change-Id: I92179ad198abbdaf5aec9c3c4ba76eeb6b0cc761
Configuration menu - View commit details
-
Copy full SHA for 68ddd8f - Browse repository at this point
Copy the full SHA 68ddd8fView commit details
Commits on May 5, 2024
-
SWDEV-301667 - Pass reference to kernel name
Change-Id: I21abe109ddfabfe7640bf78a96c81a1317d31952
Configuration menu - View commit details
-
Copy full SHA for 4a9d24a - Browse repository at this point
Copy the full SHA 4a9d24aView commit details
Commits on May 6, 2024
-
SWDEV-451945 - Remove ShouldLoadPlatform function
Change-Id: Iabb4071bb77201576bc2c0488a04f4fa188815df
Configuration menu - View commit details
-
Copy full SHA for 2eb3037 - Browse repository at this point
Copy the full SHA 2eb3037View commit details -
SWDEV-460151 - Enabling gfx12 in HIP runtime.
Change-Id: I374ea7c3119b0c61f9846a862c4a448ddb179748
Configuration menu - View commit details
-
Copy full SHA for 7ad1416 - Browse repository at this point
Copy the full SHA 7ad1416View commit details
Commits on May 7, 2024
-
SWDEV-460242 - Add system memory suballocator
Switch commands creation to the new suballocator to avoid frequent expensive OS calls Change-Id: I3597c811820e577c15708bad8b8a41aa53acc400
Configuration menu - View commit details
-
Copy full SHA for 933aa1d - Browse repository at this point
Copy the full SHA 933aa1dView commit details
Commits on May 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for c76223c - Browse repository at this point
Copy the full SHA c76223cView commit details