-
Notifications
You must be signed in to change notification settings - Fork 540
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request]: atomicAdd() to support half2 #3573
Comments
For half, we have |
is there any risk concern for |
Its unsafe because it causes the fast HW instruction to be generated, but those instructions don't work if they act on memory that is not cached, e.g. across a PCIe bus. The developer needs to assert that they are willing to take that risk. |
Does ROCm 6.2 support it ? /opt/rocm-6.2.0/lib/llvm/bin/../../../include/hip/amd_detail/amd_hip_fp16.h does not contain the function. |
Do you think it is better to have two types of atomic add functions than a single function in CUDA ? |
Hi @jinz2014, the function is recently added part of this. You can see it on staging but it hasn't made to 6.2.2 release yet, but please keep an eye out for it since it should be out soon. Meanwhile if you are interested, you can try to compile and install clr staging to see how it works.
In our case we wanted to ensure developers assert that they are using a unsafe function. However, I see your point in having a uniform interface to access these functions. We can will bring it up internally for discussion for sure. Thanks! |
Hi @jinz2014, to follow up on the discussion, in our opinion, it is best to have two versions of the atomic functions because atomic* and unsafeAtomic* APIs behave differently, hence the distinction to avoid confusion. We think this aspect outweigh the syntax discrepancy with CUDA, and should be explicitly noted by developers that use these APIs. Thanks! |
Hi @tcgu-amd Thanks for your answers. |
@jinz2014 No problem! Is there anything else we can help you with? If not I will close this issue for now. Thanks! |
I will try the atomicAdd() for half data types. I am not sure if @ZJLi2013 has more questions. |
hi, @tcgu-amd , I tried rocm/torch images with rocm6.2.3, rocm6.2.4, neither has anther thing may need you clarify, looks in will wait on a official release then Thanks again |
Yes, that's correct. There is no
Sounds good. Thanks! |
This issue will be closed since there is no further actionable item/activity. Please feel free to re-open for follow ups and further inquires requiring the release status. Thanks! |
@tcgu-amd |
@jinz2014 sure! Just a note that we would prefer closing issues with no further actionable items to help better track which ones still require active attention. Since the feature requested specifically for this issue is already in staging, we can keep it open a little longer. However, in general, feature requests that are on internal roadmaps will be closed, but we do encourage users to poll for progress by continue to ask follow up questions. Hope this makes sense. Thanks! |
Suggestion Description
hi, hip team,
here is cuda version,
looks there's non hip alternative yet, if built with hipcc, it gives:
Operating System
Ubuntu 22.04
GPU
mi300
ROCm Component
6.1.3 + rocblas + rocwmma
The text was updated successfully, but these errors were encountered: