Rocm jaxlib v0.4.30 qa cleanup #35
Conversation
Main changes include:
- Added support for fp8 matmul with fp8 and bf16 output data types.
- Added buffer comparators for fp8e4m3fnuz and fp8e5m2fnuz.
@draganmladjenovic take a look at the failures for the MLIR tests at 79b3692. Most of them fail because AMD GPUs have a larger thread size and smaller block size than NVIDIA GPUs. Some tests run into an infinite loop on MI200, so I have commented out RunAndCompare.
I think we're OK on CublasDot at 0.4.28, but it fails on 0.4.30?
I'm wondering whether we have a ticket to track it.
It needs more investigation. It depends on the autotune choice: https://github.com/ROCm/frameworks-internal/issues/9088
  switch (algorithm) {
    case PrecisionConfig::ALG_DOT_ANY_F8_ANY_F8_F32:
    case PrecisionConfig::ALG_DOT_ANY_F8_ANY_F8_F32_FAST_ACCUM:
      // Other F8 types are actually not supported by NVIDIA GPUs.
-     return is_cuda_ge_ada &&
+     return (is_cuda_ge_ada || is_rocm_mi100_and_above) &&
I am not sure this is really correct. I believe FP8 support begins with the MI300 architecture? But I see that we have the same check upstream as well.
I have reviewed the gemm-related changes and buffer_comparator, leaving the remaining LLVM changes to Dragan and Chao.
Is this PR merged into QA-31 too? @hsharsha
No description provided.