Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BLAS][MKLGPU] Trsv tests can fail on PVC #600

Open
Rbiessy opened this issue Oct 21, 2024 · 0 comments
Open

[BLAS][MKLGPU] Trsv tests can fail on PVC #600

Rbiessy opened this issue Oct 21, 2024 · 0 comments
Labels
BLAS domain BLAS domain issue/request bug A request to fix an issue

Comments

@Rbiessy
Copy link
Contributor

Rbiessy commented Oct 21, 2024

Summary

The MKLGPU backend tests can fail when running Trsv on PVC.

Version

Using the tip of develop as of today (6923d40).

Environment

Running on PVC ( GPU Max 1100 1.3) with the oneAPI base toolkit 2024.2.0. OS is Ubuntu 22.04.
apt level-zero package versions:

  • level-zero: 1.16.15-881~22.04
  • level-zero-dev: 1.16.15-881~22.04
  • intel-level-zero-gpu: 1.3.30049.10-950~22.04

Steps to reproduce

cmake -Bbuild-pvc -GNinja -DREF_BLAS_ROOT=/path/to/lapack/install -DREF_LAPACK_ROOT=/path/to/lapack/install .
cd build-pvc
ninja
ctest -R ".*Trsv.*" --output-on-failure

Observed behavior

Full log: log_pvc.txt
The tests are failing with:

FATAL: Unexpected page fault from GPU at 0x7fa3dc0df000, ctx_id: 1 (CCS) type: 0 (NotPresent), level: 0 (PTE), access: 0 (Read), banned: 1, aborting.
FATAL: Unexpected page fault from GPU at 0x7fa3dc0df000, ctx_id: 1 (CCS) type: 0 (NotPresent), level: 0 (PTE), access: 0 (Read), banned: 1, aborting.
Abort was called at 287 line in file:
./shared/source/os_interface/linux/drm_neo.cpp

Note the DFT failures are reported in a separate issue: #601

Expected behavior

The tests should pass.

@Rbiessy Rbiessy added bug A request to fix an issue BLAS domain BLAS domain issue/request labels Oct 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BLAS domain BLAS domain issue/request bug A request to fix an issue
Projects
None yet
Development

No branches or pull requests

1 participant