DSA initcontainer failures #1836
Comments
can you check for |
It looks like for some reason it cannot enable
iommu and sm_on is enabled:
DMAR info:
|
AFAIK getting it to 1 is a precondition for this to work. Maybe you are missing some setting in your BIOS? |
For reference, same logs from our SPR node that runs the DSA tests:
|
Do you have any instructions for the DSA-related BIOS settings, or a related document you can share with us? We also need to include these BIOS settings in our readme. Thanks! |
The DSA documentation does not mention any BIOS settings other than Intel® Virtualization Technology or Directed I/O (VT-d). Both of these are enabled. We were advised (intel/idxd-config#46) to disable 5-level paging via kernel boot parameters. Do you have it disabled on your nodes? We are going to try this, but it is not documented in the DSA guide as a requirement. |
Look for ENQCMD/ENQCMDS |
ENQCMD/ENQCMDS are enabled in the BIOS and we still see this issue. We also tried disabling 5-level page tables as suggested in the issue above, but that does not resolve the issue either. |
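For anyone repeating these two checks on a node, here is a rough Python sketch. It assumes that the kernel exposes ENQCMD support as an `enqcmd` entry in the `flags` line of /proc/cpuinfo and that 5-level paging is disabled with the `no5lvl` boot parameter. The helper names and the sample strings are made up for illustration; on a real node you would pass in the contents of /proc/cpuinfo and /proc/cmdline.

```python
def has_cpu_flag(cpuinfo_text, flag):
    """Return True if `flag` appears in any 'flags' line of cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags" and flag in value.split():
            return True
    return False


def has_boot_param(cmdline_text, param):
    """Return True if `param` is present as a whole token on the kernel command line."""
    return param in cmdline_text.split()


# Hypothetical sample inputs, not taken from the affected node.
sample_cpuinfo = "processor\t: 0\nflags\t\t: fpu vme enqcmd\n"
sample_cmdline = "BOOT_IMAGE=/vmlinuz ro intel_iommu=on,sm_on no5lvl"

enqcmd_visible = has_cpu_flag(sample_cpuinfo, "enqcmd")
five_level_disabled = has_boot_param(sample_cmdline, "no5lvl")
```

A BIOS option being enabled does not necessarily mean the kernel reports the flag, so checking what the running kernel sees can be more telling than the BIOS screen.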
Update: After the idxd driver was loaded properly, the initcontainer was still failing. It turns out that the driver on RHEL 9.2 requires "driver_name" to be present in the DSA config file (intel/idxd-config@84f099c). After adding "driver_name" to the config, the initcontainer started properly. Thanks to @mythi for the help debugging the issue. |
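As an illustration of the fix described in the update, the sketch below adds a "driver_name" key to every user-type work queue entry in a parsed config. Several things here are assumptions rather than facts from this thread: the nested layout of the sample (a list of devices with "groups" and "grouped_workqueues"), the function name `add_driver_name`, and the value "user", which is guessed from the driver name in the dmesg line. Check the actual config in the repo before relying on any of it.

```python
import json


def add_driver_name(node, driver_name="user"):
    """Recursively add 'driver_name' to user-type wq entries that lack it."""
    if isinstance(node, dict):
        is_wq = str(node.get("dev", "")).startswith("wq")
        if is_wq and node.get("type") == "user":
            # setdefault keeps an existing value untouched.
            node.setdefault("driver_name", driver_name)
        for value in list(node.values()):
            add_driver_name(value, driver_name)
    elif isinstance(node, list):
        for item in node:
            add_driver_name(item, driver_name)
    return node


# Hypothetical, heavily trimmed config; the real file has more fields.
sample_config = json.loads("""
[{"dev": "dsa0",
  "groups": [{"dev": "group0.0",
              "grouped_workqueues": [
                  {"dev": "wq0.0", "mode": "dedicated", "type": "user"}]}]}]
""")

patched = add_driver_name(sample_config)
patched_text = json.dumps(patched, indent=2)
```

The function only adds the key where it is missing, so running it on a config that already sets "driver_name" leaves that config unchanged.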
Describe the bug
The DSA initcontainer fails in OpenShift on a system with DSA available.
The log shows:
The dmesg output shows:
[82721.495946] user: probe of wq0.2 failed with error -95
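For reference, the pieces of that dmesg line can be pulled apart with a small Python sketch. The regular expression and the function name `parse_probe_failure` are illustrative assumptions. The errno table is deliberately limited to the one value seen here: on Linux, errno 95 is EOPNOTSUPP ("Operation not supported").

```python
import re

# Matches lines of the form "[<timestamp>] <driver>: probe of <device> failed with error <code>".
PROBE_FAILURE = re.compile(
    r"\[\s*(?P<timestamp>[\d.]+)\]\s+(?P<driver>\S+): "
    r"probe of (?P<device>\S+) failed with error (?P<code>-?\d+)"
)

# Linux errno values; only the code from this report is listed.
LINUX_ERRNO_NAMES = {95: "EOPNOTSUPP"}


def parse_probe_failure(line):
    """Return driver, device, and errno details for a probe-failure line, or None."""
    match = PROBE_FAILURE.search(line)
    if match is None:
        return None
    code = abs(int(match.group("code")))
    return {
        "driver": match.group("driver"),
        "device": match.group("device"),
        "errno": code,
        "name": LINUX_ERRNO_NAMES.get(code, "unknown"),
    }


result = parse_probe_failure(
    "[82721.495946] user: probe of wq0.2 failed with error -95"
)
```

Here the failing driver is `user` and the device is `wq0.2`, which suggests the work queue was rejected at bind time rather than missing entirely.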
The system has the Intel IOMMU enabled, and sm_on is also set in the kernel boot parameters.
We are using the default dsa.conf from the repo.
Please let us know if we are missing something.
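A quick way to confirm the IOMMU boot parameters mentioned above is to parse the kernel command line. This is a minimal sketch, assuming the options are passed in the usual `intel_iommu=on,sm_on` form. The function name and the sample command line are hypothetical; on a real node the input would be the contents of /proc/cmdline.

```python
def intel_iommu_options(cmdline_text):
    """Collect the comma-separated options from every intel_iommu= token."""
    prefix = "intel_iommu="
    options = set()
    for token in cmdline_text.split():
        if token.startswith(prefix):
            options.update(opt for opt in token[len(prefix):].split(",") if opt)
    return options


# Hypothetical sample command line, not copied from the affected node.
sample_cmdline = "BOOT_IMAGE=/vmlinuz ro intel_iommu=on,sm_on quiet"

options = intel_iommu_options(sample_cmdline)
scalable_mode_requested = {"on", "sm_on"} <= options
```

This only shows that scalable mode was requested on the command line. Whether the kernel actually enabled it still has to be confirmed from the DMAR messages in dmesg.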
To Reproduce
Create the DSA plugin using the operator UI. The pod is created, but its initcontainer fails.
Expected behavior
The DSA plugin should be in running state