You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is gonna be a constant issue... A --cpu flag seems fine to me...
But it won't fix the issue of the GPU not working of course, we are not gonna work on every single GPU in the world, but if someone opens an PR to support this one, great!
I did deliberately remove support for a lot of older GPUs in the AMD Containerfile to save about 20G in container image size, but if people enable extra ones one by one they would like, no big deal. The problem is if you enable every little one, you get a huge image. Also some GPUs will just prove to be headaches and a lot of effort.
The two ways I could see this fixed was a --cpu flag OR having the RocM items added only when --gpu is a command line option which would match the man pages and other documentation.
If we go with --cpu I would say that --cpu and --gpu conflict as command line options. The documentation is fixed to say that --gpu is only for when the system is not running a container, and --cpu will override that and give only local cpu performance.
Writing the above kind of made me think that adding --cpu was going to make it more complicated than having --gpu checked with the container.
In trying to debug https://bugzilla.redhat.com/show_bug.cgi?id=2329826 I found that the containers for Rocm would not work with at least 2 AMD chipsets:
This fails on
I could not figure out a way to force it to use just the CPU so possibly a
--cpu
flag which tells it not to try speeding things up ?The text was updated successfully, but these errors were encountered: