[Q] Minimum GPU RAM requirements for inference on models? #512
-
What is the minimum GPU RAM I need for inference on each model? Also, I couldn't find any 8-bit inference guides using pretrained weights. Is this possible? What would the RAM requirements be then?
Replies: 1 comment 4 replies
-
I don't think we know the exact amount of GPU RAM required, but you can follow this tutorial for 8-bit inference: https://github.com/mlfoundations/open_clip/blob/main/tutorials/int8_tutorial.ipynb
Concretely, you're going to want to do something like the following (note: I didn't actually check this code, I just copied it from the tutorial, so there may be a typo; please double-check the tutorial if it doesn't work)
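On the RAM question itself: the exact number isn't known, but a rough lower bound for the weights alone is parameter count times bytes per parameter (4 for fp32, 2 for fp16, 1 for int8). Here is a minimal Python sketch of that arithmetic. It is separate from the tutorial code. The function name `weight_memory_gib` and the 1-billion parameter count are made up for illustration, and the estimate ignores activations, the CUDA context, and any layers left unquantized, so real usage will be higher.

```python
# Rough lower bound on the GPU memory needed just to hold model weights.
# Assumption: every weight is stored in the same dtype (real int8 setups
# usually keep some layers in higher precision, so this underestimates).
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gib(num_params: int, dtype: str) -> float:
    """Return the GiB needed to store `num_params` weights in `dtype`."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


if __name__ == "__main__":
    # Hypothetical parameter count, not taken from any specific model.
    num_params = 1_000_000_000
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gib(num_params, dtype):.2f} GiB")
```

If you have a model loaded in PyTorch, you can get the real parameter count with `sum(p.numel() for p in model.parameters())` and plug it in.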