How do you convert a model trained on GPU to be used for inferencing on CPU? #957
Hi, I am trying to use a model I trained on a server that had CUDA GPUs on a CPU-only machine; are there any library functions I can use to pull this off? Thanks in advance.

Comments
You don't need to convert it - it works out of the box.
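For context, with a plain PyTorch checkpoint the usual way to do this is `torch.load` with `map_location`; a minimal sketch, assuming the model was saved as a whole with `torch.save` to a file called `model.pt` (the filename is just an example):

```python
import torch

# Remap all CUDA tensors in the checkpoint to CPU while loading.
model = torch.load("model.pt", map_location=torch.device("cpu"))

# Make sure parameters and buffers are on CPU and switch to eval mode.
model = model.to(torch.device("cpu"))
model.eval()
```

Depending on the flair version, it may also help to set `flair.device = torch.device("cpu")` before loading, since the library keeps a module-level device setting.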
That's what I thought. However, when I specified CPU using this command, I got this error: "RuntimeError: Expected object of backend CUDA but got backend CPU for argument #3 'index'"
Normally it should work out of the box, but there are some embedding types that we import for which this functionality does not work. Notably, that's the embeddings we get from the allennlp library (the ELMoEmbeddings).
Ahh, that makes sense. I am using ELMo embeddings.
Yeah, unfortunately this is a known problem, see #635. Perhaps the fix in that thread will work for you? I hope we can fix this at some point.
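In case it helps, one general-purpose workaround (not necessarily the fix referenced in #635) is to move the model to CPU and re-save it on a machine that still has a GPU; a rough sketch with plain PyTorch, using example file names:

```python
import torch

# On a machine that still has the GPU: load the checkpoint as saved,
# move every registered parameter and buffer to CPU, and re-save it.
model = torch.load("gpu_model.pt")
model = model.to(torch.device("cpu"))
torch.save(model, "cpu_model.pt")

# The new file can then be loaded on a CPU-only box, e.g.:
# model = torch.load("cpu_model.pt", map_location="cpu")
```

Note that `.to()` only moves registered parameters and buffers; tensors stored as plain Python attributes inside an embedding wrapper (as seems to be the case for the ELMo integration) will not be moved, which is why the dedicated fix in #635 may still be needed.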
If the model is torch-based, you can convert it to an ONNX model, which you can then further optimize.
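As a sketch of that route: `torch.onnx.export` traces a `torch.nn.Module` with a dummy input and writes an ONNX graph that can then be run (and optimized) with ONNX Runtime on CPU. The model and input shape below are placeholders; a flair model that consumes `Sentence` objects would need its underlying tensor interface exported instead.

```python
import torch
import torch.nn as nn

# Stand-in for the trained model; substitute your own torch.nn.Module.
model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 2))
model.eval()

# Dummy input matching the shape the model expects at inference time.
dummy_input = torch.randn(1, 128)

torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",                          # exported graph
    input_names=["input"],
    output_names=["output"],
    dynamic_axes={"input": {0: "batch"}},  # allow variable batch size
)
```

The exported file can then be loaded with `onnxruntime.InferenceSession("model.onnx")` for CPU inference.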