
use existing llama.cpp install #9

Open · scalar27 opened this issue Mar 24, 2024 · 7 comments

@scalar27

I've been using llama.cpp for quite a while (M1 Mac). Is there a way I can get ai_voicetalk_local.py to point to that installation instead of reinstalling it here? Sorry, newbie question...

@KoljaB (Owner) commented Mar 25, 2024

Just leave out step 2 of the installation. I think the Coqui engine does not run in real time on a Mac, though.
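
A quick way to check whether an existing setup already provides the bindings (a sketch, assuming the llama-cpp-python package, whose import name is llama_cpp):

```python
# Succeeds only if the Python bindings for llama.cpp are importable
# from the current environment.
import llama_cpp
print(llama_cpp.__version__)
```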

@scalar27 (Author)

I did leave out step 2 but then I get an error when I try to run:

```
ModuleNotFoundError: No module named 'llama_cpp'
```

@KoljaB (Owner) commented Mar 27, 2024

The Python import of llama_cpp fails, which means your environment does not have working Python bindings for your llama.cpp installation.
Please look here for the Mac bindings, probably Metal (MPS).
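
For Apple Silicon, a minimal install sketch for the Metal-accelerated bindings (assuming the llama-cpp-python package; the exact CMake flag has changed between versions, so check the project's README for the one matching your release):

```bash
# Build and install llama-cpp-python with Metal (GPU) acceleration on
# Apple Silicon. Older releases used -DLLAMA_METAL=on; newer ones use
# -DGGML_METAL=on.
CMAKE_ARGS="-DLLAMA_METAL=on" pip install --upgrade --force-reinstall --no-cache-dir llama-cpp-python
```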

@scalar27 (Author)

Thank you. I did get it to work following your comment. Like the other M1 person, I do get stuttering. It's a shame because the voice quality is excellent and the latency is rather short. Hope a future update might solve this for us!

@scalar27 (Author)

I managed to get this working with the Gemma 2 model. However, I am having trouble setting the parameters; it's working, but it doesn't seem optimal. I see them in creation_params.json and also in coqui_engine.py. Would it be possible for LocalAiVoiceChat to use llama.cpp's server endpoint instead? Or would that require a lot of rewriting?
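
For illustration, a minimal sketch of what talking to llama.cpp's built-in server could look like (assuming a server running locally on its default port 8080 and exposing the OpenAI-compatible chat endpoint; the launch command below is a hypothetical example and varies by llama.cpp version):

```python
import requests

# Sketch: query a locally running llama.cpp server via its
# OpenAI-compatible chat completions endpoint. Assumes the server was
# started with something like `./llama-server -m model.gguf --port 8080`
# (binary name and flags vary by version).
response = requests.post(
    "http://localhost:8080/v1/chat/completions",
    json={
        "messages": [{"role": "user", "content": "Hello!"}],
        "temperature": 0.7,  # sampling parameters travel with each request
        "max_tokens": 128,   # instead of living in creation_params.json
    },
    timeout=60,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```

One appeal of this approach is that generation parameters become per-request fields in the HTTP call rather than load-time settings.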

@KoljaB (Owner) commented Jul 17, 2024

I like that idea, I'll have to look into that.

@scalar27 (Author)

Great. It seems like a more standard approach these days. I'd be happy to test whatever. As mentioned above, I'm on an M1 Mac, so this isn't the fastest setup, but it's now working pretty well with no stuttering.
