Fix the incompatibility of ollama and groq json's response and update default model selection #87
The changes included in this commit:
The incompatibility of Ollama and Groq JSON responses
This problem has been mentioned in issue 1, issue 2, issue 3.
It is mainly caused by the function-calling (tools) mode that instructor uses for structured JSON responses, which does not work reliably with Groq's Llama models or Ollama models (this looks like a litellm bug) when running expert search and generating related queries.
To work around this, the instructor response model now uses JSON mode instead of tools whenever the model comes from Groq or Ollama.
In my tests, this change makes structured generation with Groq and Ollama noticeably more stable.
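A minimal sketch of the idea, assuming completions are routed through litellm and patched with instructor (the `get_instructor_client` helper and the `provider` argument are hypothetical names for illustration):

```python
import instructor
import litellm


def get_instructor_client(provider: str):
    """Pick the instructor mode based on the model provider.

    Groq and Ollama models handle plain JSON mode more reliably than
    tool/function calling, so fall back to Mode.JSON for them.
    """
    if provider in ("groq", "ollama"):
        mode = instructor.Mode.JSON
    else:
        mode = instructor.Mode.TOOLS
    return instructor.from_litellm(litellm.completion, mode=mode)
```

The rest of the code can keep calling the patched client with a `response_model` as before; only the mode selection changes.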
Update the default model selection
Third-party OpenAI proxy server
Add support for third-party OpenAI-compatible proxy servers by including the "OPENAI_API_BASE" env variable in the docker-compose file.
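A rough sketch of how the variable would be picked up on the Python side, assuming the OpenAI client is constructed explicitly (the fallback-to-default behavior when the variable is unset is an assumption):

```python
import os

from openai import OpenAI

# If OPENAI_API_BASE is set (e.g. via docker-compose), point the client at the
# third-party OpenAI-compatible proxy; otherwise fall back to the official API.
client = OpenAI(
    api_key=os.environ["OPENAI_API_KEY"],
    base_url=os.environ.get("OPENAI_API_BASE") or None,
)
```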