-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: JSON Decode Error #471
Comments
@AlonsoGuevara |
I initially had problems with the models outputting more than just JSON, un-handy descriptors or irrelevant information, such as "Here is the JSON you requested: etc..". Modified the json scrubbing in llm/openai/_json.py to shrink the string into the first and last brackets found. Likely error-prone on larger datasets and/or models other than what I'm using, but it works well enough for what I need at the moment:
|
I'm hitting this bug.
Update: Should I open another issue? |
I was able to use Claude 3 haiku with my proxy(https://github.com/jaigouk/claude-proxy-api) on k3s with 3 replica and My settings.yaml is llm:
api_key: ${CLAUDE_PROXY_API_KEY}
type: openai_chat
model_supports_json: true
model: "claude-3-haiku-20240307"
api_base: "http://192.168.8.213:30012/v1"
# max_tokens: 10000 # Adjusted based on Claude 3 Haiku's typical context window
request_timeout: 30
tokens_per_minute: 100000
requests_per_minute: 1000
max_retry_wait: 5
temperature: 0.1
embeddings:
async_mode: threaded
llm:
api_key: ${EMBEDDING_API_KEY}
type: openai_embedding
model: "BAAI/bge-m3"
api_base: "http://localhost:7997" I am using https://github.com/michaelfeil/infinity for embeddings. with 8k text tokens, the indexing time takes about 1 min. without |
Consolidating alternate model issues here: #657 |
Describe the bug
When I try to search using global query, it report json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0).
And the response missing a paragraph which is not generated with JSON formatted.
Steps to reproduce
using gemma 9b as llm model as confined as the following.
Expected Behavior
It should be able to query successfully and not miss any content which is not formatted by JSON.
GraphRAG Config Used
Logs and screenshots
Additional Information
The text was updated successfully, but these errors were encountered: