Support model providers other than OpenAI and Azure #657
@Mxk-1 has found chunking settings that help resolve issues with create_base_entity_graph when using Ollama.

@MortalHappiness has shared a monkey patch script along with a working Ollama configuration:

> I collected several comments scattered across different issues and created a monkey patch script along with a working setting for Ollama. It has been tested on version 0.3.2 and works properly. I'm sharing it for those who might need it: https://gist.github.com/MortalHappiness/7030bbe96c4bece8a07ea9057ba18b86. I'm not sure if it's appropriate to comment here, so if the reviewers think it's not, I'll delete this comment and post it in a more suitable place. Thank you in advance!
Right now GraphRAG only natively supports models hosted by OpenAI and Azure. Many users would like to run additional models, including alternate APIs, SLMs, or models running locally. As a research team with limited bandwidth, we are unlikely to add native support for more model providers in the near future. Our focus is on memory structures and algorithms to improve LLM information retrieval, and we've got a lot of experiments in the queue!
There are alternative options to achieve extensibility, and many GraphRAG users have had luck extending the library. So far we've seen this most commonly with Ollama, which runs on localhost and supports a very wide variety of models. This approach depends on Ollama supporting the standard OpenAI API for chat completion and embeddings so it can proxy our API calls, and it looks like this is working for a lot of folks (though it may require some hacking).
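For reference, a minimal sketch of what "proxying our API calls" relies on is shown below: the standard `openai` Python client pointed at Ollama's OpenAI-compatible endpoint (`http://localhost:11434/v1`). The model names (`llama3`, `nomic-embed-text`) are placeholders for whatever you have pulled locally; this is not GraphRAG's own configuration, just a way to sanity-check that the local endpoint speaks the OpenAI API.

```python
# Minimal sketch: exercising Ollama's OpenAI-compatible endpoints with the
# standard `openai` client. Model names are examples -- substitute whatever
# you have pulled locally with `ollama pull`.
from openai import OpenAI

# Ollama serves an OpenAI-compatible API on localhost:11434 under /v1.
# An api_key is required by the client but ignored by Ollama.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

# Chat completion -- the kind of call made against an OpenAI-style chat endpoint.
chat = client.chat.completions.create(
    model="llama3",  # example local chat model
    messages=[{"role": "user", "content": "Summarize: graphs help retrieval."}],
)
print(chat.choices[0].message.content)

# Embeddings -- the kind of call used to populate a vector store during indexing.
emb = client.embeddings.create(
    model="nomic-embed-text",  # example local embedding model
    input=["graphs help retrieval"],
)
print(len(emb.data[0].embedding))
```

If both calls succeed against your local Ollama instance, pointing GraphRAG's configured API base at the same URL is the usual next step; see the linked issues below for the settings other users have shared.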
Please note: while we are excited to see GraphRAG used with more models, our team will not have time to help diagnose issues. We'll do our best to route bug reports to existing conversations that might be helpful. For the most part, you should expect that if you file a bug related to running an alternate solution, we'll link to this issue and to a relevant conversation if we're aware of one, and then close the bug.
Here is a general discussion regarding OSS LLMs: #321.
And a couple of popular Ollama-related issues: #339 and #345. We'll link to others in the comments when relevant.
Have a look at issues tagged with the community_support label as well.