Welcome to RAGHub, a living collection of new and emerging frameworks, projects, and resources in the Retrieval-Augmented Generation (RAG) ecosystem. This is a community-driven project for r/RAG, where we aim to catalog the rapid growth of RAG tools and projects that are pushing the boundaries of the field.
Each day, it feels like a new tool or framework emerges, and choosing the right one is becoming more of an art than a science. Is the framework from three months ago still relevant? Or was it just hype, rehashing old concepts with a fresh look? RAGHub exists to help you stay ahead of these changes, providing a platform for the latest innovations in RAG.
This is a community project, and we welcome contributions from everyone! If you’d like to add a new framework, project, or resource, please check out our Contribution Guidelines for details on how to get started.
Name | Description | Website | Github | Stars | Activity |
---|---|---|---|---|---|
LangChain | A framework for building applications with LLMs | Website | Github | 93.2k | 9h ago |
Haystack | A framework for building search engines using neural networks | Website | Github | 17k | Last week |
LlamaIndex | A framework for building data-driven LLM applications | Website | Github | 35.9k | 7h ago |
BentoML | Build Inference APIs, LLM apps, Multi-model chains, RAG service | Website | Github | 7k | 1h ago |
LightRAG | Simple and fast Retrieval-Augmented Generation | Website | Github | 268 | 1d ago |
Swarm by OpenAI | Educational framework for lightweight multi-agent orchestration | - | Github | 7k | 1d ago |
Langroid | Python framework to easily build LLM-powered applications | Website | Github | 2.4k | 10h ago |
NeMo-Guardrails | Toolkit for adding programmable guardrails to LLM-based applications | Website | Github | 4k | Last week |
Swiftide | A Rust library for building fast, streaming applications with LLMs | Website | Github | 222 | 1h ago |
Korvus | The entire RAG pipeline in a single database query | Website | Github | 1.3k | Last month |
semantic-router | A framework for routing LLM requests using semantic vectors. | Website | Github | 2k | 4h ago |
AWS Bedrock Knowledge Bases | Service to build, scale, and deploy RAG-powered applications | Website | - | - | 1h ago |
langflow | A framework to build, scale, and deploy RAG and multi-agent AI apps. | Website | Github | 31.4k | 1h ago |
dspy | A framework to build language model apps with modular programming. | Website | Github | 17.8k | 13h ago |
mem0 | The Memory layer for your AI apps. | Website | Github | 22.6k | 2h ago |
RAGLite | A Python package for building RAG applications | Website | Github | 542 | 18h ago |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
Trulens | Measures and enhance LLM app quality with feedback functions for scalable evaluation | Website | Github | 2.1k | 11h ago |
Phoenix | AI observability platform designed for experimentation, evaluation, and troubleshooting | Website | Github | 3.6k | 1d ago |
ragas | Evaluates and quantifies the performance of RAG pipelines that enhance LLM context with external data | Website | Github | 6.8k | 3h ago |
Deepchecks | Continuous validation of AI & ML models, detecting data drift and model issues | Website | Github | 3.6k | 8m ago |
AutoRAG | End-to-end RAG optimization: parsing, chunking, evaluation dataset creation, and pipeline deployment | Website | Github | 1.6k | 1h ago |
evalmy.ai | Fine tuned lightweight RAG evaluation service + python client lib | Website | Github | -- | -- |
TextGrad | A Framework for LLM-based text optimization,focusing on reducing hallucinations, and improving prompt. | Website | Github | 1.7k | 24h ago |
langfuse | Traces, evals, prompt management and metrics to debug and improve your LLM application. | Website | Github | 6.9k | 1h ago |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
TrustGraph | LLM Agnostic Agent Development Platform | Website | Github | 120 | 2d ago |
R2R | The Elasticsearch for RAG, helps you quickly build and launch scalable RAG solutions | Website | Github | 3.4k | 6h ago |
RAGFlow | Open-source RAG engine based on deep document understanding | Website | Github | 18.7k | 1h ago |
Vertex AI Knowledge Engine | A data framework for context-augmented LLM applications | Website | - | - | 1d ago |
Embedchain | Open Source Framework for personalizing LLM responses under 10 lines of code | Website | Github | 22.2k | Last week |
txtai | All-in-one embeddings database for semantic search, LLM orchestration, and RAG workflows | Website | Github | 8.8k | Last week |
dsRAG | High-performance retrieval engine for unstructured data | - | Github | 815 | Last week |
Flash-Rank | Use Pairwise or Listwise rerankers to improve search accuracy before passing to LLMs. | Github | 606 | 2w ago | |
Graphlit | API-first platform for building knowledge-driven AI applications and agents | Website | Github | 16 | 8h ago |
rag-citation | Combines RAG with automatic citation generation to enhance content credibility | Website | Github | 6 | Last week |
PostgresML | Postgres + GPUs with functions for chunking, embedding, transforming and ranking | Website | Github | 6k | Yesterday |
chainlit | Build production-ready Conversational AI applications in minutes, not weeks | Website | Github | 7k | 24 h ago |
pathway | Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG. | Website | Github | 4.3k | 7 h ago |
cognita | RAG framework for modular, open-source production apps. | Website | Github | 3.3k | 2 days ago |
FlashRAG | A Python Toolkit for Efficient RAG Research | - | Github | 1.3k | 3 hour ago |
RAGatouille | Easily train and use advanced retrieval methods in any RAG pipeline. | - | Github | 3.1k | 4 months ago |
pgai | A suite of tools to develop RAG, semantic search, and other AI applications just in PostgreSQL | Website | Github | 2.6k | 10 hours ago |
Name | Description | Website | GitHub | Stars | Activity |
---|---|---|---|---|---|
LlamaParse | GenAI-native document parsing platform | Website | Github | 2.8k | 2d ago |
Langchain-extract | Web server to extract information from text and files using LLMs | Website | Github | 1k | 4m ago |
Needle | Production ready RAG pipelines out of the box. | Website | Github | 15 | 1h ago |
Unstructured.io | build custom preprocessing pipelines for labeling, training, or production ml | Website | Github | 8.7k | 3d ago |
Verba | RAG chatbot powered by Weaviate | Website | Github | 6.1k | 2w ago |
Unstract | No-code Platform to launch APIs and ETL Pipelines to structure unstructured documents | Website | Github | 2.3k | 4h ago |
Humata.ai | Ask questions across all of your document files | Website | 4h ago | ||
Ragie.ai | Fully managed RAG-as-a-Service for developers. | Website | Github | 12 | 12h ago |
Reducto | Parses complex documents and creates LLM-ready inputs | Website | Github | 16 | 2w ago |
Midship | Extract document data straight into your spreadsheet/erp/crm | Website | Github | - | - |
DocuPanda | Convert documents into a structured, standard set of fields and values | Website | - | - | - |
contextual-doc-retrieval-opneai-reranker | Using GPT-4 and Cohere for query expansion and re-ranking with BM25 | GitHub | 20 | Last week | |
Raggenie | Low-code platform to build custom RAG-based AI applications | Website | Github | 60 | 10h ago |
Chunkr | Vision model based PDF chunking and OCR, optimized for fast processing of large datasets | Website | Github | 651 | 11h ago |
tldw | Open-Source project similar to NotebookLM | Website | Github | 267 | Yesterday |
Cerbos | Access control for RAG and LLMs. | Website | GitHub | 3200 | 14h ago |
extractous | Extremely fast data extraction for your AI applications | Website | Github | - | - |
SWIRL | AI Search & RAG for your workplace. Get AI insights from your company's knowledge instantly. | Website | Github | 2.1K | 2w ago |
Site/Article | Description | Link |
---|---|---|
Contextual Retrieval | Anthropic introducing Contextual Retrieval | Website |
Open-RAG | Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Website |
ColPali | Efficient Document Retrieval with Vision Language Models | Website |
RAG_Techniques | Showcases various advanced techniques for RAG systems | Website |
GenAI_Agents | Tutorials and implementations for various AI Agent techniques | Website |
Name | Description | Link |
---|---|---|
Artificial Analysis | LLM Comparison | Website |
HuggingFace/mteb | Embedding models leaderboard | Website |
If you're looking for mainstream RAG frameworks and techniques**, check out the excellent repository by Nir Diamant: RAG Techniques. This repository focuses on more established tools and methods that have already gained traction in the community.
This project is licensed under the MIT License. See the LICENSE file for details.
This project is part of the r/RAG community. Have feedback or suggestions? Feel free to open an issue, start a discussion, or join the conversation on our Discord server! We want to make this repository a valuable resource for everyone exploring the RAG ecosystem, and your input is crucial.