A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
-
Updated
Dec 19, 2024 - Python
A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
🍶 llm-distillery ⇢ use LLMs to run map-reduce summarization tasks on large documents until a target token size is met.
Semantic Chunking is a Python library for segmenting text into meaningful chunks using embeddings from Sentence Transformers.
Retrieval-Augmented Generation (RAG) Fundamentals and Semantic Chunking
Add a description, image, and links to the semantic-chunking topic page so that developers can more easily learn about it.
To associate your repository with the semantic-chunking topic, visit your repo's landing page and select "manage topics."