Skip to content

peluche/RAG-and-tatters

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 

Repository files navigation

RAG and tatters

Threadbare implementation of a RAG toolkit:

  • document chunking
    • fixed length
    • Recursive Character Split (RCS)
    • document specific splitting
    • semantic splitting
  • document retrieval (exhaustive search for a one-liner "vector database")
  • Approximate Nearest Neighbor (ANN) with Navigable Small World (NSW) for K-Nearest Neighbor (KNN).

Building NSW

build_nsw

KNN

knn

About

RAG, Document Chunking and Navigable Small Worlds

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published