The PDF Awakens: Stream, Chunk, and Converse

This is a fun little project that I built primarily to get familiar with various RAG strategies. The current iteration of the project can:

- Handle large PDFs (out-of-the-box PDF chunking)
- Stream responses to the terminal in real time
- Hold conversations based on chat history
- Support local LLMs
- Support OpenAI (BYOL)
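To give a feel for what chunking a large PDF involves, here is a minimal sketch of fixed-size chunking with overlap over already-extracted text. This is not necessarily how the project does it: the function name `chunk_text` and the `chunk_size` and `overlap` defaults are assumptions made for illustration.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character windows that overlap slightly.

    Hypothetical helper for illustration; the sizes are arbitrary defaults.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in the range [0, chunk_size)")

    step = chunk_size - overlap  # how far each new window advances
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # this window already reached the end of the text
    return chunks


if __name__ == "__main__":
    for piece in chunk_text("abcdefghij", chunk_size=4, overlap=1):
        print(piece)
```

The overlap means a sentence cut at a chunk boundary still appears whole in at least one neighbouring chunk, which tends to help retrieval.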

The main motivation was to keep this RAG as decoupled as possible from Langchain or similar LLM frameworks. This project will keep receiving upgrades, both logical and architectural, for the foreseeable future, or until it's probably over-engineered.
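One way to picture a framework-free pipeline is as a handful of plain functions: one that builds a prompt from retrieved chunks and chat history, and one that streams tokens to the terminal as they arrive. The sketch below assumes that shape. The names `build_prompt`, `stream_to_terminal` and `fake_llm` are hypothetical, and `fake_llm` is a stand-in for a real model client, not the project's code.

```python
import sys
from typing import Iterable, Iterator


def build_prompt(history: list[tuple[str, str]], context: list[str], question: str) -> str:
    """Assemble one prompt from retrieved chunks, prior turns, and the new question."""
    lines = ["Context:"]
    lines += [f"- {chunk}" for chunk in context]
    for user_msg, assistant_msg in history:
        lines.append(f"User: {user_msg}")
        lines.append(f"Assistant: {assistant_msg}")
    lines.append(f"User: {question}")
    lines.append("Assistant:")
    return "\n".join(lines)


def stream_to_terminal(tokens: Iterable[str], out=sys.stdout) -> str:
    """Write each token as soon as it arrives and return the full reply."""
    parts = []
    for token in tokens:
        out.write(token)
        out.flush()  # show the token immediately instead of buffering
        parts.append(token)
    out.write("\n")
    return "".join(parts)


def fake_llm(prompt: str) -> Iterator[str]:
    """Hypothetical stand-in for a model client; yields a canned reply in pieces."""
    for word in ["Attention ", "is ", "all ", "you ", "need."]:
        yield word


if __name__ == "__main__":
    history: list[tuple[str, str]] = []
    question = "What is the paper's title?"
    prompt = build_prompt(history, ["chunk one"], question)
    reply = stream_to_terminal(fake_llm(prompt))
    history.append((question, reply))  # keep the turn for the next prompt
```

Because each stage is an ordinary function, swapping the model client or the retrieval step does not touch the rest of the loop.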

You can simply clone the repo, spin up the Docker services, and run the following command:

python main.py docs/NIPS-2017-attention-is-all-you-need-Paper.pdf

This should get it up and running.

If you're more of a community person, you probably have your own LLM. In that case, head over to docker-compose.yml, uncomment the ollama services, and make sure to check the configurations.py file.

Babayaga-mp4/RAGing-Assistant
