DocTalk

DocTalk is a project I'm working on to try to build my own LLM document chat.

I'm not 100% sure what I'm doing, but it's been great so far.

See Update Notes for changes I am making after the initial commit to this project.

Feel free to play around and please, for the love of science, give me some feedback.

I legit have no idea if this is going to be useful or anything, but it's certainly teaching me Python, and renewing my interest in snake_case variables.

Major Update July 9th, 2023:

I pretty much gutted the project and moved a bunch of things around. I implemented a different architecture, with the runners and whatnot.

Some day soon I will fill the rest of this documentation in!

Basic Usage (python developers)

1. To create the Python env and install the requirements, run: install.ps1
2. Set your OPENAI_API_KEY environment variable if you are going to use OpenAI's API. See .env.template for guidance.
3. Load your documents using ingest_documents.py
   - Options for running the document loader include:
     - --document_directory: Directory from which to load documents
     - --database_name: The name of the database where you'd like to store the loaded documents
     - --run_open_ai: When set, this will force the use of the OpenAI LLM and embeddings. Make sure you set your API key.
     - --split_documents: If this is present, the loader will split loaded documents into smaller chunks
     - --split_chunks: The size of each chunk
     - --split_overlap: How much consecutive chunks should overlap
4. Select a configuration file from the configurations folder, or create your own
   - Currently there are a few supported AIs and runners; check run.py for the supported types.
5. Once you've loaded your documents and selected a configuration file, run: run.py --config=<path to config file>
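
If the --split_chunks and --split_overlap options are hard to picture, here is a minimal sketch of what a chunk size and an overlap mean for a piece of text. This is NOT the code in ingest_documents.py: the function name split_text is made up for illustration, and it assumes sizes are counted in characters, which the loader may or may not do.

```python
def split_text(text: str, chunk_size: int, overlap: int) -> list[str]:
    """Split text into chunks of at most chunk_size characters.

    Each chunk starts `overlap` characters before the previous chunk ended,
    so neighbouring chunks share that many characters.
    Hypothetical helper for illustration; not part of DocTalk.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
    return chunks


print(split_text("abcdefghij", chunk_size=4, overlap=1))
# ['abcd', 'defg', 'ghij']
```

A bigger overlap means more repeated text across chunks (and more chunks to store), but less chance that a sentence gets cut off from the context it needs.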

Usage (non-developers)

Coming Soon

Random notes

The following is mostly copy/paste stuff I use (or used) frequently

Creating env

python -m venv doctalk_venv

Fixing pip issues: upgrading pip, clearing cache, reinstalling dependencies

python.exe -m pip install --upgrade pip
pip cache purge
pip --no-cache-dir install -r requirements.txt

Why isn't my llama-cpp working on my GPU?

Probably because you ran the /requirements.txt install before getting here. Make sure to set these environment variables before installing llama-cpp next time.

$env:CMAKE_ARGS="-DLLAMA_CUBLAS=on"
$env:FORCE_CMAKE=1
$env:LLAMA_CUBLAS=1

And this next one is for when you have to force a re-install of llama-cpp because you left the instructions for the GPU below the /requirements.txt install 🙄

pip install --no-cache-dir --force-reinstall llama-cpp-python
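
If you'd rather not depend on PowerShell for this, the same forced re-install can be sketched in Python. The environment variables and pip flags are the ones listed above; the function names (build_env, build_reinstall_command, reinstall_llama_cpp) are made up for illustration and are not part of this project. I'm assuming you run it with the interpreter from the activated venv.

```python
import os
import subprocess
import sys

# Build flags copied from the PowerShell snippet above.
GPU_BUILD_ENV = {
    "CMAKE_ARGS": "-DLLAMA_CUBLAS=on",
    "FORCE_CMAKE": "1",
    "LLAMA_CUBLAS": "1",
}


def build_env() -> dict[str, str]:
    """Return a copy of the current environment with the GPU build flags added."""
    env = dict(os.environ)
    env.update(GPU_BUILD_ENV)
    return env


def build_reinstall_command() -> list[str]:
    """Return the pip command from above, run through the current interpreter."""
    return [
        sys.executable, "-m", "pip", "install",
        "--no-cache-dir", "--force-reinstall", "llama-cpp-python",
    ]


def reinstall_llama_cpp() -> None:
    """Run the forced re-install; raises CalledProcessError if pip fails."""
    subprocess.run(build_reinstall_command(), env=build_env(), check=True)
```

Calling reinstall_llama_cpp() kicks off the actual build, which can take a while. The variables are only passed to the pip subprocess, so your shell's environment is left alone.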

Random CUDA Memory Error

Sometimes a random CUDA memory error will show up. When it does, set this environment variable:

$env:GGML_CUDA_NO_PINNED=1

TODO List

LangChain-related (although I could do these manually if I want to spend the time learning it??):
- Add tool to allow the LLM to Google search and provide answers (Google sign-in)
- Add tool to allow the LLM to dynamically retrieve individual documents, vs. pre-processing a folder (e.g. from a website, or local folder)
  - repurpose scrape_pdfs.py
Probably other things
Documentation?? lol

Resources to look at

Question answering using embeddings
Open LLM Leaderboard
Best source for models: TheBloke on HuggingFace
LangChain Dev Blog

Update Notes

6/15/2023
- Started to rework the project to separate the local and hosted (OpenAI) LLM stuff. There are different prompting techniques, and other stuff that I want to play with when it comes to local vs. hosted LLMs.
- Renamed run_llm.py to run_local_llm.py
- Added run_chain.py
- Updated some other random stuff
6/20/2023
- Updated splitting in document_loader.py so that it splits on newlines before hitting the character max.
- Added install.ps1
- Added support for top_k in non-local LLMs
6/21/2023
- Added command line support for run_chain.py and document_loader.py
- Removed old unused code
- Collapsed the local and remote LLM access (using langchain) into one file run_chain.py
6/23/2023
- Added multi-document store querying capabilities using run_react_agent.py
- Loading user defined tools using tool_loader.py
- Added an example tool configuration for my work-related stuff, medical_device_config.json
6/24/2023
- Updated ReAct agent to support self-ask, and call tools in a dynamic way: run_react_agent.py
7/9/2023
- Major refactor and reorganization
- Removed a bunch of unused old stuff
- Implemented selection of AI (QA chain for now) and runners
- Simplified document ingestion and running
- Added better support for API
- Started on getting Docker into the solution

aronweiler/DocTalk
