Revamp Chatbot Prototype #7

sqr00t · 2024-05-09T15:58:19Z

Description

Revamps the NIBE prototype to use the latest Langchain. Also separates responsibilities:

asf_hp_installer_chatbot: experimentation with data-heavy pipelines, with associated utils
rag: everything related to end-to-end RAG with Langchain, including document processing pipelines
api: uses Langserve, a FastAPI wrapper. This enables more ways to interact with the chains throughout dev, test, and prod.

Note: the WhatsApp interface hasn't been implemented yet, and this PR will be updated once it is. The aim of this PR is first to revamp the prototype to use the latest Langchain LCEL framework.

Addresses #6 , branch is 2_revamp

Instructions for Reviewer

Contact me for Langfuse integration details (env vars needed in your `.env` file)

Checkout branch 2_revamp
Run make pip-install if you already have the project conda environment set up
Run the integration test script, which mimics the prototype test script:

python rag/tests/test_retrieval.py

To test the API:

Have poetry installed, feel free to ask me how best to do this
conda deactivate, then cd api
poetry env use 3.10, then poetry install
langchain app serve --app app.server:app
Navigate to 0.0.0.0:8000
Test out the /invoke endpoint by replacing "string" in the JSON request body ("input": "string") with a query.
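
The /invoke step can also be exercised from a script instead of the docs page. This is a minimal sketch, assuming the chain is mounted at the server root (so the endpoint is /invoke), that the server is reachable at localhost:8000, and that the chain accepts a plain string as input. The helper names `build_invoke_request` and `invoke` are hypothetical and not part of this PR.

```python
import json
import urllib.request


def build_invoke_request(
    query: str, base_url: str = "http://localhost:8000"
) -> urllib.request.Request:
    """Build a POST request for the /invoke endpoint (assumed root mount)."""
    # The chain input goes under the "input" key of the JSON body.
    payload = json.dumps({"input": query}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/invoke",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def invoke(query: str, base_url: str = "http://localhost:8000"):
    """Send the query to a running server and return the chain output."""
    request = build_invoke_request(query, base_url)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # The chain result is expected under the "output" key of the response.
    return body["output"]
```

With the server running, calling `invoke("your question here")` should return the chain's answer. If the chain is mounted under a path prefix, the URL would need that prefix before /invoke.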

Please pay special attention to and other notes

Setting up, feel free to ask me questions
I'm still rethinking and implementing the WhatsApp interface and the messaging system. It will be best not to serialise the files but to log conversations to a message queuing system, e.g. Redis, and then periodically send incremental batches to an S3 parquet file.
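
The queue-then-batch idea can be sketched as follows. This is only an illustration under stated assumptions: an in-memory deque stands in for Redis, and the `write_batch` callable stands in for whatever would append rows to the Parquet file on S3. The class name, the message fields, and the batch size are all hypothetical.

```python
from collections import deque
from typing import Callable, Deque, Dict, List

Message = Dict[str, str]


class ConversationLog:
    """In-memory stand-in for a Redis-backed conversation queue."""

    def __init__(
        self, write_batch: Callable[[List[Message]], None], batch_size: int = 100
    ) -> None:
        self._queue: Deque[Message] = deque()
        # Hypothetical sink, e.g. a function that appends rows to Parquet on S3.
        self._write_batch = write_batch
        self._batch_size = batch_size

    def log(self, session_id: str, role: str, text: str) -> None:
        """Append one message to the queue (a Redis push in the real design)."""
        self._queue.append({"session_id": session_id, "role": role, "text": text})

    def flush(self) -> int:
        """Drain up to batch_size messages, pass them to the sink, return the count."""
        batch: List[Message] = []
        while self._queue and len(batch) < self._batch_size:
            batch.append(self._queue.popleft())
        if batch:
            self._write_batch(batch)
        return len(batch)
```

A scheduled job would call `flush()` periodically, so each call produces one incremental batch. A production version would also need to decide what happens to a batch when the write fails.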

Checklist:

Other additional features to review:

Add CLI for running ingestion and reindexing of docs into vector database
Add CLI command for saving vector database snapshots to local and desired S3 path
Add CLI command for recreating an index from snapshot saved locally or in S3 path

Review instructions: after make install or make pip-install, test out the CLI commands with chatbot --help, which shows descriptions of the other available commands.
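
For orientation, the three CLI features listed above could map onto subcommands along these lines. This is a sketch only: the commit history suggests the project's CLI is built on click, whereas this uses argparse from the standard library as a stand-in, and every subcommand and option name here (`ingest`, `snapshot`, `restore`, `--docs-dir`, `--dest`, `--source`) is hypothetical. Run chatbot --help for the real commands.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with one hypothetical subcommand per CLI feature."""
    parser = argparse.ArgumentParser(
        prog="chatbot", description="Vector database maintenance commands."
    )
    commands = parser.add_subparsers(dest="command", required=True)

    # Ingest documents and reindex them into the vector database.
    ingest = commands.add_parser("ingest", help="Ingest and reindex documents.")
    ingest.add_argument("--docs-dir", default="docs", help="Directory of source documents.")

    # Save a snapshot to a local path or an S3 path.
    snapshot = commands.add_parser("snapshot", help="Save a vector database snapshot.")
    snapshot.add_argument("--dest", required=True, help="Local path or s3:// URI.")

    # Recreate an index from a snapshot stored locally or in S3.
    restore = commands.add_parser("restore", help="Recreate an index from a snapshot.")
    restore.add_argument("--source", required=True, help="Local path or s3:// URI.")

    return parser
```

Parsing `["snapshot", "--dest", "s3://bucket/snap"]` with this parser yields a namespace whose `command` is `snapshot` and whose `dest` is the given URI.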

helloaidank · 2024-08-01T15:42:45Z

rag/chains.py

+
+from rag.utils.history import get_session_history
+from langchain_core.runnables.history import RunnableWithMessageHistory
+


Suggested change

# Define a basic RAG (Retrieval-Augmented Generation) chain

# This chain consists of a prompt template, a language model, and an output parser

helloaidank · 2024-08-01T15:43:25Z

rag/chains.py

+from langchain_core.runnables.history import RunnableWithMessageHistory
+
+rag_chain = chatbot_template | openai_llm | StrOutputParser()
+


Suggested change

# Define a RAG chain that includes historical context from previous interactions

helloaidank · 2024-08-01T16:09:00Z

Checklist:

I have refactored my code out from notebooks/ N/A

I have checked the code runs

I have tested the code

I have run pre-commit and addressed any issues not automatically fixed

I have merged any new changes from dev

I have documented the code

Major functions have docstrings

Appropriate information has been added to READMEs

I have explained this PR above

I have requested a code review

Hi Solomon!

Thanks for the really comprehensive work. I am loving the configurability of the chatbot and think it gives us a lot of flexibility in what we are able to do. The change from Flask to FastAPI makes sense, especially given the asynchronous nature and potential of chatbots, and I also love the FastAPI documentation that comes with it.

I was eventually able to get the test script running and to run the API with a local Qdrant vector database, so that's good. I ran into a couple of bugs, which I've commented on and we've discussed (there have been quite a few changes since this PR was put out, so that's unsurprising). It would be great to know the steps for deploying the chatbot on the EC2 instance; this is potentially something we can discuss in the coming weeks. I think it might be useful for @Jack-Vines to review the API part of this PR, as @crispy-wonton and I do not have much experience with these packages or how they are set up.

As a more general comment, I think we are missing some documentation at the top of each file: a clear summary that is readable for those with little or no experience of web applications or RAG pipelines. I also think some visual representations would really help us understand the functionality of the different files and the overall architecture. I am happy to help out with this!

I think, as a whole, we also need documentation on how to set everything up, from poetry to Langfuse and, of course, running an EC2 instance. It doesn't seem trivial, at least to a beginner like myself, so I think this would be super helpful.

Thanks again for the stellar work!

sqr00t linked an issue May 9, 2024 that may be closed by this pull request

Revamp prototype #6

Open

7 tasks

sqr00t changed the title ~~06 revamp~~ Revamp Chatbot Prototype May 9, 2024

sqr00t changed the title ~~Revamp Chatbot Prototype~~ Revamp Chatbot Prototype #6 May 9, 2024

sqr00t changed the title ~~Revamp Chatbot Prototype #6~~ Revamp Chatbot Prototype May 9, 2024

sqr00t self-assigned this May 9, 2024

sqr00t requested a review from helloaidank May 9, 2024 16:05

sqr00t force-pushed the 02_revamp branch 2 times, most recently from 7cfe1c5 to 634e5ca Compare May 10, 2024 13:07

sqr00t force-pushed the 02_revamp branch 3 times, most recently from 1caf9e2 to 565645a Compare June 11, 2024 14:46

helloaidank and others added 19 commits June 11, 2024 16:01

small changes

bb73994

modularise code for the first prototype of the chatbot

ccef743

keep a directory and change doc string in chatbot script

ca3ae7f

change in a few doc strings of testing script

3db18fa

changes suggested by Roisin

83f6dbf

suggested changes around configuration and re-jigging functions

4071632

small change to yaml file

80f8eb8

change docstring

284a2cd

deps: update langchain dependencies

8aea180

build: bump to python3.10, use nodefaults conda channel

3700dae

build(pre-commit): add nbstripout

ef73884

feat(revamp): add utils and scripts using new langchain LCEL API

184068d

docs(LCEL example): add example notebook on using LCEL with hp_installer_chatbot chains

14c541b

docs(LCEL example): add chain dag images

c2b2791

deps(api): prevent setuptools from installing api directory

d197277

test(revamp): add retrieval integration test that mirrors prototype test

39e3ed9

feat(api): add poetry project structure, scripts for API app, dependencies, and docs

d7ee722

build(envs): improve env vars loading, add env template

add4c95

feat(hosting): add ngrok tunnel capability

2915992

helloaidank reviewed Aug 1, 2024

helloaidank and others added 27 commits August 29, 2024 17:02

add new test script

1591482

adding reengineered prompt with new data sources

eb714fa

adding in some changes with regards to testing script ragas and integrating into langfuse

f4cdf61

merge conflicts

73eea6a

small change to test_retrieval

adaac92

including tej materials into the chatbot prompt

2d0a347

deps: remove irrelevant depedencies

2290f0f

refactor(vdb): add options to vdb init, change parallel loading to ProcessPoolExecutor

252c2be

refactor(embeddings): update embeddings model from ada-002 to 3-small

5ac9e5a

refactor(llm): update default model from 4o to 4o-mini

9a770ff

docs(chains): add docstrings to rag_with_source chains

32a35d0

WIP: comment out format_source_docs to disable formatting retrieved source documents

708f4fb

WIP: add a Langchain Callback for running Evaluations

e442123

refactor(retrievers): enable passing kwargs to retriever

3fc5656

test: update retrieval test to accept kwargs

9d4fff8

feat(cli): add test_query command

db9ba5b

deps(api): update dependencies, using pydantic>=2

919e32c

refactor(chains): update api server and whatsapp client to use function generated chains

418da78

refactor(api events): disable ngrok tunnelling by default using env vars

e2125bd

feat(evaluation): add endpoint for triggering evaluation scoring pipeline for traces

0170b3d

build(Dockerfile): update build script for best practices

9bb23a1

deps: fix uvicorn version conflict

8d63be5

fix(cli): pass click.context to commands

040010b

test(retrieval): update retrieval mmr kwargs

3663b54

WIP: update chatbot system prompt

4be696b

fix(evaluation pipeline): fix ragas version to 0.1.16 to fix context_utilization metric error

2314fe8

fix(evaluation pipeline): add retreived_contexts, remove context assertion test

b9769ea
