Cilium Docs Langchain

A huge drawback of LLMs is their outdated knowledge. This is because they are trained on a static dataset. This is a problem for Cilium because it is constantly being updated. But there are some ways to overcome this problem. This repository is an attempt to use LLMs like OpenAI's GPT-3 and Pinecone vector database for Cilium documentation.

How to use

This is more of a proof-of-concept than a fully functional application. Thus it only consists of a jupyter notebook. The notebook will guide you through the process.

Note: For the proof of concept I'm only scraping the docs for Cilium v1.13.x.

How it works

Scrape the docs
Tokenize the scraped docs
Create OpenAI embeddings
Create Pinecone index
Load embeddings into Pinecone index
Now you can query GPT with augmented queries

All of this is mostly based on https://github.com/pinecone-io/examples/blob/master/generation/gpt4-retrieval-augmentation/gpt-4-langchain-docs.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
notebook.ipynb		notebook.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cilium Docs Langchain

How to use

How it works

About

Languages

darox/cilium-docs-langchain

Folders and files

Latest commit

History

Repository files navigation

Cilium Docs Langchain

How to use

How it works

About

Topics

Resources

Stars

Watchers

Forks

Languages