RAG-Bot for News

Idea

This app is built on the LangChain framework, which provides wrappers for Facebook AI Similarity Search (FAISS) and Hugging Face's Transformers library. To give the model up-to-date information, the query is used to retrieve matching documents from a vector store. The best-matching documents are then passed to the model as additional context so that it can produce a correct answer to the query. The vector store is built on Universal AnglE Embedding, as it was one of the most successful models on the MTEB leaderboard.

Producing the output requires a model that can handle large context lengths and still generate high-quality text. We therefore use a quantized version of Llama 2 with seven billion parameters, ct2fast-Llama-2-7b-hf. Quantization is a technique that reduces the precision of the floating-point representation to speed up inference and reduce overall model size. Furthermore, this implementation also benefits from the speed of C++ and can be used with hardware acceleration.
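The retrieval step described above can be illustrated with a small, self-contained sketch. It is not the app's actual code: the bag-of-words `embed` function is a toy stand-in for the AnglE embedding model, the sorted list replaces the FAISS index, and the documents, function names, and prompt layout are all assumptions made for this example.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding'; a stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(
        documents,
        key=lambda doc: cosine(query_vec, embed(doc)),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query: str, context_docs: list[str]) -> str:
    """Prepend the retrieved documents to the question as context."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


# Hypothetical news snippets, invented for this example only.
documents = [
    "The city council approved the new bicycle lanes on Monday.",
    "Heavy rain caused flooding in the river district.",
    "The council will vote on the library budget next week.",
]

query = "What did the city council approve?"
top_docs = retrieve(query, documents, k=2)
prompt = build_prompt(query, top_docs)
print(prompt)
```

In the real app, the resulting prompt would be handed to the language model; here it is only printed to show how the retrieved context and the question are combined.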

Prerequisites

8GB of free disk space
20GB of available RAM
Optional: Create a virtual environment
Run: pip install -r requirements.txt
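The disk and memory figures above can be checked before installing. The following is a minimal sketch using only the Python standard library; the function names are hypothetical, and the memory check relies on `os.sysconf` names that are typically only available on Linux, so it returns `None` elsewhere.

```python
import os
import shutil

REQUIRED_DISK_GB = 8   # from the prerequisites above
REQUIRED_RAM_GB = 20   # from the prerequisites above


def free_disk_gb(path: str = ".") -> float:
    """Free disk space at `path`, in gigabytes (10**9 bytes)."""
    return shutil.disk_usage(path).free / 1e9


def available_ram_gb():
    """Available RAM in gigabytes, or None if it cannot be determined."""
    try:
        pages = os.sysconf("SC_AVPHYS_PAGES")
        page_size = os.sysconf("SC_PAGE_SIZE")
    except (AttributeError, ValueError, OSError):
        # os.sysconf or these names are missing on this platform.
        return None
    return pages * page_size / 1e9


def has_enough_disk(path: str = ".", required_gb: float = REQUIRED_DISK_GB) -> bool:
    """True if the free disk space at `path` meets the requirement."""
    return free_disk_gb(path) >= required_gb


print(f"Free disk space: {free_disk_gb():.1f} GB (need {REQUIRED_DISK_GB})")
ram = available_ram_gb()
if ram is None:
    print("Available RAM: could not be determined on this platform")
else:
    print(f"Available RAM: {ram:.1f} GB (need {REQUIRED_RAM_GB})")
```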

Run App

Run python src/main.py, then visit http://127.0.0.1:7860 and ask a question in the web interface.

Limitations

Without appropriate hardware acceleration, the app performs slowly. A GPU is recommended.
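To find out whether a CUDA-capable GPU is visible before starting the app, a check along these lines may help. It is only a sketch: it assumes PyTorch as the probe, which may not be the library the app itself uses for acceleration, and it reports `False` when PyTorch is not installed.

```python
import importlib.util


def cuda_available() -> bool:
    """Report whether PyTorch is installed and can see a CUDA device."""
    if importlib.util.find_spec("torch") is None:
        # PyTorch is not installed, so no check is possible this way.
        return False
    import torch

    return torch.cuda.is_available()


if cuda_available():
    print("CUDA GPU detected.")
else:
    print("No CUDA GPU detected; expect slow responses.")
```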

LazerLambda/THU-ML-RAG
