
Next.js + FastAPI: Chat with Your Website

This application implements a chatbot with a Next.js frontend and a FastAPI backend, using LangChain to retrieve and process content from websites.

[Screenshot: chatbot interface]

Features of the Hybrid App

  • Web Interaction via LangChain: Uses a recent LangChain version to load and extract content from websites.
  • Versatile Language Model Integration: Compatible with various models, including GPT-4; switching models is a one-line change (see the sketch after this list).
  • User-Friendly Next.js Frontend: The interface is intuitive and accessible to users of all technical backgrounds.
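
A minimal sketch of switching models, assuming the langchain-openai package; the model names are illustrative:

from langchain_openai import ChatOpenAI

# Swap the model name to change backends without touching the rest of the chain.
llm = ChatOpenAI(model="gpt-4", temperature=0)
# llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0)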

Operational Mechanics

The application integrates the Python/FastAPI server into the Next.js app under the /api/ route. This is achieved through next.config.js rewrites, directing any /api/:path* requests to the FastAPI server located in the /api folder. Locally, FastAPI runs on 127.0.0.1:8000, while in production, it operates as serverless functions on Vercel.
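
For reference, here is a minimal sketch of what the Python side of this wiring can look like; the route and handler below are illustrative, not the repository's actual api/index.py:

# Routes are defined under /api so the Next.js rewrite can forward
# /api/:path* requests to this server.
from fastapi import FastAPI

app = FastAPI()

@app.get("/api/health")
def health():
    # Reachable through the rewrite locally, and as a serverless
    # function on Vercel in production.
    return {"status": "ok"}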

Setting Up the Application

  1. Install dependencies:
    npm install
  2. Create a .env file with your OpenAI API key (a sketch of how the backend can read it follows these steps):
    OPENAI_API_KEY=[your-openai-api-key]
    
  3. Start the development server:
    npm run dev
  4. Access the application at http://localhost:3000. The FastAPI server runs on http://127.0.0.1:8000.
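
As a reference for step 2, here is one way the backend can pick up the key at startup. This is a minimal sketch assuming the python-dotenv package; whether this repository loads the key this way is an assumption.

import os
from dotenv import load_dotenv

load_dotenv()  # reads OPENAI_API_KEY from the .env file
api_key = os.environ["OPENAI_API_KEY"]  # raises KeyError if the key is missing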

For backend-only testing:

conda create --name nextjs-fastapi-your-chat python=3.10
conda activate nextjs-fastapi-your-chat
pip install -r requirements.txt
uvicorn api.index:app --reload

Maintaining Chat History (TODO List)

Options for preserving chat history include the following (a minimal sketch of the first option follows this list):

  • Global Variable: Simple but not ideal for scalability and consistency.
  • In-Memory Database/Cache: Scalable solutions like Redis for storing chat history.
  • Database Storage: Robust and persistent method, suitable for production environments.
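
As an illustration of the first option, here is a minimal in-memory sketch; the session_id key and message shape are hypothetical:

from collections import defaultdict

# Global variable: simple, but state is lost on restart and is not
# shared across serverless instances.
chat_histories: dict[str, list[dict]] = defaultdict(list)

def append_message(session_id: str, role: str, content: str) -> None:
    chat_histories[session_id].append({"role": role, "content": content})

def get_history(session_id: str) -> list[dict]:
    return chat_histories[session_id]

A Redis-backed variant could keep the same interface but store each session's messages under a per-session list key (e.g., via RPUSH/LRANGE), so history survives restarts and is shared across instances.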

Understanding RAG Algorithms

RAG (Retrieval Augmented Generation) enhances language models with context retrieved from a custom knowledge base. The process involves fetching HTML documents, splitting them into chunks, and vectorizing these chunks using embedding models like OpenAI's. This vectorized data forms a vector store, enabling semantic searches based on user queries. The retrieved relevant chunks are then used as context for the language model, forming a comprehensive response to user inquiries.
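
As a concrete illustration of that pipeline, here is a minimal indexing-and-retrieval sketch; the package paths match recent LangChain releases but may differ across versions, and the URL and query are placeholders:

from langchain_community.document_loaders import WebBaseLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_openai import OpenAIEmbeddings
from langchain_community.vectorstores import Chroma

# 1. Fetch the HTML document from the target site.
docs = WebBaseLoader("https://example.com").load()

# 2. Split the text into overlapping chunks.
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_documents(docs)

# 3. Embed the chunks and build a vector store for semantic search.
vectorstore = Chroma.from_documents(chunks, OpenAIEmbeddings())

# 4. Retrieve the chunks most relevant to a user query; these become
#    the context passed to the language model.
relevant_chunks = vectorstore.as_retriever().invoke("What does this site offer?")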

Implementing Context Retrieval

The get_vectorstore_from_url function extracts and processes text from a given URL, while get_context_retriever_chain forms a chain that retrieves context relevant to the entire conversation history. This pipeline approach ensures that responses are contextually aware and accurate.
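
One plausible shape for such a chain, assuming LangChain's create_history_aware_retriever helper (whether the repository uses exactly this helper is an assumption):

from langchain.chains import create_history_aware_retriever
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_openai import ChatOpenAI

def get_context_retriever_chain(vectorstore):
    llm = ChatOpenAI()
    retriever = vectorstore.as_retriever()
    # Rephrase the latest question in light of the chat history, then
    # use the rephrased question as the retrieval query.
    prompt = ChatPromptTemplate.from_messages([
        MessagesPlaceholder(variable_name="chat_history"),
        ("user", "{input}"),
        ("user", "Given the conversation above, generate a search query "
                 "to look up information relevant to the latest question."),
    ])
    return create_history_aware_retriever(llm, retriever, prompt)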

Inspiration and References
