Langchain, Pinecone, and GPT with Next.js - Full Stack Starter

This is a basic starter project for building with the following tools and APIs:

Next.js
LangchainJS
Pinecone Vector Database
GPT3

What we're building

This app takes text files, embeds their contents as vectors, stores the vectors in a vector database (Pinecone), and lets you run semantic searches over the data.
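To make "semantic searching" concrete, here is a minimal sketch of the ranking idea, assuming cosine similarity as the comparison metric. It is not the app's code: the real app gets its vectors from OpenAI embeddings and queries Pinecone, while the `VectorRecord` type, the `searchRecords` helper, and the three-dimensional vectors below are made up for illustration.

```typescript
// Toy stand-in for a vector database record; the real app stores these in Pinecone.
type VectorRecord = { id: string; text: string; values: number[] };

// Cosine similarity: close to 1 when two vectors point the same way, 0 when orthogonal.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("Vectors must have the same length");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the topK records most similar to the query vector, best match first.
function searchRecords(records: VectorRecord[], query: number[], topK: number): VectorRecord[] {
  return records
    .map((record) => ({ record, score: cosineSimilarity(record.values, query) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK)
    .map((x) => x.record);
}

// Hand-written toy vectors; real embeddings have hundreds or thousands of dimensions.
const records: VectorRecord[] = [
  { id: "a", text: "Vacation policy", values: [0.9, 0.1, 0.0] },
  { id: "b", text: "Dress code", values: [0.1, 0.9, 0.0] },
  { id: "c", text: "Paid time off", values: [0.8, 0.2, 0.1] },
];

const hits = searchRecords(records, [1, 0, 0], 2);
console.log(hits.map((h) => h.text));
```

The two vacation-related records rank above the dress-code record because their vectors point in nearly the same direction as the query vector.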

Running the app

How to deploy and run this app:

Prerequisites

This app requires the following:

An OpenAI API key
A Pinecone API key

Up and running

To run the app locally, follow these steps:

Clone this repo

git clone git@github.com:geisera/ai-handbook.git

Change into the project directory and install the dependencies using either npm or Yarn

npm install

Copy .example.env.local to a new file called .env.local and update it with your API keys and environment.

Make sure the environment is one that Pinecone actually assigned to you, such as gcp-starter.
(Optional) Add your own text or Markdown files to the /documents folder. By default, the app searches our employee handbook.
Run the app:

npm run dev
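A missing or blank setting in .env.local is an easy mistake to make in the steps above. The sketch below shows one way to check for it before starting the app. The variable names in `REQUIRED_KEYS` are assumptions, not taken from this repo; check .example.env.local for the names the app actually reads. The `missingEnvKeys` helper is hypothetical as well.

```typescript
// ASSUMED variable names; check .example.env.local for the real ones.
const REQUIRED_KEYS = ["OPENAI_API_KEY", "PINECONE_API_KEY", "PINECONE_ENVIRONMENT"];

// Return the names of required settings that are missing or blank.
function missingEnvKeys(
  env: Record<string, string | undefined>,
  required: string[] = REQUIRED_KEYS,
): string[] {
  return required.filter((key) => {
    const value = env[key];
    return value === undefined || value.trim() === "";
  });
}

// Example with placeholder values: the Pinecone key is absent, so it is reported.
const missing = missingEnvKeys({
  OPENAI_API_KEY: "sk-placeholder",
  PINECONE_ENVIRONMENT: "gcp-starter",
});
if (missing.length > 0) {
  console.warn(`Missing settings: ${missing.join(", ")}`);
}
```

In a Node.js or Next.js server context you would likely pass `process.env` as the first argument instead of a hand-written object.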

Need to know

When creating the embeddings and the index, it can take several minutes for the index to initialize. A setTimeout call in utils.ts waits 180 seconds for the index to be created.

If initialization takes longer than that, creating the embeddings will fail. If this happens, watch the status of your index in the Pinecone console until creation finishes, then run the function again.
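One alternative to a fixed wait is to poll until the index reports ready, giving up only after a longer deadline. The following is a sketch of that pattern, not what utils.ts currently does. `waitUntilReady` and `sleep` are hypothetical helpers, and the `isReady` callback is a placeholder: how to ask Pinecone for the index status depends on the client version you use, so that part is left to you.

```typescript
// Resolve after the given number of milliseconds.
function sleep(ms: number): Promise<void> {
  return new Promise<void>((resolve) => setTimeout(resolve, ms));
}

// Poll `isReady` until it returns true or the timeout elapses.
// Resolves to true when ready, false when the deadline passes first.
async function waitUntilReady(
  isReady: () => Promise<boolean>,
  intervalMs: number = 5000,
  timeoutMs: number = 600000,
): Promise<boolean> {
  const deadline = Date.now() + timeoutMs;
  while (Date.now() < deadline) {
    if (await isReady()) return true;
    await sleep(intervalMs);
  }
  return false;
}

// Fake readiness check for demonstration: reports ready on the third call.
let calls = 0;
const fakeCheck = async (): Promise<boolean> => ++calls >= 3;

waitUntilReady(fakeCheck, 10, 1000).then((ready) => {
  console.log("ready:", ready, "after", calls, "checks");
});
```

Compared with a single 180-second wait, polling returns as soon as the index is usable and tolerates slower initializations up to the chosen timeout.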

Running a query

The pre-configured app data is the M&S Salaried Employee Handbook, so the app will only understand questions related to that document unless you replace it with your own data. Here are a few questions you might ask with the default data:

What is M&S?
When was M&S founded?
Are alligators allowed at work?

This project was forked from this repository.

That project was based on this Node.js tutorial, with some restructuring, and ported over to Next.js. You can also follow them here on Twitter!

Getting your data

Check out GPT Repository Loader, which makes it simple to turn any GitHub repo into a text format that preserves the structure of the files and their contents. The result is easy to chop up and save into Pinecone using my codebase.
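"Chopping up" a long text usually means splitting it into chunks small enough to embed one at a time. Here is a minimal sketch of fixed-size chunking with overlap, assuming character counts as the unit. The `chunkText` helper and its default sizes are hypothetical and are not how this codebase splits documents; LangchainJS ships its own text splitters.

```typescript
// Split text into chunks of at most `chunkSize` characters. Each chunk after the
// first starts `overlap` characters before the previous chunk ended, so text that
// straddles a boundary appears in both chunks.
function chunkText(text: string, chunkSize: number = 1000, overlap: number = 200): string[] {
  if (chunkSize <= 0) throw new Error("chunkSize must be positive");
  if (overlap < 0 || overlap >= chunkSize) {
    throw new Error("overlap must be at least 0 and smaller than chunkSize");
  }
  const chunks: string[] = [];
  const step = chunkSize - overlap;
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // this chunk reached the end of the text
  }
  return chunks;
}

// Tiny example so the overlap is visible: chunks of 4 characters sharing 1 character.
const chunks = chunkText("abcdefghij", 4, 1);
console.log(chunks);
```

The overlap is a trade-off: larger values keep more context across chunk boundaries but store more duplicated text in the index.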
