Vector Embedding Project with JSON and Pinecone

Overview

This project demonstrates how to generate vector embeddings for JSON data using a large language model (LLM) provider, store the embeddings in Pinecone, and use them for semantic search and retrieval.
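
To make "semantic search" concrete, the sketch below ranks stored vectors by cosine similarity to a query vector, which is the kind of nearest-neighbour lookup a vector database performs. The toy three-dimensional vectors and the names cosineSimilarity and topK are illustrative assumptions, not code from this repository, and the metric Pinecone actually uses depends on how the index was created.

```typescript
// Illustrative only: toy vectors stand in for real embeddings.
type StoredVector = { id: string; values: number[] };

// Cosine similarity: dot(a, b) / (|a| * |b|).
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same dimension");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // A zero vector has no direction, so treat it as dissimilar to everything.
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Return the k stored vectors most similar to the query, best first.
function topK(query: number[], stored: StoredVector[], k: number) {
  return stored
    .map((v) => ({ id: v.id, score: cosineSimilarity(query, v.values) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}

const stored: StoredVector[] = [
  { id: "a", values: [1, 0, 0] },
  { id: "b", values: [0.9, 0.1, 0] },
  { id: "c", values: [0, 0, 1] },
];
const nearest = topK([1, 0, 0], stored, 2);
console.log(nearest.map((m) => m.id)); // ["a", "b"]
```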

Prerequisites

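Node.js with npm (the project is a Next.js/TypeScript app)
Pinecone account
Google Gemini API key (or a key for an alternative LLM provider such as OpenAI)
Required libraries (installed by npm i; see package.json for the exact packages):
- Pinecone client
- Gemini client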

Installation

Clone the repository:

git clone https://github.com/CodingWithTashi/json-vector-embedder.git
cd json-vector-embedder

Install dependencies:

npm i

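Set up environment variables (for example in a .env.local file, which Next.js loads automatically):

NEXT_PUBLIC_PINECONE_API_KEY=your-pinecone-api-key
NEXT_PUBLIC_PINECONE_INDEX=your-pinecone-index-name
NEXT_PUBLIC_GOOGLE_API_KEY=your-google-api-key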
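
A small check at startup can catch a missing variable before any API call is made. This is a minimal sketch that assumes the three variable names above; the readConfig and requireEnv helpers and the AppConfig type are hypothetical and not taken from the repository. The environment is passed in as a plain object so the function can be tried without a running app.

```typescript
type AppConfig = {
  pineconeApiKey: string;
  pineconeIndex: string;
  googleApiKey: string;
};

type Env = Record<string, string | undefined>;

// Look up one variable and fail loudly if it is missing or blank.
function requireEnv(env: Env, key: string): string {
  const value = env[key];
  if (value === undefined || value.trim() === "") {
    throw new Error(`Missing environment variable: ${key}`);
  }
  return value;
}

// Collect the three settings this README asks for into one object.
function readConfig(env: Env): AppConfig {
  return {
    pineconeApiKey: requireEnv(env, "NEXT_PUBLIC_PINECONE_API_KEY"),
    pineconeIndex: requireEnv(env, "NEXT_PUBLIC_PINECONE_INDEX"),
    googleApiKey: requireEnv(env, "NEXT_PUBLIC_GOOGLE_API_KEY"),
  };
}

const exampleConfig = readConfig({
  NEXT_PUBLIC_PINECONE_API_KEY: "your-pinecone-api-key",
  NEXT_PUBLIC_PINECONE_INDEX: "your-pinecone-index-name",
  NEXT_PUBLIC_GOOGLE_API_KEY: "your-google-api-key",
});
console.log(exampleConfig.pineconeIndex); // "your-pinecone-index-name"
```

In server-side code this could be called as readConfig(process.env). In browser code, Next.js only inlines literal process.env.NEXT_PUBLIC_... references at build time, so a dynamic lookup like this one would not see the values there.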

Step-by-Step Process

1. Import JSON Data

// InputData is the project's record type; its definition is not shown in this README.
import inputDataList from "../../example.data.json";

// Load the records from example.data.json and reject empty or placeholder data.
export async function loadInputData(): Promise<InputData[]> {
  try {
    if (inputDataList.length === 0) {
      throw new Error("No data found in the JSON file");
    } else if (inputDataList.length === 1 && inputDataList[0].type === "test") {
      // A single record of type "test" is treated as placeholder data.
      throw new Error("Add actual json data to example.data.json");
    } else {
      return inputDataList as InputData[];
    }
  } catch (error) {
    console.error("Error loading or parsing JSON file:", error);
    throw error;
  }
}
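
Before embedding, each JSON record is usually turned into a text string plus metadata. The sketch below shows one way to do that. The InputData fields used here are a hypothetical shape (the repository's real type is not shown in this README), and toDocuments is an illustrative name. The { pageContent, metadata } layout mirrors the LangChain document convention that PineconeStore.fromDocuments expects.

```typescript
// Hypothetical record shape; the real InputData type in the repository may differ.
type InputData = { id: string; type: string; title: string; description: string };

// Mirrors the { pageContent, metadata } shape of a LangChain document.
type Doc = { pageContent: string; metadata: Record<string, string> };

function toDocuments(records: InputData[]): Doc[] {
  return records
    .filter((r) => r.description.trim() !== "") // skip records with nothing to embed
    .map((r) => ({
      // The text that will be embedded.
      pageContent: `${r.title}\n${r.description}`,
      // Fields kept alongside the vector for display and filtering.
      metadata: { id: r.id, type: r.type, title: r.title },
    }));
}

const exampleDocs = toDocuments([
  { id: "1", type: "monastery", title: "Example title", description: "Example description." },
  { id: "2", type: "monastery", title: "Empty record", description: "  " },
]);
console.log(exampleDocs.length); // 1
```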

2. Save Embeddings in Pinecone

// docs, embeddings, and index are created earlier in the pipeline (not shown here).
await PineconeStore.fromDocuments(docs, embeddings, {
  pineconeIndex: index,
  namespace: "monastery_data", // store all vectors under one namespace
});
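
Conceptually, each stored entry pairs an id with an embedding vector and some metadata. The sketch below builds such records from documents and precomputed vectors, then splits them into batches for upload. It is not the actual implementation of PineconeStore.fromDocuments, which handles these steps internally; buildRecords, toBatches, the "doc" id prefix, and the batch size of 2 are illustrative assumptions. The id/values/metadata layout follows the shape Pinecone records use.

```typescript
type Doc = { pageContent: string; metadata: Record<string, string> };
type VectorRecord = { id: string; values: number[]; metadata: Record<string, string> };

// Pair each document with its embedding vector (computed elsewhere).
function buildRecords(docs: Doc[], vectors: number[][], idPrefix: string): VectorRecord[] {
  if (docs.length !== vectors.length) {
    throw new Error("Each document needs exactly one vector");
  }
  return docs.map((doc, i) => ({
    id: `${idPrefix}-${i}`,
    values: vectors[i],
    // Keep the source text in metadata so it can be shown after a query.
    metadata: { ...doc.metadata, text: doc.pageContent },
  }));
}

// Split items into fixed-size batches; the last batch may be smaller.
function toBatches<T>(items: T[], size: number): T[][] {
  if (size < 1) throw new Error("Batch size must be at least 1");
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    batches.push(items.slice(i, i + size));
  }
  return batches;
}

const sampleDocs: Doc[] = [
  { pageContent: "first", metadata: { type: "monastery" } },
  { pageContent: "second", metadata: { type: "monastery" } },
  { pageContent: "third", metadata: { type: "monastery" } },
];
// Toy two-dimensional vectors stand in for real embeddings.
const sampleVectors = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]];
const recordBatches = toBatches(buildRecords(sampleDocs, sampleVectors, "doc"), 2);
console.log(recordBatches.map((b) => b.length)); // [2, 1]
```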

3. Query with LLM

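// embedding is the query text's vector; it should come from the same embedding model used for the stored documents.
const vectorQueryResponse = await pineconeIndex
  .namespace("monastery_data") // must match the namespace used when saving
  .query({
    vector: embedding,
    topK: 4, // return the four closest matches
    includeMetadata: true, // include stored metadata with each match
  });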
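
The matches returned by the query carry the stored metadata, and a common next step is to assemble that into a context block for the LLM prompt. The sketch below assumes each match has an id, an optional score, and a metadata object with a text field; the text key, the buildPrompt name, and the 0.5 score threshold are assumptions for illustration and depend on how the documents were stored.

```typescript
type Match = { id: string; score?: number; metadata?: Record<string, unknown> };

// Build a prompt that asks the LLM to answer from the retrieved context only.
function buildPrompt(question: string, matches: Match[], minScore = 0.5): string {
  const context = matches
    .filter((m) => (m.score ?? 0) >= minScore) // drop weak matches
    .map((m) => m.metadata?.["text"])
    .filter((t): t is string => typeof t === "string" && t.length > 0)
    .map((t, i) => `[${i + 1}] ${t}`)
    .join("\n");
  if (context === "") {
    return `Question: ${question}\n(No relevant context was found.)`;
  }
  return `Answer using only the context below.\n\nContext:\n${context}\n\nQuestion: ${question}`;
}

const examplePrompt = buildPrompt("What is stored here?", [
  { id: "doc-0", score: 0.9, metadata: { text: "First passage." } },
  { id: "doc-1", score: 0.2, metadata: { text: "Weak match." } },
  { id: "doc-2", score: 0.8 }, // no metadata, so it contributes nothing
]);
console.log(examplePrompt);
```

Numbering the passages makes it possible to ask the model to cite which passage it used, though that is a design choice rather than a requirement.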

Contributing

Contributions are welcome! Please submit a pull request or open an issue.

Acknowledgements

Pinecone
OpenAI/Gemini
