<a href="https://colab.research.google.com/github/SourasishBasu/ChatPDF-clone-llama2b/blob/main/ChatPDF_Clone.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Installation of GradientAI and Cassandra packages



In [None]:
!pip install -q cassandra-driver

In [None]:
!pip install -q "cassio>=0.1.1"
!pip install -q gradientai --upgrade
!pip install -q llama-index
!pip install -q pypdf
!pip install -q tiktoken==0.4.0

# Import OS and JSON Modules and Load Gradient Credentials

In [3]:
import os
import json
from google.colab import userdata

os.environ['GRADIENT_ACCESS_TOKEN'] = userdata.get('GRADIENT_ACCESS_TOKEN')
os.environ['GRADIENT_WORKSPACE_ID'] = userdata.get('GRADIENT_WORKSPACE_ID')
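
The cell above copies the two Gradient secrets from Colab into environment variables. If one of them is missing or empty, it is easier to debug when the notebook stops right here rather than at the first Gradient call. A minimal sketch of such a check follows; `require_env` is a hypothetical helper written for this illustration, not part of Colab, Gradient, or LlamaIndex.

```python
import os


def require_env(names, environ=None):
    """Return the requested variables as a dict, raising if any is missing or empty.

    `environ` defaults to os.environ; passing a plain dict makes the helper easy to test.
    """
    environ = os.environ if environ is None else environ
    missing = [name for name in names if not environ.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: environ[name] for name in names}
```

Calling `require_env(["GRADIENT_ACCESS_TOKEN", "GRADIENT_WORKSPACE_ID"])` right after the assignments would surface a missing secret immediately.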

# Import Cassandra & Llama Index

In [4]:
from cassandra.auth import PlainTextAuthProvider
from cassandra.cluster import Cluster
from llama_index import ServiceContext
from llama_index import set_global_service_context
from llama_index import VectorStoreIndex, SimpleDirectoryReader, StorageContext
from llama_index.embeddings import GradientEmbedding
from llama_index.llms import GradientBaseModelLLM
from llama_index.vector_stores import CassandraVectorStore

In [None]:
import cassandra
print(cassandra.__version__)

# Connecting to the Vector Database (AstraDB)

In [None]:
# Path to the Astra DB secure connect bundle (.zip)
cloud_config = {
  'secure_connect_bundle': 'secure-connect-temp-db.zip'
}

# The Astra DB application token JSON file is autogenerated from the Astra DB connection dashboard
with open("temp_db-token.json") as f:
    secrets = json.load(f)

CLIENT_ID = secrets["clientId"]
CLIENT_SECRET = secrets["secret"]

auth_provider = PlainTextAuthProvider(CLIENT_ID, CLIENT_SECRET)
cluster = Cluster(cloud=cloud_config, auth_provider=auth_provider)
session = cluster.connect()

row = session.execute("select release_version from system.local").one()
if row:
  print(row[0])
else:
  print("An error occurred.")
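
The connection cell reads `clientId` and `secret` straight out of the token file, so a wrong or truncated file fails with a bare `KeyError`. A small sketch of a more explicit parser is shown below. `parse_astra_token` is a hypothetical name, and the only assumption about the file format is the two keys the cell above already uses.

```python
import json


def parse_astra_token(raw_json):
    """Parse the text of an Astra DB token file and return (client_id, client_secret)."""
    secrets = json.loads(raw_json)
    missing = [key for key in ("clientId", "secret") if key not in secrets]
    if missing:
        raise ValueError("Token file is missing keys: " + ", ".join(missing))
    return secrets["clientId"], secrets["secret"]
```

It could be used as `CLIENT_ID, CLIENT_SECRET = parse_astra_token(open("temp_db-token.json").read())` before building the `PlainTextAuthProvider`.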

# Defining the Gradient LLM connection

In [7]:
llm = GradientBaseModelLLM(base_model_slug="llama2-7b-chat",
                           max_tokens=400,
                           )

# Configuring Embeddings

In [8]:
embed_model = GradientEmbedding(
    gradient_access_token=os.environ["GRADIENT_ACCESS_TOKEN"],
    gradient_workspace_id=os.environ["GRADIENT_WORKSPACE_ID"],
    gradient_model_slug="bge-large",
)

In [9]:
service_context = ServiceContext.from_defaults(
    llm=llm,
    embed_model=embed_model,
    chunk_size=256,
)

set_global_service_context(service_context)
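
The `chunk_size=256` setting limits how much text goes into each chunk before it is embedded. The sketch below only illustrates the idea with a plain whitespace split. It is not LlamaIndex's splitter, which measures size in tokens rather than words, and `chunk_words` with its `overlap` parameter is a hypothetical helper for this illustration.

```python
def chunk_words(text, chunk_size=256, overlap=0):
    """Split text into chunks of at most `chunk_size` words, sharing `overlap` words between neighbours."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current chunk already reaches the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks
```

Smaller chunks give more precise retrieval hits but less context per hit; larger chunks do the opposite.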

# Load PDF to chat with

In [17]:
documents = SimpleDirectoryReader("/content/Documents").load_data()
print(f"Loaded {len(documents)} document(s).")

Loaded 17 document(s).


# Set Up and Query the Index

In [18]:
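# No storage_context is passed here, so the vectors are kept in LlamaIndex's
# default in-memory store rather than in the Astra DB session opened earlier.
index = VectorStoreIndex.from_documents(documents, service_context=service_context)
query_engine = index.as_query_engine()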

In [19]:
response = query_engine.query("How does Facebook use Cassandra?")
print(response)

 According to the text, Facebook uses Cassandra as the backend storage system for multiple services within the platform. Specifically, one of the applications in the Facebook platform uses Cassandra.
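
Behind a call like `query_engine.query(...)`, the question is embedded, the most similar chunks are retrieved, and those chunks are handed to the LLM as context. The retrieval step can be sketched with cosine similarity on toy data. The 3-dimensional vectors and chunk names below are made up for illustration (real `bge-large` embeddings are far larger), and `top_k` is a hypothetical helper, not LlamaIndex's retriever.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k(query_vec, chunk_vecs, k=2):
    """Return the names of the k chunks most similar to the query, best first."""
    scored = [(cosine(query_vec, vec), name) for name, vec in chunk_vecs.items()]
    scored.sort(reverse=True)
    return [name for _, name in scored[:k]]


# Made-up embeddings for three chunks and one query.
chunk_vecs = {
    "chunk_a": [0.9, 0.1, 0.0],
    "chunk_b": [0.0, 0.2, 0.9],
    "chunk_c": [0.7, 0.3, 0.1],
}
query_vec = [1.0, 0.2, 0.0]

print(top_k(query_vec, chunk_vecs))  # ['chunk_a', 'chunk_c']
```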
