## Loading My Capstone PDF

In [7]:
from langchain_community.document_loaders import PyPDFLoader
loader = PyPDFLoader("GROUP2_ANUJ_BHANDARI_0348445_CP1.pdf")
documents = loader.load()


## Document Splitter

In [8]:
from langchain_text_splitters import CharacterTextSplitter
splitter = CharacterTextSplitter(chunk_size=200, chunk_overlap=30)
docs = splitter.split_documents(documents)


In [10]:
len(docs)

62
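A note on the count: 62 matches the `total_pages` value in the PDF metadata, and the retrieved chunks further down are far longer than 200 characters, so the splitter most likely left every page as one chunk. `CharacterTextSplitter` splits on a single separator (`"\n\n"` by default), and text that never contains that separator cannot be cut down to `chunk_size`. The sketch below is a simplified pure-Python illustration of that behaviour, not LangChain's actual implementation; `split_on_separator`, `page_like` and `paragraphs` are made-up names, and chunk overlap is left out.

```python
def split_on_separator(text, separator="\n\n", chunk_size=200):
    """Split on one separator, then merge neighbouring pieces up to chunk_size."""
    pieces = [p for p in text.split(separator) if p]
    chunks, current = [], ""
    for piece in pieces:
        candidate = piece if not current else current + separator + piece
        if len(candidate) <= chunk_size:
            current = candidate
        else:
            if current:
                chunks.append(current)
            # A piece longer than chunk_size stays whole:
            # there is no finer separator to fall back on.
            current = piece
    if current:
        chunks.append(current)
    return chunks


# Text with single newlines only, like a page extracted from a PDF.
page_like = "line one\nline two\n" * 50
# Text with blank lines between short paragraphs.
paragraphs = "\n\n".join(["short paragraph"] * 20)

print(len(page_like), len(split_on_separator(page_like)))
print([len(chunk) for chunk in split_on_separator(paragraphs)])
```

If smaller chunks are wanted, `RecursiveCharacterTextSplitter` from the same package falls back to finer separators and may be worth trying.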

## Ollama Embeddings

In [None]:
from langchain_community.embeddings import OllamaEmbeddings
embeddings = OllamaEmbeddings(model="gemma2:2b")
# embed_documents expects a list of strings, not Document objects
embedded_docs = embeddings.embed_documents([doc.page_content for doc in docs])

In [None]:
from langchain_community.vectorstores import FAISS
db = FAISS.from_documents(docs, embeddings)

## Querying the VectorDB

In [17]:
query = "Which YOLO is used in BinHero ?"
matches = db.similarity_search(query)  # separate name so the chunk list in `docs` is not overwritten
matches

[Document(id='85adad8d-d652-4476-84c9-75eb4caf4923', metadata={'producer': 'xdvipdfmx (20240305)', 'creator': 'LaTeX with hyperref', 'creationdate': '2025-07-24T15:08:47+00:00', 'source': 'GROUP2_ANUJ_BHANDARI_0348445_CP1.pdf', 'total_pages': 62, 'page': 10, 'page_label': '1'}, page_content='Chapter 1\nIntroduction\n1.1 Executive Summary\nRapid urbanization and city growth have caused a significant increase in worldwide waste.\nBy 2050, this production is anticipated to increase by 70%, per Kaza et al. (2018). W aste\nmanagement techniques in the past had certain issues that made it challenging for both\npeople and communities to participate and recycle properly . In order to confront this\nmultifaceted challenge, the application presents a novel approach and tackles the issue\nusing crowdsourcing, gamification, and artificial intelligence as features within a single\nmobile application.\nBinHero is a mobile application that is powered by AI technologies like object detection\nand moti

In [27]:
# The store can also be searched with a precomputed query embedding
query_embed = embeddings.embed_query(query)
results = db.similarity_search_by_vector(query_embed)
results[0].page_content

'Chapter 1\nIntroduction\n1.1 Executive Summary\nRapid urbanization and city growth have caused a significant increase in worldwide waste.\nBy 2050, this production is anticipated to increase by 70%, per Kaza et al. (2018). W aste\nmanagement techniques in the past had certain issues that made it challenging for both\npeople and communities to participate and recycle properly . In order to confront this\nmultifaceted challenge, the application presents a novel approach and tackles the issue\nusing crowdsourcing, gamification, and artificial intelligence as features within a single\nmobile application.\nBinHero is a mobile application that is powered by AI technologies like object detection\nand motion tracking. These AI technologies help validate trash disposal actions in real-\ntime, while also promoting accurate sorting behavior with fast feedback. Point systems,\nleaderboards, PvP challenges, and team-based tasks, such as gamified elements, further\nenhance user engagement and devel
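Conceptually, searching by vector means comparing the query embedding with every stored embedding and keeping the closest ones. The sketch below shows that idea with a brute-force scan in plain Python. It is an illustration only, not how FAISS is implemented; `squared_l2`, `top_k`, `toy_store` and the two-dimensional vectors are invented for the example.

```python
def squared_l2(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def top_k(query_vec, stored, k=4):
    """Return the k (distance, text) pairs closest to query_vec, nearest first."""
    scored = [(squared_l2(query_vec, vec), text) for text, vec in stored]
    return sorted(scored, key=lambda pair: pair[0])[:k]


# Toy "vector store": (text, embedding) pairs with made-up 2-D embeddings.
toy_store = [
    ("chunk about waste", [1.0, 0.0]),
    ("chunk about YOLO", [0.0, 1.0]),
    ("chunk about gamification", [0.7, 0.7]),
]
toy_query = [0.1, 0.9]

nearest = top_k(toy_query, toy_store, k=2)
for distance, text in nearest:
    print(f"{distance:.2f}  {text}")
```

With real embeddings the vectors have hundreds or thousands of dimensions, but the ranking step is the same.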

In [18]:
retriever = db.as_retriever()  # a single retriever object
result = retriever.invoke(query)

In [28]:
print(result[0].page_content)

Chapter 1
Introduction
1.1 Executive Summary
Rapid urbanization and city growth have caused a significant increase in worldwide waste.
By 2050, this production is anticipated to increase by 70%, per Kaza et al. (2018). W aste
management techniques in the past had certain issues that made it challenging for both
people and communities to participate and recycle properly . In order to confront this
multifaceted challenge, the application presents a novel approach and tackles the issue
using crowdsourcing, gamification, and artificial intelligence as features within a single
mobile application.
BinHero is a mobile application that is powered by AI technologies like object detection
and motion tracking. These AI technologies help validate trash disposal actions in real-
time, while also promoting accurate sorting behavior with fast feedback. Point systems,
leaderboards, PvP challenges, and team-based tasks, such as gamified elements, further
enhance user engagement and develop long-term ha

## Similarity Search with Scores

In [22]:
docs_and_scores = db.similarity_search_with_score(query)
for doc, score in docs_and_scores:
    print(f"Score: {score}, Content: {doc.page_content}")

Score: 8096.2431640625, Content: Chapter 1
Introduction
1.1 Executive Summary
Rapid urbanization and city growth have caused a significant increase in worldwide waste.
By 2050, this production is anticipated to increase by 70%, per Kaza et al. (2018). W aste
management techniques in the past had certain issues that made it challenging for both
people and communities to participate and recycle properly . In order to confront this
multifaceted challenge, the application presents a novel approach and tackles the issue
using crowdsourcing, gamification, and artificial intelligence as features within a single
mobile application.
BinHero is a mobile application that is powered by AI technologies like object detection
and motion tracking. These AI technologies help validate trash disposal actions in real-
time, while also promoting accurate sorting behavior with fast feedback. Point systems,
leaderboards, PvP challenges, and team-based tasks, such as gamified elements, further
enhance user en
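On reading the score: assuming the store uses LangChain's default FAISS settings (an L2 index, as far as I know), the number is a squared Euclidean distance, so lower means closer, and its size depends on how long the embedding vectors are. The sketch below uses made-up 2-D vectors to show that scaling the vectors inflates the distance while normalising them to unit length keeps it between 0 and 4. `squared_l2` and `normalize` are illustrative helpers, not library functions.

```python
import math


def squared_l2(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


a = [3.0, 4.0]
b = [4.0, 3.0]

raw = squared_l2(a, b)
# Same directions, ten times the magnitude: the distance grows a hundredfold.
scaled = squared_l2([10 * x for x in a], [10 * x for x in b])
# Unit-length vectors: squared distance is bounded by 4.
unit = squared_l2(normalize(a), normalize(b))

print(raw, scaled, round(unit, 4))
```

A score in the thousands therefore says little by itself; it is the ordering of the scores across chunks that carries the ranking.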

In [24]:
db.save_local("mycpDB")
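As far as I know, `save_local` writes two files into the target folder under the default index name: `index.faiss` (the vector index) and `index.pkl` (the pickled documents and id mapping). The pickle file is the reason loading requires `allow_dangerous_deserialization=True`, so it should only be loaded from a trusted source. A small sanity check before loading could look like the sketch below; `missing_index_files` is a hypothetical helper, and the file names are an assumption based on that default.

```python
from pathlib import Path
import tempfile


def missing_index_files(folder, index_name="index"):
    """Return the expected index files that are absent from folder."""
    folder = Path(folder)
    expected = [f"{index_name}.faiss", f"{index_name}.pkl"]
    return [name for name in expected if not (folder / name).is_file()]


# Demo on a temporary folder that contains only one of the two files.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "index.faiss").write_bytes(b"")
    demo_missing = missing_index_files(tmp)

print(demo_missing)
```

In this notebook the call would be `missing_index_files("mycpDB")`, which should return an empty list once the save has succeeded.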

In [25]:
db = FAISS.load_local("mycpDB", embeddings, allow_dangerous_deserialization=True)  # classmethod: returns a new store, so assign it
db

<langchain_community.vectorstores.faiss.FAISS at 0x2a6b31920d0>

In [29]:
res = db.similarity_search(query)
print(res[0].page_content)

Chapter 1
Introduction
1.1 Executive Summary
Rapid urbanization and city growth have caused a significant increase in worldwide waste.
By 2050, this production is anticipated to increase by 70%, per Kaza et al. (2018). W aste
management techniques in the past had certain issues that made it challenging for both
people and communities to participate and recycle properly . In order to confront this
multifaceted challenge, the application presents a novel approach and tackles the issue
using crowdsourcing, gamification, and artificial intelligence as features within a single
mobile application.
BinHero is a mobile application that is powered by AI technologies like object detection
and motion tracking. These AI technologies help validate trash disposal actions in real-
time, while also promoting accurate sorting behavior with fast feedback. Point systems,
leaderboards, PvP challenges, and team-based tasks, such as gamified elements, further
enhance user engagement and develop long-term ha