In [1]:
from langchain_community.document_loaders import TextLoader

loader = TextLoader("../data/nividia.txt")
text_data = loader.load()

text_data[0].page_content

'\n⸻\n\n📄 NVIDIA Corporation – Founder, History & Financial Overview (2000–2026)\n\nFounder and Company Creation\n\nNVIDIA Corporation was founded in April 1993 by Jensen Huang, Chris Malachowsky, and Curtis Priem. Jensen Huang, a Taiwanese-American electrical engineer, serves as the co-founder, President, and Chief Executive Officer of NVIDIA. Under his leadership, NVIDIA evolved from a small graphics chip startup into a global leader in high-performance computing, artificial intelligence, and data-center acceleration.\n\nThe company’s original vision was to harness the power of parallel computing to solve complex problems in graphics and computation. NVIDIA is headquartered in Santa Clara, California, and trades publicly under the ticker NVDA on the NASDAQ stock exchange.\n\n⸻\n\nBusiness Overview\n\nNVIDIA is a leading designer of graphics processing units (GPUs) and accelerated computing platforms. Its products are widely used across multiple markets, including:\n\t•

In [2]:
from langchain_community.document_loaders import WebBaseLoader

urls = [
    "https://corporatefinanceinstitute.com/resources/valuation/equity-research-report/",
    "https://www.moneycontrol.com/",
]

USER_AGENT environment variable not set, consider setting it to identify your requests.
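
The warning above is logged when the web loader module is imported without a USER_AGENT environment variable. A minimal sketch for avoiding it, assuming the variable is set before the import runs (the value shown is a hypothetical placeholder, not something the notebook defines):

```python
import os

# Hypothetical identifier; replace it with a string that describes your own project.
# setdefault leaves any value that is already present in the environment untouched.
os.environ.setdefault("USER_AGENT", "equity-research-notebook/0.1")

# Import WebBaseLoader only after this point, so the variable is already set
# when the loader module reads it.
print(os.environ["USER_AGENT"])
```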


In [3]:
loader = WebBaseLoader(urls)
web_data = loader.load()
web_data[0]

Document(metadata={'source': 'https://corporatefinanceinstitute.com/resources/valuation/equity-research-report/', 'title': 'Equity Research Report: Definition, Types, and Key Components', 'description': 'Learn how to write an equity research report—key sections, examples, and tips used by analysts to evaluate stocks, forecasts, and investment recommendations.', 'language': 'en-US'}, page_content="\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nEquity Research Report: Definition, Types, and Key Components\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n              Corporate Finance Institute            \n\n\nMenu\n\n\n\n\n\n\n\n\n\n\nCertifications\n\n\nCertifications\n\nCertifications are rigorous, career-focused programs designed to develop the practical skills and confidence required for success in the finance industry.\n Financial Modeling & Valuation Analyst (FMVA®)\nFinancial Planning & Analysis Professional (FPAP™) Certifi
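
The page_content above is dominated by runs of blank lines and padding left over from the site's navigation markup, and that padding counts toward the chunk size during splitting. A minimal sketch of a cleanup pass, assuming it is acceptable to normalize whitespace before chunking (collapse_whitespace is a hypothetical helper, not part of LangChain):

```python
import re


def collapse_whitespace(text: str) -> str:
    """Trim each line, squeeze spaces/tabs, and limit blank-line runs to one."""
    # Squeeze horizontal whitespace inside each line and strip its ends.
    lines = [re.sub(r"[ \t]+", " ", line).strip() for line in text.splitlines()]
    cleaned = "\n".join(lines)
    # Three or more newlines in a row become a single paragraph break.
    return re.sub(r"\n{3,}", "\n\n", cleaned).strip()


sample = "\n\n\n   Corporate Finance Institute    \n\n\nMenu\n\n"
print(repr(collapse_whitespace(sample)))
```

It could be applied to each loaded document before splitting, for example by assigning collapse_whitespace(doc.page_content) back to doc.page_content. Note that it also flattens tab-indented bullets, which may or may not be desirable.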

In [4]:
documents = text_data + web_data

In [5]:
from langchain_text_splitters import RecursiveCharacterTextSplitter

text_splitter = RecursiveCharacterTextSplitter(
    chunk_size=1000,
    chunk_overlap=200,
    separators=["\n\n", "\n", " ", ""],
)

chunks = text_splitter.split_documents(documents)
print(f"Split {len(documents)} documents into {len(chunks)} chunks")

Split 3 documents into 190 chunks
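
To make chunk_size and chunk_overlap concrete, here is a simplified fixed-width splitter. It is not the algorithm RecursiveCharacterTextSplitter uses (that one tries the separators in order and merges pieces up to chunk_size); it only shows how an overlap makes consecutive chunks share their boundary text. sliding_window_chunks is a hypothetical name.

```python
def sliding_window_chunks(text: str, chunk_size: int, chunk_overlap: int) -> list[str]:
    """Cut text into windows of chunk_size characters that overlap by chunk_overlap."""
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - chunk_overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window already reaches the end of the text.
        if start + chunk_size >= len(text):
            break
        start += step
    return chunks


demo = sliding_window_chunks("abcdefghijklmnopqrstuvwxy", chunk_size=10, chunk_overlap=4)
for piece in demo:
    print(piece)
```

Each chunk after the first begins with the last four characters of the previous one, which is the role the 200-character overlap plays in the cell above.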


In [6]:
for i, chunk in enumerate(chunks[:2]):
    print(f"\n--- Chunk {i + 1} ---")
    print("SOURCE:", chunk.metadata.get("source"))
    print(chunk.page_content[:400])


--- Chunk 1 ---
SOURCE: ../data/nividia.txt
⸻

📄 NVIDIA Corporation – Founder, History & Financial Overview (2000–2026)

Founder and Company Creation

NVIDIA Corporation was founded in April 1993 by Jensen Huang, Chris Malachowsky, and Curtis Priem. Jensen Huang, a Taiwanese-American electrical engineer, serves as the co-founder, President, and Chief Executive Officer of NVIDIA. Under his leadership, NVIDIA evolved from a small graphics chi

--- Chunk 2 ---
SOURCE: ../data/nividia.txt
⸻

Business Overview

NVIDIA is a leading designer of graphics processing units (GPUs) and accelerated computing platforms. Its products are widely used across multiple markets, including:
	•	PC gaming and professional visualization
	•	Data centers and cloud computing
	•	Artificial intelligence and machine learning
	•	Autonomous vehicles and robotics
	•	High-performance computing (HPC)

Over time,


In [7]:
for i, chunk in enumerate(chunks[:5]):
    print(f"Chunk {i + 1} length:", len(chunk.page_content))

Chunk 1 length: 795
Chunk 2 length: 943
Chunk 3 length: 919
Chunk 4 length: 945
Chunk 5 length: 912
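
The lengths printed above all fall below the configured chunk_size of 1000, which is expected because the splitter breaks at separators rather than at exact character counts. A small sketch for summarizing lengths across a whole list (summarize_lengths is a hypothetical helper; the sample values are the five lengths shown above):

```python
from statistics import mean


def summarize_lengths(lengths: list[int], chunk_size: int) -> dict:
    """Return basic statistics and the number of chunks longer than chunk_size."""
    return {
        "min": min(lengths),
        "max": max(lengths),
        "mean": mean(lengths),
        "over_limit": sum(1 for n in lengths if n > chunk_size),
    }


observed = [795, 943, 919, 945, 912]
summary = summarize_lengths(observed, chunk_size=1000)
print(summary)
```

For the full run, the list could be built with [len(c.page_content) for c in chunks].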


In [8]:
from langchain_community.embeddings import OllamaEmbeddings

embeddings_model = OllamaEmbeddings(model="nomic-embed-text")
embeddings_model.embed_query("What is nividia")


[0.33898767828941345,
 0.771591067314148,
 -3.4281914234161377,
 -0.40876305103302,
 0.9289260506629944,
 -1.1907978057861328,
 -0.20271924138069153,
 -1.7530714273452759,
 -0.18792177736759186,
 0.8543890714645386,
 -0.17794020473957062,
 1.2349166870117188,
 0.37654930353164673,
 -0.022515257820487022,
 -0.8861027956008911,
 -0.7887607216835022,
 -0.7016777992248535,
 -0.38746896386146545,
 0.3744550049304962,
 0.7300872206687927,
 -2.114030122756958,
 -1.2000222206115723,
 -0.9990964531898499,
 0.44366416335105896,
 1.796762466430664,
 0.4834224283695221,
 0.7407024502754211,
 -0.06032320484519005,
 0.20169880986213684,
 1.4194960594177246,
 1.6145542860031128,
 0.1532917320728302,
 0.282284140586853,
 0.8439176082611084,
 1.0196754932403564,
 -0.13861067593097687,
 0.4532450735569,
 0.4353477358818054,
 -0.371155321598053,
 1.2670129537582397,
 0.12140021473169327,
 -0.9526846408843994,
 0.5406261086463928,
 -0.8658179044723511,
 1.4247583150863647,
 -0.16910389065742493,
 0.481994
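
The list above is the start of the query's embedding vector. Such vectors are only useful in comparison with one another, and a common measure is cosine similarity. A minimal pure-Python sketch (cosine_similarity is a hypothetical helper, and the toy vectors are made up rather than real model output):

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


print(cosine_similarity([1.0, 0.0], [1.0, 0.0]))   # same direction
print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))   # orthogonal
print(cosine_similarity([1.0, 2.0], [-1.0, -2.0])) # opposite direction
```

With real embeddings, the two arguments would be the outputs of embed_query for the texts being compared.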

In [9]:
from langchain_community.vectorstores import Chroma

vectorstore = Chroma.from_documents(
    documents=chunks, embedding=embeddings_model, persist_directory="./equity_db"
)

In [10]:
query = "What are the key risks in equity research?"
docs = vectorstore.similarity_search(query, k=3)

for d in docs:
    print(d.metadata.get("source"))
    print(d.page_content[:200])

https://corporatefinanceinstitute.com/resources/valuation/equity-research-report/
Equity Research Report Example
Below is an example of an equity research report on Kraft Foods. As you can see in the images below, the analyst clearly lays out the recommendation, target price, recen
https://corporatefinanceinstitute.com/resources/valuation/equity-research-report/
Start Free                          



What is an Equity Research Report?
An equity research report is a document prepared by an analyst that provides a recommendation on whether investors should buy
https://corporatefinanceinstitute.com/resources/valuation/equity-research-report/
3. Who writes equity research reports?
Equity research reports are written by equity research analysts who work at investment banks, brokerage firms, or asset management companies. These analysts typi
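
Conceptually, similarity_search(query, k=3) embeds the query and returns the k stored chunks whose vectors are closest to it. The sketch below does that by brute force over toy two-dimensional vectors with squared Euclidean distance. It is an illustration only: Chroma uses an approximate index internally, and the vectors, labels, and top_k helper here are hypothetical.

```python
import heapq


def squared_l2(a: list[float], b: list[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def top_k(query_vec: list[float], store: list[tuple[str, list[float]]], k: int = 3):
    """Return the k (label, vector) pairs closest to query_vec, nearest first."""
    return heapq.nsmallest(k, store, key=lambda item: squared_l2(query_vec, item[1]))


# Toy "vector store": made-up labels and 2-D vectors standing in for embedded chunks.
store = [
    ("report example", [0.9, 0.1]),
    ("definition", [0.8, 0.3]),
    ("company history", [0.1, 0.9]),
    ("who writes reports", [0.6, 0.1]),
]

results = top_k([1.0, 0.0], store, k=3)
for label, _ in results:
    print(label)
```

The entry that points in a very different direction from the query ("company history") is the one left out, much as the NVIDIA chunks do not appear in the results above.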
