ComparEdge Data Loader for LangChain

Pulls SaaS product data from the ComparEdge API into LangChain Documents. Real SaaS pricing — plans, features, ratings. No API key.

Quick start

from comparedge_loader import ComparEdgeLoader

# Load all LLM products
loader = ComparEdgeLoader(category="llm", include_pricing=True)
docs = loader.load()

# Each doc: product name, description, pricing plans
for doc in docs[:3]:
    print(doc.metadata["name"], doc.metadata.get("starting_price"))

Parameters

Param	Type	Default	What it does
`category`	str or None	None	Filter by slug: `"crm"`, `"llm"`, `"project-management"`, etc. `None` = all products
`include_pricing`	bool	True	Add pricing plans to document text + `starting_price` to metadata
`include_features`	bool	False	Append feature list to document text (capped at 20 per product)

Available category slugs

accounting, ai-agents, analytics, bi-tools, cms, crm, customer-support, data-pipeline, design, devops, email-marketing, erp, helpdesk, hr, llm, marketing-automation, monitoring, project-management, sales, security, seo, social-media, storage, video-conferencing, and more

Full list: GET https://comparedge-api.up.railway.app/api/v1/categories

Document schema

page_content: Markdown-formatted text with product name, category, description, optional pricing table, optional features list.

metadata:

Key	Type	Description
`source`	str	Canonical URL on comparedge.com
`name`	str	Product display name
`slug`	str	URL-safe identifier
`category`	str	Category slug
`g2_rating`	float or null	G2 crowd rating
`has_free_tier`	bool	Product has a free plan
`starting_price`	float	Lowest paid plan price (when `include_pricing=True`)
`website`	str	Vendor homepage

Sample document

# Notion
Category: project-management

All-in-one workspace for notes, docs, and projects.

## Pricing
- Free: Free
- Plus: $12/user/mo
- Business: $18/user/mo
- Enterprise: Free

Use cases

RAG pipeline for software recommendation chatbots
Automated vendor evaluation reports
Price monitoring agents
SaaS stack analysis
Competitive intelligence

Pagination

The loader paginates automatically. All matching products are streamed via lazy_load() without loading the full dataset into memory at once.

# Memory-efficient streaming
loader = ComparEdgeLoader()
for doc in loader.lazy_load():
    index(doc)

API

Base URL: https://comparedge-api.up.railway.app/api/v1

No auth required. Be reasonable with request rate.

Docs: https://comparedge-api.up.railway.app/docs

Integration with LangChain (PR target)

This loader targets langchain_community.document_loaders. The PR-ready file is at langchain_pr/comparedge.py.

Expected import after merge:

from langchain_community.document_loaders import ComparEdgeLoader

Testing

# Unit tests (mocked, no network)
python langchain_pr/test_comparedge.py

# Live test against the API
python example.py

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
langchain_pr		langchain_pr
README.md		README.md
comparedge_loader.py		comparedge_loader.py
example.py		example.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComparEdge Data Loader for LangChain

Quick start

Parameters

Available category slugs

Document schema

Sample document

Use cases

Pagination

API

Integration with LangChain (PR target)

Testing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ComparEdge Data Loader for LangChain

Quick start

Parameters

Available category slugs

Document schema

Sample document

Use cases

Pagination

API

Integration with LangChain (PR target)

Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages