Claimify ESG

Greenwashing detection for oil and gas sustainability reports. Scores corporate climate claims against NGO evidence and flags contradictions — with traceable rationale for every verdict.

What it does

The pipeline ingests sustainability reports from ten oil and gas majors, extracts climate claims, retrieves relevant evidence from NGO sources (Carbon Tracker, InfluenceMap, ClientEarth, Global Witness, The Guardian), and scores each claim as well_substantiated, weakly_substantiated, or contradicted. An LLM generates a plain-language rationale for each verdict grounded in the retrieved evidence.

The frontend has three views:

Claims — browse and filter scored claims per company with full evidence trails
Compare — head-to-head claim analysis across companies and categories
Tracker — historical pledge tracking with fulfilment verdicts

Tech stack

Layer	Tools
Ingestion	pdfplumber, BeautifulSoup, Guardian API
NLP	ClimateBERT (relevance filter), OpenAI (claim extraction + rationale)
Retrieval	SBERT embeddings, cross-encoder reranking
Frontend	React 18, Vite, Tailwind CSS
Serving	nginx (Docker), Vercel (hosted)

Getting started

Frontend

cd frontend
npm install
npm run dev       # http://localhost:5173

Python pipeline

python -m venv .venv
.venv\Scripts\activate        # Windows
# source .venv/bin/activate   # macOS/Linux

pip install -e ".[nlp]"
cp .env.example .env          # add OPENAI_API_KEY and GUARDIAN_API_KEY

Run the full pipeline:

python -m claimify.ingestion.download_reports
python -m claimify.retrieval.run_retrieval
python -m claimify.scoring.run_scoring
python -m claimify.scoring.run_rationale
python gen_frontend_data.py

Docker

docker compose up --build     # serves frontend at http://localhost:8080

Project structure

src/claimify/
  ingestion/    PDF download and parsing, NGO feed fetching
  nlp/          Sentence splitting, ClimateBERT filter, claim extraction
  retrieval/    Corpus building, SBERT embeddings, reranking
  scoring/      LLM scorer, rationale generator, eval metrics
  eval/         Hand-labelled dataset and evaluation harness
frontend/src/   React components and utilities
scripts/        Historical claim extraction and tracker data generation
config/         Company list, NGO sources, pipeline settings
data/eval/      60-claim hand-labelled eval set (76.7% accuracy)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
config		config
data/eval		data/eval
docs		docs
frontend		frontend
scripts		scripts
src/claimify		src/claimify
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE (MIT)		LICENSE (MIT)
README.md		README.md
config.yaml		config.yaml
docker-compose.yml		docker-compose.yml
gen_frontend_data.py		gen_frontend_data.py
nginx.conf		nginx.conf
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Claimify ESG

What it does

Tech stack

Getting started

Frontend

Python pipeline

Docker

Project structure

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Claimify ESG

What it does

Tech stack

Getting started

Frontend

Python pipeline

Docker

Project structure

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages