# LeadRankerAI Demo
This notebook shows how to score a raw leads file and inspect the results.

In [None]:
import pandas as pd
from scoring import score_lead

## 1. Load sample leads

In [None]:
df = pd.read_csv("sample_data.csv")
df.head()

## 2. Apply LeadRankerAI scoring

In [None]:
# Score each row from its Title, Industry, Website, and Company columns
df["Lead Score"] = df.apply(
    lambda r: score_lead(r.Title, r.Industry, r.Website, r.Company),
    axis=1,
)
df_sorted = df.sort_values("Lead Score", ascending=False) 
df_sorted.head(10)

## 3. Visualize score distribution

In [None]:
import matplotlib.pyplot as plt 
df["Lead Score"].hist(bins=10) 
plt.xlabel("Score") 
plt.ylabel("Count") 
plt.title("Lead Score Histogram") 
plt.show()

## 4. Export ranked leads

In [None]:
df_sorted.to_csv("scored_leads_demo.csv", index=False) 
print("File saved as scored_leads_demo.csv")

## 5. Next Steps

This demo notebook illustrates a lightweight, rule-based lead scoring framework. It is effective for quick filtering, and it could be extended in several ways for real-world sales or M&A use:

### 🔄 Model Refinement
- **Train a machine learning classifier** (e.g. Naive Bayes or Logistic Regression) on historical lead conversion data to improve scoring accuracy.
- Use embeddings (e.g. from OpenAI or HuggingFace) to better understand titles or company descriptions contextually.
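
The classifier idea above can be sketched with the standard library alone. The following is a minimal multinomial Naive Bayes model over job-title tokens, assuming historical leads are available as (title, converted) pairs and that both outcomes appear in the data. The `HISTORY` rows and the `train_nb` and `predict_proba` names are hypothetical and not part of LeadRankerAI; a library such as scikit-learn would be the more typical choice in practice.

```python
import math
from collections import Counter

# Hypothetical historical data: (job title, 1 if the lead converted else 0).
HISTORY = [
    ("Chief Technology Officer", 1),
    ("VP of Engineering", 1),
    ("Head of Data Science", 1),
    ("Marketing Intern", 0),
    ("Sales Assistant", 0),
    ("Junior Marketing Assistant", 0),
]

def tokenize(title):
    """Lowercase a title and split it on whitespace."""
    return title.lower().split()

def train_nb(rows):
    """Count token frequencies per class for a multinomial Naive Bayes model."""
    class_counts = Counter()
    token_counts = {0: Counter(), 1: Counter()}
    vocab = set()
    for title, label in rows:
        class_counts[label] += 1
        for tok in tokenize(title):
            token_counts[label][tok] += 1
            vocab.add(tok)
    return class_counts, token_counts, vocab

def predict_proba(model, title):
    """Return P(converted | title) using Laplace (add-one) smoothing."""
    class_counts, token_counts, vocab = model
    total = sum(class_counts.values())
    log_scores = {}
    for label in (0, 1):
        log_p = math.log(class_counts[label] / total)  # class prior
        denom = sum(token_counts[label].values()) + len(vocab)
        for tok in tokenize(title):
            if tok in vocab:  # ignore tokens never seen in training
                log_p += math.log((token_counts[label][tok] + 1) / denom)
        log_scores[label] = log_p
    # Normalize the two log scores into a probability.
    top = max(log_scores.values())
    weights = {k: math.exp(v - top) for k, v in log_scores.items()}
    return weights[1] / (weights[0] + weights[1])

model = train_nb(HISTORY)
print(round(predict_proba(model, "VP of Data"), 3))
print(round(predict_proba(model, "Marketing Assistant"), 3))
```

The resulting probability could be blended with, or replace, the rule-based `Lead Score` column once enough labeled conversions exist.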

### 🌐 Data Enrichment
- **Integrate Clearbit, Apollo, or Crunchbase APIs** to pull real-time data like funding stage, team size, tech stack, or recent press coverage.
- Enrich missing or vague fields (e.g. deducing industry from domain name).
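
As a rough illustration of the second bullet, a missing industry can be guessed from keywords in the domain name. This is a sketch under stated assumptions: the `INDUSTRY_KEYWORDS` map and the `extract_domain` and `guess_industry` helpers are hypothetical, and plain substring matching is crude (for example, `med` also matches `media`).

```python
from urllib.parse import urlparse

# Hypothetical keyword-to-industry map; a real list would come from your own data.
INDUSTRY_KEYWORDS = {
    "health": "Healthcare",
    "med": "Healthcare",
    "bank": "Finance",
    "pay": "Finance",
    "cloud": "Software",
    "soft": "Software",
}

def extract_domain(website):
    """Return the lowercase hostname without a leading 'www.'."""
    if "://" not in website:
        website = "//" + website  # lets urlparse treat bare domains as hosts
    host = urlparse(website).hostname or ""
    return host[4:] if host.startswith("www.") else host

def guess_industry(website, current=None):
    """Keep an existing industry value; otherwise guess from domain keywords."""
    if isinstance(current, str) and current.strip():
        return current
    domain = extract_domain(website)
    for keyword, industry in INDUSTRY_KEYWORDS.items():
        if keyword in domain:
            return industry
    return "Unknown"

print(guess_industry("https://www.medcloud.io/about"))
print(guess_industry("acme.com", current="Retail"))
```

Because the first matching keyword wins, the order of entries in the map acts as a priority list.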

### 🧠 Intelligence Layers
- Score based on **intent signals** such as LinkedIn activity or website behavior (if available).
- Detect **AI-readiness indicators** like `.ai` domains, blog content, or job postings.
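
The AI-readiness check above could start as a simple signal count. The sketch below assumes the website URL comes from the `Website` column and that any blog or job-posting text has already been collected elsewhere; the `AI_TERMS` pattern and both function names are hypothetical.

```python
import re
from urllib.parse import urlparse

# Hypothetical keyword pattern; tune it to your own market.
AI_TERMS = re.compile(
    r"\b(ai|machine learning|ml|llm|data scien\w+)\b", re.IGNORECASE
)

def has_ai_domain(website):
    """True if the hostname ends in the .ai top-level domain."""
    if "://" not in website:
        website = "//" + website  # lets urlparse treat bare domains as hosts
    host = urlparse(website).hostname or ""
    return host.endswith(".ai")

def ai_readiness(website, texts=()):
    """Return a 0-2 signal count: .ai domain, AI terms in supplied text."""
    signals = 0
    if has_ai_domain(website):
        signals += 1
    if any(AI_TERMS.search(t) for t in texts):
        signals += 1
    return signals

print(ai_readiness("acme.ai", ["Hiring a Machine Learning engineer"]))
```

The word boundaries in the pattern keep words such as "maintain" from counting as a match for "ai".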

### 🛠️ Productization
- Turn this into a **hosted web app** using Streamlit Community Cloud (formerly Streamlit Sharing), Vercel, or Hugging Face Spaces.
- Add **CSV validation**, email deduplication, or Slack/CRM export options.
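
CSV validation and email deduplication can be prototyped before any web app exists. The sketch below checks for the four columns the scoring cell reads (`Title`, `Industry`, `Website`, `Company`) and drops repeated emails; the `Email` column, the `SAMPLE` data, and the helper names are assumptions, since the sample file's full schema is not shown here.

```python
import csv
import io

# Columns the scoring cell reads; "Email" is an assumed extra column.
REQUIRED_COLUMNS = {"Title", "Industry", "Website", "Company"}

def validate_header(fieldnames):
    """Raise ValueError if any column needed for scoring is missing."""
    missing = REQUIRED_COLUMNS - set(fieldnames or [])
    if missing:
        raise ValueError(f"Missing columns: {sorted(missing)}")

def dedupe_by_email(rows, email_field="Email"):
    """Keep the first row per email, comparing case-insensitively."""
    seen = set()
    unique = []
    for row in rows:
        key = (row.get(email_field) or "").strip().lower()
        if key and key in seen:
            continue  # repeated email: drop this row
        seen.add(key)
        unique.append(row)  # rows without an email are always kept
    return unique

def load_leads(text):
    """Parse CSV text, validate its header, and return deduplicated rows."""
    reader = csv.DictReader(io.StringIO(text))
    validate_header(reader.fieldnames)
    return dedupe_by_email(list(reader))

# Hypothetical sample with one duplicate email and one blank email.
SAMPLE = (
    "Title,Industry,Website,Company,Email\n"
    "CTO,Software,acme.ai,Acme,a@acme.ai\n"
    "CTO,Software,acme.ai,Acme,A@Acme.ai \n"
    "CEO,Finance,bank.com,Bank,\n"
)
print(len(load_leads(SAMPLE)))
```

Failing fast on a bad header gives a clearer error than letting the scoring step raise an `AttributeError` on a missing column.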

---

This notebook provides a clean foundation for experimentation, rapid testing, and future scaling.
