Release v0.3.0 -- LLM Scoring, Fellegi-Sunter, Plugins, Connectors, Streaming, Graph ER · benseverndev-oss/goldenmatch

What's New

GPT-4o-mini scores borderline pairs, boosting product matching from 44.5% to 66.3% F1 (precision 35% -> 95%) for $0.04
Budget caps (max_cost_usd, max_calls), model tiering, graceful degradation
Three-tier: auto-accept (>0.95), LLM judge (0.75-0.95), auto-reject (<0.75)

EM-trained m/u probabilities with Splink-style training (fix u from random pairs, train only m)
Comparison vectors with 2/3/N levels, automatic threshold estimation
98.8% precision on DBLP-ACM -- opt-in for high-precision use cases

Extend with custom scorers, transforms, connectors, and golden strategies
Entry-point discovery: pip install goldenmatch-my-plugin auto-registers
Protocol classes: ScorerPlugin, TransformPlugin, ConnectorPlugin, GoldenStrategyPlugin

Template-based natural language explanations (zero LLM cost)
Per-pair: "Matched because names are phonetically identical, zip codes match exactly"
Per-cluster: summaries with bottleneck identification
Streaming lineage output (no 10K pair cap)

Match within entity types, propagate evidence across relationships
Iterative convergence with configurable propagation modes
"If customer A's orders match customer B's orders, boost the A-B customer score"

Dataset	Strategy	Precision	Recall	F1	Cost
DBLP-ACM	Weighted fuzzy	97.2%	97.1%	97.2%	$0
DBLP-ACM	Fellegi-Sunter	98.8%	57.6%	72.8%	$0
Abt-Buy	Embedding + ANN	35.5%	59.4%	44.5%	$0
Abt-Buy	Embedding + ANN + LLM	95.4%	50.9%	66.3%	$0.04

Scale: 7,823 rec/s at 100K records. 792 tests passing.

pip install goldenmatch
goldenmatch dedupe your_data.csv