Synthetic data quality evaluation with interactive HTML reports.
Added
SynthEvalclass — dual-input (DataFrame or ProfileReport), lazy diff computation- Dataset overview: comparative row/column counts, missing %, duplicate %, type distribution
- Column comparison: per-column stat deltas with distribution overlay charts and trend lines
- Bivariate interactions: scatter plots and box plots with outlier display, dropdown-based column selection
- Correlation comparison: Original, Synthetic, and Diff heatmaps (Pearson, Spearman, Kendall, Cramér's V, Phik)
- Alert changes: new and resolved alerts with trigger values
- Proportion-based charts for fair comparison across different dataset sizes
- Interactive HTML report with ECharts (histogram, bar, line, heatmap, scatter, boxplot)
- Programmatic access:
se.diff,.to_html(),.original_report,.synthetic_report - Dual ingestion:
pl.DataFrameorProfileReport - PEP 561
py.typedmarker
Full Changelog: https://github.com/dataxid/dataxid-syntheval/releases/tag/v0.1.0