AutoITE: Automated Individual Treatment Effect Estimation

A residual-based approach to causal inference that detects latent heterogeneity through baseline coupling, enabling Just-in-Time discovery of treatment effects.

Key Insight

Traditional causal inference methods condition on observed features, but latent confounders create hidden subgroups with dramatically different treatment responses. AutoITE exploits baseline coupling---the fact that latent confounders affect not just treatment response but also baseline outcomes---to discover these hidden subgroups through residual analysis.

Installation

pip install autoite

For additional features:

pip install autoite[viz]        # Visualization (matplotlib, seaborn)
pip install autoite[comparison] # SOTA comparison (econml, lightgbm)
pip install autoite[all]        # Everything

Quick Start

from autoite import AutoITEEstimator, BimodalityDiagnostic

# Fit the model (k="auto" uses 10% of samples by default)
model = AutoITEEstimator()
model.fit(X_train, T_train, Y_train, Y_pre_train)

# Predict individual treatment effects
tau_pred = model.predict(X_test, Y_pre_test)

# Check for hidden subgroups
diag = BimodalityDiagnostic()
diag.fit(X_train, Y_pre_train)
result = diag.quantify_unknown(X_test, Y_pre_test)
print(f"Bimodality Score: {result['bimodality_score']:.4f}")
print(f"Interpretation: {result['interpretation']}")

Architecture

Global Ridge: Baseline model predicting pre-treatment outcomes from features
Residual Computation: Leave-one-out residuals encode latent causal state
Residual Matching: k-NN in residual space finds individuals with similar latent states
Local Ridge: Treatment effects estimated from residual neighbors
Triage: High-uncertainty cases flagged for expert review

Key Results

From the accompanying paper:

Method	Corr(τ̂, U)	Detection Rate	MAE	Median
Causal Forest	0.00	27.3%	0.230	0.042
X-Learner	0.00	27.1%	0.245	0.045
AutoITE	-0.94	97.5%	0.095	0.034

AutoITE achieves 59% lower MAE than Causal Forest (0.095 vs 0.230). With 15% triage, MAE reduces to 0.042—only 18% of Causal Forest's error—and deaths drop from 8 to 5.

Components

AutoITEEstimator

Core estimator for individual treatment effect prediction.

k: Number of residual neighbors (default: "auto" = 10% of samples)
alpha_global: Ridge regularization for global model (default: 1.0)
alpha_local: Ridge regularization for local models (default: 0.01)
triage_percentile: Fraction of high-uncertainty cases to flag

BimodalityDiagnostic

Detects hidden subgroups via GMM-based residual analysis.

Bimodality score < 0.01: No hidden structure
Bimodality score 0.01-0.05: Weak structure
Bimodality score 0.05-0.10: Moderate structure
Bimodality score > 0.10: Strong hidden structure (likely latent confounder)

UnexplainedHeterogeneityIndex

Measures whether local models improve over global, indicating heterogeneity not captured by observed features.

Reproducing Paper Results

cd experiments/paper_experiments
python run_all_experiments.py

Real-World Validation

The UCI Student Performance experiment demonstrates AutoITE on real educational data:

cd experiments/paper_experiments
python uci_student_intervention.py

Fundamental Limits

AutoITE can detect latent confounders that affect baseline outcomes (baseline coupling). However, interaction-only confounders---those that affect ONLY treatment response without leaving baseline fingerprints---are fundamentally undetectable by any observational method.

Paper

See paper/auto_ite_final.pdf for the full manuscript:

AutoITE: Residual-Based Individual Treatment Effect Estimation via Baseline Coupling

Jake Peace, November 2025

Data Attribution

UCI Student Performance Dataset

The real-world validation uses the Student Performance dataset from the UCI Machine Learning Repository, provided under the CC BY 4.0 license.

Creator: Paulo Cortez
Source: https://archive.ics.uci.edu/dataset/320/student+performance
DOI: 10.24432/C5TG7T
Citation: P. Cortez and A. Silva. Using Data Mining to Predict Secondary School Student Performance. In A. Brito and J. Teixeira Eds., Proceedings of 5th FUture BUsiness TEChnology Conference (FUBUTEC 2008) pp. 5-12, Porto, Portugal, April, 2008, EUROSIS, ISBN 978-9077381-39-7.

License

MIT License - see LICENSE file for details.

Author

Jake Peace (2025)

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
autoite		autoite
experiments/paper_experiments		experiments/paper_experiments
paper		paper
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AutoITE: Automated Individual Treatment Effect Estimation

Key Insight

Installation

Quick Start

Architecture

Key Results

Components

AutoITEEstimator

BimodalityDiagnostic

UnexplainedHeterogeneityIndex

Reproducing Paper Results

Real-World Validation

Fundamental Limits

Paper

Data Attribution

UCI Student Performance Dataset

License

Author

About

Uh oh!

Releases

Packages

Languages

License

HotProtato/AutoITE

Folders and files

Latest commit

History

Repository files navigation

AutoITE: Automated Individual Treatment Effect Estimation

Key Insight

Installation

Quick Start

Architecture

Key Results

Components

AutoITEEstimator

BimodalityDiagnostic

UnexplainedHeterogeneityIndex

Reproducing Paper Results

Real-World Validation

Fundamental Limits

Paper

Data Attribution

UCI Student Performance Dataset

License

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages