matchatr

matchatr provides causal inference for (matched) case-control, nested case-control (NCC), and case-cohort study designs. It pairs design-faithful classical estimators with marginal causal effects and integrates with the etverse ecosystem, delegating estimation to causatr (g-computation / IPW / AIPW with sandwich and bootstrap variance) and survatr (causal survival on person-period data).

Status: the classical odds-ratio engines are landing. The design taxonomy, the two-step matcha() / contrast() API, and the (design, estimator) dispatch (PHASE_1) are in place. The classical odds-ratio engines now run end to end: the unmatched case-control logistic and Mantel-Haenszel ORs (PHASE_2); the matched case-control conditional-logistic and McNemar ORs, with stratum-specific effect modification (PHASE_3); and the polytomous subtype ORs for multi-group outcomes (PHASE_4), which come with a pooled common OR and test_homogeneity(), a Wald test of whether the exposure OR is constant across subtypes. See the articles for worked examples. The time-to-event sampling designs and the marginal causal-weighting / survival layer (PHASE_5–PHASE_20) remain at the design stage.

What it does

Two orthogonal axes: a design object encodes the sampling structure (strata, matching ratio, time scale, prevalence, inclusion weights); an estimator chooses the analysis.

Design	Classical estimand	Causal (marginal) estimand
Unmatched case-control	conditional OR, Mantel-Haenszel	RD / RR / marginal OR (case-control weighting)
Matched case-control	conditional OR (conditional logistic)	RD / RR via standardization
Nested case-control	risk-set HR; Samuelsen IPW Cox	marginal survival contrasts (design-weighted)
Case-cohort	Prentice / Self-Prentice / Borgan HR	absolute risk, RD(t), RMST
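As a point of reference for the "Mantel-Haenszel" entry in the table, here is a minimal base-R sketch of the common odds ratio across strata. It does not use matchatr: it calls stats::mantelhaen.test() and compares it with the textbook formula sum(a*d/n) / sum(b*c/n). Treating infert as unmatched and stratifying on education is an illustrative assumption, chosen only to get a small 2 x 2 x K table.

```r
# Sketch only: base-R Mantel-Haenszel common odds ratio, not matchatr code.
# Assumption: `infert` is treated as unmatched and stratified on education,
# purely to obtain a small 2 x 2 x K table.
exposed <- factor(infert$spontaneous > 0, levels = c(FALSE, TRUE),
                  labels = c("unexposed", "exposed"))
case <- factor(infert$case, levels = c(0, 1), labels = c("control", "case"))
tab <- table(exposed, case, infert$education)

# Hand-rolled MH estimator: sum(a * d / n) / sum(b * c / n) over strata,
# where a = exposed cases, d = unexposed controls,
#       b = exposed controls, c = unexposed cases.
n_k <- apply(tab, 3, sum)
or_mh_manual <- sum(tab[2, 2, ] * tab[1, 1, ] / n_k) /
  sum(tab[2, 1, ] * tab[1, 2, ] / n_k)

# Reference implementation from the stats package.
mh <- mantelhaen.test(tab)
or_mh <- unname(mh$estimate)

c(manual = or_mh_manual, stats = or_mh)
```

The two numbers should coincide, since mantelhaen.test() reports the same estimator for 2 x 2 x K tables.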

Marginal causal effects use case-control weighting (the Rose & van der Laan g-formula / IPW / AIPW / TMLE family) and design-based inclusion weighting (Samuelsen, Borgan): the weights are passed as observation weights into the etverse engines, so they compose directly with existing estimators.
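To make the weighting idea concrete, the following base-R sketch builds case-control weights and feeds them into a weighted g-computation. It is an illustration under stated assumptions, not matchatr or causatr code: the data are simulated, the variable names (cc, x, age, w) are hypothetical, and the weights follow the convention usually attributed to Rose & van der Laan, where cases get weight q0 and controls get (1 - q0) / J, with J the number of controls per case.

```r
# Sketch only: case-control weighting with an assumed prevalence q0.
# All names and data here are hypothetical; this is not matchatr's API.
set.seed(1)
n_case <- 500
n_ctrl <- 1500
cc <- data.frame(
  case = rep(c(1, 0), times = c(n_case, n_ctrl)),
  age  = rnorm(n_case + n_ctrl)
)
# Simulated binary exposure, more common among cases.
cc$x <- rbinom(nrow(cc), 1, plogis(-0.5 + 0.8 * cc$case + 0.3 * cc$age))

q0 <- 0.02                      # assumed source-population prevalence
J  <- n_ctrl / n_case           # controls per case
cc$w <- ifelse(cc$case == 1, q0, (1 - q0) / J)

# Under these weights the weighted case fraction equals q0 by construction.
w_prev <- weighted.mean(cc$case, cc$w)

# Weighted outcome model; quasibinomial avoids non-integer-weight warnings.
fit <- glm(case ~ x + age, family = quasibinomial(), data = cc, weights = w)

# g-computation: predict everyone under x = 1 and x = 0, then take
# weighted means so the covariate distribution reflects the source population.
p1 <- predict(fit, newdata = transform(cc, x = 1), type = "response")
p0 <- predict(fit, newdata = transform(cc, x = 0), type = "response")
risk1 <- weighted.mean(p1, cc$w)
risk0 <- weighted.mean(p0, cc$w)

c(RD = risk1 - risk0, RR = risk1 / risk0)
```

The model-based standard errors from this glm() fit are not valid for the marginal contrasts; sandwich or bootstrap variance would be needed for inference.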

Installation

You can install the development version of matchatr from GitHub with:

# install.packages("pak")
pak::pak("etverse/matchatr")

Example

library(matchatr)

# Matched case-control -> conditional odds ratio (infert: a matched study of
# spontaneous/induced abortion and infertility, matched on age and parity).
fit <- matcha(
  infert,
  outcome = "case", exposure = "spontaneous",
  design = matched_cc(strata = "stratum"),
  confounders = ~ induced, estimator = "clogit"
)

contrast(fit, type = "or")
#> <matchatr_result>
#>  Estimator:  clogit  (engine: clogit)
#>  Estimand:   conditional OR
#>  Contrast:   Odds ratio
#>  CI method:  model
#>  N:          248
#> 
#> Contrasts:
#>     comparison estimate     se ci_lower ci_upper
#>         <char>    <num>  <num>    <num>    <num>
#> 1: spontaneous 7.285423 2.5677 3.651357 14.53635
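The estimate above can be cross-checked against a direct conditional-logistic fit. The sketch below assumes the survival package is installed and assumes that the "clogit" engine corresponds to survival::clogit(); the output above only names the engine, so treat that mapping as a guess. The standard error on the OR scale is obtained by the delta method (OR times the log-scale standard error).

```r
# Sketch only: independent cross-check with survival::clogit().
# Assumption: matchatr's "clogit" engine fits the same model.
library(survival)

ref <- clogit(case ~ spontaneous + induced + strata(stratum), data = infert)

b    <- coef(ref)[["spontaneous"]]                        # log odds ratio
se_b <- sqrt(vcov(ref)["spontaneous", "spontaneous"])     # log-scale SE
z    <- qnorm(0.975)

check <- data.frame(
  comparison = "spontaneous",
  estimate   = exp(b),
  se         = exp(b) * se_b,      # delta-method SE on the OR scale
  ci_lower   = exp(b - z * se_b),  # Wald interval, back-transformed
  ci_upper   = exp(b + z * se_b)
)
check
```

If the engines match, the row printed here should agree with the contrast(fit, type = "or") output up to rounding.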

The marginal causal contrasts (case-control weighting) will reuse the same two-step API once a source-population prevalence q0 is supplied. They are not implemented yet (see the Roadmap below), so the following shows the planned interface:

fit <- matcha(
  data,
  outcome = "case", exposure = "x",
  design = unmatched_cc(prevalence = 0.02),   # source-population prevalence q0
  confounders = ~ age + smoke, estimator = "ccw_gformula"
)
contrast(fit, type = "difference", ci_method = "sandwich")

Roadmap

The design is documented in PHASE_1–PHASE_20 at the repository root, mapping the Handbook of Statistical Methods for Case-Control Studies (Borgan et al., 2018) to an implementation plan. See CLAUDE.md for the phase index and FEATURE_COVERAGE_MATRIX.md for what is implemented and tested.

Part of the etverse

matchatr is one package in the etverse family for causal inference and methodological triangulation, alongside causatr (causal effect estimation) and survatr (causal survival analysis).
