# Objective 

This study applies **Balanced Risk Set Matching** to improve causal inference in observational studies where treatment is assigned according to a patient's evolving condition rather than by randomization. With **risk set matching**, each treated patient is paired with a not-yet-treated patient who had a similar symptom history up to the time of treatment, so the pair is comparable at the moment of intervention. **Integer programming** is then used to balance the distributions of key covariates across the matched groups, reducing bias in the estimated treatment effect. The approach is applied to the effect of cystoscopy and hydrodistention on interstitial cystitis symptoms, and a **sensitivity analysis** assesses how robust the findings are to hidden bias. The overall goal is to make treatment comparisons in non-randomized medical research more valid.

# Workflow

### Step 1: Data Collection & Preprocessing
- Load the dataset, ensuring it contains treatment times, symptom histories, and follow-up measures.
- Standardize symptom measures for comparability.
- Identify treated vs. untreated patients and structure data for time-sequenced analysis.
- Output: Cleaned dataset with time-ordered symptom histories and treatment indicators.
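
The preprocessing steps above can be sketched with pandas. This is a minimal illustration, not the study's actual pipeline: the long-format layout and the column names (`patient_id`, `time`, `pain`, `urgency`, `treatment_time`) are assumptions, and a missing `treatment_time` is assumed to mean the patient was never treated during follow-up.

```python
import numpy as np
import pandas as pd

def preprocess(df, symptom_cols):
    """Sort visits by patient and time, z-score symptoms, flag treatment status."""
    out = df.sort_values(["patient_id", "time"]).reset_index(drop=True)
    for col in symptom_cols:
        mean = out[col].mean()
        std = out[col].std(ddof=0)
        # Standardize so symptoms on different scales are comparable.
        out[col + "_z"] = (out[col] - mean) / std if std > 0 else 0.0
    # Assumption: NaN treatment_time means "never treated during follow-up".
    out["ever_treated"] = out["treatment_time"].notna()
    # True only for visits at or after the patient's own treatment time.
    out["treated_by_now"] = out["ever_treated"] & (out["time"] >= out["treatment_time"])
    return out

# Hypothetical toy data: three patients, visits at months 0 and 3.
visits = pd.DataFrame({
    "patient_id": [2, 1, 1, 2, 3, 3],
    "time": [3, 0, 3, 0, 0, 3],
    "pain": [4.0, 6.0, 7.0, 3.0, 5.0, 5.0],
    "urgency": [2.0, 5.0, 6.0, 3.0, 4.0, 4.0],
    "treatment_time": [np.nan, 3, 3, np.nan, np.nan, np.nan],
})
clean = preprocess(visits, ["pain", "urgency"])
```

Keeping both `ever_treated` and `treated_by_now` matters for the next step, because a patient who is treated later can still serve as a control at an earlier time.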

### Step 2: Risk Set Matching
- Identify the risk set for each treated patient: patients not yet treated at that time whose symptom histories up to the time of treatment are similar.
- Use only measurements recorded up to the time of treatment for matching, never later data.
- Compute Mahalanobis distance to measure similarity between treated and untreated patients.
- Output: Initial pool of potential matches.
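
A minimal sketch of the risk set and distance computation follows. The data layout is hypothetical: `history` maps each patient to a dict of `{time: symptom vector}`, `treat_time` uses `np.inf` for never-treated patients, and the covariates are assumed to be the earliest and the latest measurement at or before the treatment time. The pseudo-inverse is used only so the toy example works with a very small sample.

```python
import numpy as np

def risk_set(treat_time, treated_id):
    """Patients not yet treated when `treated_id` is treated."""
    t = treat_time[treated_id]
    return [p for p, tt in treat_time.items() if p != treated_id and tt > t]

def covariates(history, patient, t):
    """Earliest and latest symptom vectors observed at or before time t."""
    times = sorted(s for s in history[patient] if s <= t)
    return np.concatenate([history[patient][times[0]], history[patient][times[-1]]])

def distances_to_risk_set(history, treat_time, treated_id):
    """Mahalanobis distance from one treated patient to each risk set member."""
    t = treat_time[treated_id]
    controls = risk_set(treat_time, treated_id)
    X = np.array([covariates(history, p, t) for p in [treated_id] + controls])
    VI = np.linalg.pinv(np.cov(X, rowvar=False))  # pseudo-inverse: small-sample safe
    diffs = X[1:] - X[0]
    d2 = np.einsum("ij,jk,ik->i", diffs, VI, diffs)
    return dict(zip(controls, np.sqrt(np.maximum(d2, 0.0))))

# Hypothetical toy data: A is treated at month 3, E was treated at month 0.
history = {
    "A": {0: np.array([5.0, 6.0]), 3: np.array([7.0, 8.0])},
    "B": {0: np.array([5.0, 6.0]), 3: np.array([7.0, 8.0])},
    "C": {0: np.array([2.0, 3.0]), 3: np.array([2.0, 2.0]), 6: np.array([9.0, 9.0])},
    "D": {0: np.array([6.0, 5.0]), 3: np.array([9.0, 4.0])},
    "E": {0: np.array([8.0, 8.0])},
}
treat_time = {"A": 3, "B": np.inf, "C": 6, "D": np.inf, "E": 0}
dist_A = distances_to_risk_set(history, treat_time, "A")
```

Here C is treated at month 6, so C is still eligible as a control for A at month 3, and C's month 6 measurement is ignored. E is excluded because E was already treated.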

### Step 3: Optimal Matching via Integer Programming
- Implement integer programming to:
    - Minimize Mahalanobis distance between treated and control pairs.
    - Ensure balanced covariate distributions across groups.
- Use network flow optimization to solve the distance-only matching problem efficiently; the added balance requirement is what calls for integer programming.
- Output: Finalized matched dataset with treatment-control pairs.
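
One way to express the balanced matching problem is as a small mixed-integer program. The sketch below uses `scipy.optimize.milp` (available in SciPy 1.9 and later) and is an illustration under stated assumptions, not the study's actual formulation: the function name `balanced_match`, the `penalty` weight, and the use of binary covariate indicators are all hypothetical choices. Pairs outside the risk set are assumed to be marked with an infinite distance.

```python
import numpy as np
from scipy.optimize import milp, LinearConstraint, Bounds

def balanced_match(dist, treated_bin, control_bin, penalty=100.0):
    """Pair each treated unit with one control, penalizing covariate imbalance.

    dist: (n_t, n_c) distances, np.inf for forbidden pairs.
    treated_bin, control_bin: 0/1 indicator matrices, one column per covariate.
    """
    n_t, n_c = dist.shape
    n_k = treated_bin.shape[1]
    n_x = n_t * n_c                      # pair variable x[i, j] sits at index i*n_c + j
    n_var = n_x + 2 * n_k                # plus two gap variables per covariate
    allowed = np.isfinite(dist)
    cost = np.where(allowed, dist, 0.0).ravel()
    c = np.concatenate([cost, np.full(2 * n_k, penalty)])

    rows, lb, ub = [], [], []
    for i in range(n_t):                 # each treated unit matched exactly once
        r = np.zeros(n_var); r[i * n_c:(i + 1) * n_c] = 1
        rows.append(r); lb.append(1); ub.append(1)
    for j in range(n_c):                 # each control used at most once
        r = np.zeros(n_var); r[j:n_x:n_c] = 1
        rows.append(r); lb.append(0); ub.append(1)
    for k in range(n_k):                 # matched control count - gap_plus + gap_minus = treated count
        r = np.zeros(n_var)
        r[:n_x] = np.tile(control_bin[:, k], n_t)
        r[n_x + 2 * k] = -1
        r[n_x + 2 * k + 1] = 1
        target = treated_bin[:, k].sum()
        rows.append(r); lb.append(target); ub.append(target)

    integrality = np.concatenate([np.ones(n_x), np.zeros(2 * n_k)])
    upper = np.concatenate([allowed.ravel().astype(float), np.full(2 * n_k, np.inf)])
    res = milp(c, integrality=integrality, bounds=Bounds(np.zeros(n_var), upper),
               constraints=LinearConstraint(np.array(rows), lb, ub))
    if not res.success:
        raise RuntimeError(res.message)
    x = res.x[:n_x].reshape(n_t, n_c).round().astype(int)
    return [(i, int(x[i].argmax())) for i in range(n_t)]

# Hypothetical toy problem: 2 treated, 3 controls, one binary covariate.
dist = np.array([[1.0, 2.0, 5.0],
                 [2.0, 1.0, 5.0]])
treated_bin = np.array([[1], [1]])
control_bin = np.array([[0], [1], [1]])
pairs_balanced = balanced_match(dist, treated_bin, control_bin, penalty=100.0)
pairs_distance_only = balanced_match(dist, treated_bin, control_bin, penalty=0.0)
```

With `penalty=0.0` the solver picks the closest pairs regardless of balance. With a large penalty it accepts a larger total distance so that the matched controls carry the binary covariate as often as the treated patients do.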


### Step 4: Sensitivity Analysis for Hidden Bias 
- Posit a hypothetical unobserved covariate that affects who gets treated, to model hidden bias.
- Evaluate how much hidden bias would be needed to invalidate the results.
- Conduct proportional hazards modeling to analyze potential confounders.
- Output: Sensitivity bounds showing how the treatment effect conclusions change under assumed levels of hidden bias.
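
As an illustration of the "how much hidden bias" question, the sketch below computes a Rosenbaum-style upper bound on the one-sided p-value of the Wilcoxon signed-rank statistic, using the usual large-sample normal approximation. The symbol `gamma` is an assumption introduced here: it bounds how much an unobserved covariate could multiply the odds of treatment within a matched pair, with `gamma = 1` meaning no hidden bias. This is a swapped-in textbook bound, not necessarily the exact procedure used in the study, and it does not handle tied ranks specially.

```python
import math
import numpy as np
from scipy.stats import rankdata, norm

def rosenbaum_upper_pvalue(diffs, gamma):
    """Upper bound on the one-sided signed-rank p-value when hidden bias is at most gamma."""
    d = np.asarray(diffs, dtype=float)
    d = d[d != 0]                           # zero differences carry no sign information
    n = len(d)
    ranks = rankdata(np.abs(d))
    t_obs = ranks[d > 0].sum()              # sum of ranks of positive differences
    p_plus = gamma / (1.0 + gamma)          # worst-case chance a pair difference is positive
    mean = p_plus * n * (n + 1) / 2
    var = p_plus * (1 - p_plus) * n * (n + 1) * (2 * n + 1) / 6
    return float(norm.sf((t_obs - mean) / math.sqrt(var)))

# Hypothetical treated-minus-control differences for 12 matched pairs.
pair_diffs = [-1, 2, 3, -4, 5, 6, 7, 8, 9, 10, 11, 12]
bounds = {g: rosenbaum_upper_pvalue(pair_diffs, g) for g in (1.0, 2.0, 3.0)}
```

Reading the result: the smallest `gamma` at which the bound crosses the chosen significance level indicates how strong hidden bias would have to be to explain away the observed effect.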

### Step 5: Statistical Analysis & Interpretation
- Perform hypothesis testing to compare treatment vs. control groups:
    - Wilcoxon Signed-Rank Test for pairwise comparisons.
    - Permutation tests to validate significance.
    - Multivariate analysis if multiple symptom outcomes are evaluated.
- Generate visualizations (boxplots, histograms) to inspect trends in symptom changes.
- Output: Statistical validation of treatment effects.
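
The paired tests above can be sketched as follows. `scipy.stats.wilcoxon` provides the signed-rank test; for the permutation test, a sign-flip scheme on the mean pair difference is assumed here as one reasonable choice, and the helper name `sign_flip_pvalue` is hypothetical.

```python
import numpy as np
from scipy.stats import wilcoxon

def sign_flip_pvalue(diffs, n_perm=10000, seed=0):
    """Two-sided Monte Carlo permutation p-value for the mean paired difference."""
    rng = np.random.default_rng(seed)
    d = np.asarray(diffs, dtype=float)
    observed = abs(d.mean())
    # Under the null, each pair's difference is equally likely to have either sign.
    signs = rng.choice([-1.0, 1.0], size=(n_perm, len(d)))
    perm_means = np.abs((signs * d).mean(axis=1))
    # Add-one correction keeps the estimate strictly positive.
    return float((1 + (perm_means >= observed - 1e-12).sum()) / (n_perm + 1))

# Hypothetical treated-minus-control symptom changes for 12 matched pairs.
pair_changes = np.array([-1, 2, 3, -4, 5, 6, 7, 8, 9, 10, 11, 12], dtype=float)
wilcoxon_p = wilcoxon(pair_changes).pvalue
perm_p = sign_flip_pvalue(pair_changes)
```

Agreement between the two p-values is a useful sanity check, since the permutation test does not rely on the signed-rank statistic's reference distribution.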


### Step 6: Reporting & Conclusion
- Summarize methodology, findings, and potential biases.
- Present results through:
    - Summary tables (descriptive statistics).
    - Graphs & charts (symptom trends over time).
    - Sensitivity analysis conclusions (robustness of findings).
- Output: Final research paper/report with validated conclusions.

