In [6]:
from IPython.display import display, Markdown, Latex
from dotenv import load_dotenv
load_dotenv()

True

In [1]:
# main function
from digester import ask_claude_about_paper

Change this URL to point at the paper you want summarized.

In [2]:
paper_url = "https://paulgp.github.io/papers/gphk_feb152024.pdf"

In [3]:
addl_prompt = "Walk through a simple example at the end."
pgp_prompt = f"""

We are going to create a two-page reader's digest summary of the attached paper. The goal of this summary is:
1. What the researchers do, in language that is not too dense but is sufficiently mathematically precise and clear to Ph.D.-level economists.
2. Why does the literature care about this? Why is it a contribution, and where does it fit?
3. What is a key figure or theorem from the paper?

{addl_prompt}

The headline paragraph of the summary should state "what is the one thing to learn from this paper?".
Your output should be in markdown.

"""

In [4]:
%%time
claude_response = ask_claude_about_paper(paper_url, pgp_prompt)

CPU times: user 55 ms, sys: 17.3 ms, total: 72.3 ms
Wall time: 33.6 s


In [7]:
Markdown(claude_response)

# Reader's Digest: Contamination Bias in Linear Regressions

**Key Takeaway**: When running linear regressions with multiple treatments and controls, the coefficient on each treatment is generally contaminated by a non-convex weighted average of other treatments' effects, even when controls are flexible enough to avoid omitted variable bias. This contamination bias arises because linear regression imposes additive separability between treatments that may not hold in reality.

## What They Do
The authors study how multiple-treatment regression coefficients combine heterogeneous treatment effects. While Angrist (1998) showed that regressions with a single binary treatment estimate convex averages of treatment effects, this paper proves this property fails with multiple treatments. They derive a general decomposition showing each treatment coefficient equals:

1. A weighted average of its own treatment effects (with weights that average to one)
2. Plus contamination terms from other treatments' effects (with weights that average to zero)

The contamination arises because regression's linear specification incorrectly imposes additive separability between treatments. For example, with mutually exclusive treatments, being in one treatment group mechanically precludes being in another, creating an inherently non-linear relationship.
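The mechanism can be illustrated with a minimal, noise-free sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two-stratum design, the cell sizes, the effect sizes, and the helper names `build_design` and `treatment1_coef`. Treatment 1 has a constant effect of 1 in every stratum, so any departure of its regression coefficient from 1 is attributable to treatment 2's heterogeneous effect.

```python
import numpy as np

def build_design():
    """Hypothetical deterministic design: two strata, three mutually exclusive arms."""
    # (stratum, n_control, n_treatment1, n_treatment2); sizes are illustrative only
    cells = [(0, 500, 450, 50), (1, 500, 50, 450)]
    w, arm = [], []
    for stratum, n0, n1, n2 in cells:
        w += [stratum] * (n0 + n1 + n2)
        arm += [0] * n0 + [1] * n1 + [2] * n2
    return np.array(w), np.array(arm)

def treatment1_coef(tau2_by_stratum):
    """OLS coefficient on treatment 1, controlling for the stratum and treatment 2."""
    w, arm = build_design()
    x1 = (arm == 1).astype(float)
    x2 = (arm == 2).astype(float)
    tau1 = 1.0                                       # constant effect of treatment 1
    tau2 = np.asarray(tau2_by_stratum, dtype=float)[w]
    y = 0.5 * w + tau1 * x1 + tau2 * x2              # noise-free outcome
    X = np.column_stack([np.ones_like(x1), w, x1, x2])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[2]

# Treatment 2 effect is the same in both strata: the linear model is exact.
beta1_homog = treatment1_coef([2.0, 2.0])
# Treatment 2 effect differs across strata: treatment 1's coefficient shifts.
beta1_heterog = treatment1_coef([0.0, 2.0])

print(f"homogeneous tau2:   beta1 = {beta1_homog:.4f}")
print(f"heterogeneous tau2: beta1 = {beta1_heterog:.4f}")
```

Because the outcome has no noise and treatment 1's effect never varies, the gap between the two printed coefficients is pure contamination from treatment 2 in this toy design.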

## Why We Care
This matters because multiple-treatment regressions are ubiquitous in economics:
- Stratified RCTs with varying treatment probabilities 
- Education value-added models comparing teachers/schools
- Judge IV designs with multiple instruments
- Difference-in-differences specifications with multiple periods

The paper shows these common specifications may not estimate what researchers think they do, even absent traditional omitted-variable concerns. It also offers three practical solutions:
1. Targeting unweighted average treatment effects
2. Using "easiest-to-estimate" weighted effects
3. Employing common weights across treatments
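As a rough illustration of the first option only, the sketch below averages within-stratum differences in means using stratum shares as weights. This is a simple stand-in that assumes discrete strata as the only controls. It is not necessarily the estimator the paper implements, and the function name `stratified_ate` and the toy data are hypothetical.

```python
import numpy as np

def stratified_ate(y, arm, w, treated_arm, control_arm=0):
    """Share-weighted average of within-stratum treated-minus-control mean differences."""
    ate = 0.0
    for s in np.unique(w):
        in_s = w == s
        diff = (y[in_s & (arm == treated_arm)].mean()
                - y[in_s & (arm == control_arm)].mean())
        ate += in_s.mean() * diff        # weight each stratum by its sample share
    return ate

# Toy data (illustrative): two equal-sized strata, three mutually exclusive arms.
w = np.repeat([0, 1], 1000)
arm = np.concatenate([
    np.repeat([0, 1, 2], [500, 450, 50]),   # stratum 0
    np.repeat([0, 1, 2], [500, 50, 450]),   # stratum 1
])
tau1 = np.array([1.0, 1.0])[w]              # treatment 1 effect by stratum
tau2 = np.array([0.0, 2.0])[w]              # treatment 2 effect by stratum
y = 0.5 * w + tau1 * (arm == 1) + tau2 * (arm == 2)   # noise-free outcome

ate1 = stratified_ate(y, arm, w, treated_arm=1)
ate2 = stratified_ate(y, arm, w, treated_arm=2)
print(ate1, ate2)
```

Each contrast here compares one arm with the control group inside a stratum, so the other treatment's effects never enter it. The cost is precision when some stratum-by-arm cells are small.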

## Key Result: Proposition 1
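The paper's main theoretical result decomposes the regression coefficient $\beta_k$ as:

$$\beta_k = E[\lambda_{kk}(W_i)\,\tau_k(W_i)] + \sum_{\ell \neq k} E[\lambda_{k\ell}(W_i)\,\tau_\ell(W_i)]$$

Where:
- $\tau_k(W_i)$ is the conditional effect of treatment $k$ given the controls $W_i$
- $\lambda_{kk}(W_i)$ are own-treatment weights, which average to one: $E[\lambda_{kk}(W_i)] = 1$
- $\lambda_{k\ell}(W_i)$ are contamination weights, which average to zero: $E[\lambda_{k\ell}(W_i)] = 0$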
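The decomposition can be checked numerically in a small sample. The sketch below assumes the weights take the partialled-out form in which the treatment-k indicator is residualized on the controls and the other treatments, then multiplied by each treatment indicator and scaled by the residual variance. That form, the simulated design, and all variable names are assumptions for illustration. With saturated stratum dummies and a noise-free outcome, the sample analogue of the identity holds exactly.

```python
import numpy as np

rng = np.random.default_rng(0)
n_strata, n_per = 3, 400
w = np.repeat(np.arange(n_strata), n_per)

# Arm probabilities (control, treatment 1, treatment 2) by stratum; illustrative values.
probs = np.array([[0.50, 0.45, 0.05],
                  [0.50, 0.25, 0.25],
                  [0.50, 0.05, 0.45]])
arm = np.array([rng.choice(3, p=probs[s]) for s in w])

X = np.column_stack([arm == 1, arm == 2]).astype(float)   # treatment indicators
W = np.eye(n_strata)[w]                                   # saturated stratum dummies

# tau[s, l]: effect of treatment l+1 in stratum s (hypothetical numbers).
tau = np.array([[1.0, 0.0],
                [0.5, 1.0],
                [2.0, 3.0]])
y = 0.3 * w + (X * tau[w]).sum(axis=1)                    # noise-free outcome

def residualize(target, regressors):
    """Residual from an OLS projection of target on regressors."""
    coef, *_ = np.linalg.lstsq(regressors, target, rcond=None)
    return target - regressors @ coef

k = 0                                                     # examine treatment 1
others = np.delete(X, k, axis=1)
x_res = residualize(X[:, k], np.column_stack([W, others]))
denom = x_res @ x_res

# lam[s, l]: stratum s's contribution to the weight on treatment l's effect,
# already scaled by the stratum's sample share, so column sums mimic E[lambda].
lam = np.array([[x_res[w == s] @ X[w == s, l] for l in range(X.shape[1])]
                for s in range(n_strata)]) / denom

beta_decomp = (lam * tau).sum()
beta_ols = np.linalg.lstsq(np.column_stack([W, X]), y, rcond=None)[0][n_strata + k]

print("own weights sum:          ", lam[:, k].sum())
print("contamination weights sum:", lam[:, 1 - k].sum())
print("decomposition vs. OLS:    ", beta_decomp, beta_ols)
```

The contamination weights cancel on average but not stratum by stratum, which is why they matter only when the other treatment's effects vary with the strata.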

## Simple Example
Consider Project STAR with two treatments: small classes and teacher aides. If small class effects vary across schools and correlate with the proportion getting aides, the small class coefficient will be contaminated by aide effects. With a 45% aide assignment rate in some schools and 5% in others, the contamination weights can be large: theoretical bounds show possible contamination bias of [-1.65, 1.67] standard deviations. The actual contamination is small, however, because the correlation between effects and weights is limited.

The paper studies nine applications and finds economically meaningful contamination bias, particularly in observational studies where treatment assignment mechanisms create more variation in the contamination weights.