# Birthweight and Smoking Analysis in R

This notebook replicates the Python analysis of birthweight and smoking data using R.

## Step 1 - Importing necessary libraries

In [12]:
library(tidyverse)
library(readODS)
library(conflicted)

## Step 2 - Reading the data file

In [14]:
# Read the ODS file
df <- read_ods("../Used data and given exercise/birthweight_smoking.ods")

head(df)

ERROR: Error: file does not exist


## Step 3 - Exploring the data

In [None]:
summary(df)

## Step 4 - Regressions of variable effects on birthweight

### Simple regression: Birthweight on Smoker

In [None]:
model1 <- lm(birthweight ~ smoker, data = df)
summary(model1)

# Calculate 95% CI
confint(model1, "smoker", level = 0.95)

The regression shows a statistically significant negative effect of smoking on birthweight (p < 0.001).

### Expanded regression with more variables

In [None]:
model2 <- lm(birthweight ~ smoker + alcohol + nprevist, data = df)
summary(model2)
confint(model2, "smoker", level = 0.95)

### Expanded regression with marital status

In [None]:
model3 <- lm(birthweight ~ smoker + alcohol + nprevist + unmarried, data = df)
summary(model3)
confint(model3, "smoker", level = 0.95)

## Step 5 - Additional regression with education

In [None]:
model4 <- lm(birthweight ~ smoker + alcohol + nprevist + educ + unmarried, data = df)
summary(model4)
confint(model4, "smoker", level = 0.95)

## Conclusions

- Smoking has a consistent negative effect on birthweight across all models
- Education shows minimal impact on birthweight
- Confidence intervals remain robust across model specifications