# Effect of hormonal contraceptives on HPV infection
## Final project in Practical Bayes 2025

**Synopsis:**
Human papillomavirus (HPV) is a common group of viruses. There are many HPV genotypes, but particular genotypes called "high-risk HPV" have been shown to cause cervical cancer.

HPV is transmitted sexually through skin-to-skin contact. In most cases the immune system clears the infection on its own. However, persistent infection with high-risk HPV genotypes can cause changes in cells (precancerous conditions) and, if untreated, may lead to cervical cancer. HPV also causes other types of cancer, such as penile, anal or throat cancer.

The national vaccination program against high-risk HPV started in 2018, and 12-14-year-old boys and girls are now routinely vaccinated in Estonia. However, the HPV vaccine is most effective when given before the initiation of sexual activity, and it protects only against specific HPV genotypes. Therefore, HPV screening continues to be important in all age groups, but especially in unvaccinated cohorts.

In Estonia, women aged 30-65 are invited to the cervical cancer screening program. However, many people do not attend screening because of various barriers, so screening data alone give an incomplete picture. To get a clearer picture of high-risk HPV prevalence and its risk factors, we collected population-based data.

This dataset consists of cross-sectional data on 1000 Estonian women aged 30-70 years. Women were invited to the study by mail or e-mail. They filled in a questionnaire (on paper or online) and provided a self-collected sample for HPV testing, which was sent to a lab for analysis and genotyping.

Our current research question is: **Do hormonal contraceptives increase the risk of high-risk HPV infection?**

**Variables:**
* **age** - age at the time of the study (years)
* **first_sex** - age at first intercourse (years)
* **nr_of_sex_partners** - number of lifetime sexual partners
* **smoking** - current smoking status (1=smoker/0=non-smoker)
* **hormonal_contrcptvs** - ever used hormonal contraception (including birth control pills, hormonal IUD, vaginal ring, hormonal implants, hormonal patches or shots) (1=yes/0=no)
* **hc_longest_period** - longest consecutive period of hormonal contraception use (full years)
* **hrhpv_prevalence** - test result for high-risk HPV (1=positive/0=negative)
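
Before modelling, it can help to confirm that the data match the coding described above. The sketch below is one possible check, not part of the original analysis: the helper `check_coding` and the toy rows are hypothetical, and the rules it applies (binary variables coded 0/1, age within 30-70, age at first intercourse not above current age) are read off the variable list and the study description.

```python
import pandas as pd

# Columns documented above as 1/0 indicators.
BINARY_COLUMNS = ["smoking", "hormonal_contrcptvs", "hrhpv_prevalence"]


def check_coding(df: pd.DataFrame) -> list[str]:
    """Return human-readable coding problems; an empty list means none were found."""
    problems = []
    for col in BINARY_COLUMNS:
        # Missing values are ignored here; only non-missing values are checked.
        bad = ~df[col].dropna().isin([0, 1])
        if bad.any():
            problems.append(f"{col}: {int(bad.sum())} value(s) outside 0/1")
    # The study description gives an age range of 30-70 years.
    out_of_range = ~df["age"].dropna().between(30, 70)
    if out_of_range.any():
        problems.append(f"age: {int(out_of_range.sum())} value(s) outside 30-70")
    # Age at first intercourse cannot exceed age at the time of the study.
    impossible = df["first_sex"] > df["age"]
    if impossible.any():
        problems.append(f"first_sex: {int(impossible.sum())} value(s) greater than age")
    return problems


# Made-up toy rows for illustration only (NOT the study data).
toy = pd.DataFrame({
    "age": [34, 52, 71],
    "first_sex": [18, 17, 19],
    "nr_of_sex_partners": [3, 5, 2],
    "smoking": [0, 1, 0],
    "hormonal_contrcptvs": [1, 0, 2],
    "hc_longest_period": [4, 0, 1],
    "hrhpv_prevalence": [0, 1, 0],
})
for problem in check_coding(toy):
    print(problem)
```

On the real data the same function could be called as `check_coding(df)` once the CSV has been loaded.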

In [None]:
import pymc as pm
import arviz as az
import pandas as pd
import numpy as np

# Let's model!
df = pd.read_csv('hpv_prevalence.csv')
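
As a first, crude look at the research question, one could compare high-risk HPV prevalence between women who have and have not used hormonal contraception. The sketch below is an illustration under stated assumptions rather than the model used in this project: it uses a conjugate Beta(1, 1) prior with a binomial likelihood instead of PyMC, it does not adjust for confounders such as age, number of partners or smoking, and it assumes there are no missing values. The function name `prevalence_difference` and the toy vectors are hypothetical.

```python
import numpy as np


def prevalence_difference(exposed, outcome, n_draws=20_000, seed=0):
    """Posterior draws of P(hrHPV+ | ever used HC) - P(hrHPV+ | never used HC)."""
    exposed = np.asarray(exposed).astype(bool)
    outcome = np.asarray(outcome).astype(int)
    rng = np.random.default_rng(seed)
    draws = {}
    for label, mask in (("users", exposed), ("non_users", ~exposed)):
        positives = int(outcome[mask].sum())
        negatives = int(mask.sum()) - positives
        # Beta(1, 1) prior + binomial likelihood gives a
        # Beta(1 + positives, 1 + negatives) posterior for the prevalence.
        draws[label] = rng.beta(1 + positives, 1 + negatives, size=n_draws)
    return draws["users"] - draws["non_users"]


# Made-up toy vectors for illustration only (NOT the study data).
hc = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
hpv = [1, 0, 1, 0, 0, 1, 0, 0, 1, 0, 0, 0]
diff = prevalence_difference(hc, hpv)
print(f"P(difference > 0) = {(diff > 0).mean():.2f}")
```

With the loaded data the call would be `prevalence_difference(df["hormonal_contrcptvs"], df["hrhpv_prevalence"])`. A positive difference here would only describe an unadjusted association; a regression model that includes the other variables is needed before drawing conclusions about risk.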