<h1>PPP2R1A</h1>

In [1]:
import genophenocorr

print(f"Using genophenocorr version {genophenocorr.__version__}")

Using genophenocorr version 0.1.1dev


## Setup

### Load HPO

We use the HPO `v2023-10-09` release for this analysis.

In [2]:
import hpotk

fpath_hpo = 'https://github.com/obophenotype/human-phenotype-ontology/releases/download/v2023-10-09/hp.json'
hpo = hpotk.load_minimal_ontology(fpath_hpo)

### Load phenopackets

We'll load the phenopacket JSON files stored in the `phenopackets` folder next to the notebook.

In [3]:
from genophenocorr.preprocessing import configure_caching_cohort_creator, load_phenopacket_folder

fpath_phenopackets = 'phenopackets'
cohort_creator = configure_caching_cohort_creator(hpo)
cohort = load_phenopacket_folder(fpath_phenopackets, cohort_creator)

Patients Created: 100%|██████████| 60/60 [00:00<00:00, 341.78it/s]
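
As a quick sanity check on the input, the number of JSON files in the folder can be compared with the number of patients reported by the progress bar. This is a minimal sketch using only the standard library. The helper name `count_phenopacket_files` is hypothetical, and the sketch assumes one phenopacket per `*.json` file stored directly in the folder.

```python
from pathlib import Path


def count_phenopacket_files(folder):
    """Count the `*.json` files stored directly inside `folder`.

    Hypothetical helper, not part of genophenocorr.
    Returns 0 if the folder is empty or does not exist.
    """
    return sum(1 for path in Path(folder).glob('*.json') if path.is_file())


# Assumption: each JSON file in the folder describes exactly one patient.
n_files = count_phenopacket_files('phenopackets')
print(f"Found {n_files} phenopacket JSON files")
```

If the count differs from the number of patients created, some files may have been skipped or may not be phenopackets.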


### Pick a transcript

We will use the [MANE Select](https://www.ncbi.nlm.nih.gov/nuccore/NM_014225.6) transcript for *PPP2R1A*.

In [4]:
tx_id = 'NM_014225.6'

## Configure the analysis


In [5]:
from genophenocorr.analysis import configure_cohort_analysis
from genophenocorr.analysis.predicate import BooleanPredicate

analysis = configure_cohort_analysis(cohort, hpo)

## Run analysis

Test for a genotype-phenotype correlation by comparing patients whose variant affects a `REGION` protein feature on the chosen transcript with patients whose variants lie outside such features.

In [6]:
from genophenocorr.model import FeatureType

by_region = analysis.compare_by_protein_feature_type(FeatureType.REGION, tx_id=tx_id)
by_region.summarize(hpo, BooleanPredicate.YES)

Variant that affects REGION protein feature type on NM_014225.6 (`No` vs. `Yes`):

| HPO term | No: Count | No: Percent | Yes: Count | Yes: Percent | p value | Corrected p value |
|---|---|---|---|---|---|---|
| Hearing impairment [HP:0000365] | 1 | 3.571429 | 6 | 21.428571 | 0.25 | 1.0 |
| Microcephaly [HP:0000252] | 0 | 0.000000 | 20 | 62.500000 | 1.00 | 1.0 |
| Aplasia/Hypoplasia of the cerebrum [HP:0007364] | 0 | 0.000000 | 40 | 100.000000 | 1.00 | 1.0 |
| Decreased head circumference [HP:0040195] | 0 | 0.000000 | 20 | 62.500000 | 1.00 | 1.0 |
| Abnormal cerebral morphology [HP:0002060] | 0 | 0.000000 | 42 | 100.000000 | 1.00 | 1.0 |
| ... | ... | ... | ... | ... | ... | ... |
| Abnormal pulmonary valve physiology [HP:0031654] | 0 | 0.000000 | 1 | 100.000000 | 1.00 | 1.0 |
| Abnormal heart valve physiology [HP:0031653] | 0 | 0.000000 | 1 | 100.000000 | 1.00 | 1.0 |
| Abnormal cardiovascular system physiology [HP:0011025] | 0 | 0.000000 | 1 | 100.000000 | 1.00 | 1.0 |
| Aggressive behavior [HP:0000718] | 0 | 0.000000 | 1 | 100.000000 | 1.00 | 1.0 |
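
The notebook does not state which statistical test or multiple-testing correction produces the `p value` and `Corrected p value` columns. The sketch below shows one common way such columns could be computed, assuming a two-sided Fisher's exact test on a 2x2 contingency table per HPO term, followed by a Bonferroni correction. Both choices are assumptions, the function names are hypothetical, and the counts are made up for illustration; the code is not expected to reproduce the numbers in the table.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability of observing x in the top-left cell given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # The small relative tolerance guards against floating-point ties.
    total = sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-7))
    return min(1.0, total)


def bonferroni(p_values):
    """Multiply each p value by the number of tests and cap the result at 1."""
    n = len(p_values)
    return [min(p * n, 1.0) for p in p_values]


# Made-up counts for two hypothetical HPO terms:
# (with phenotype / without phenotype) in each of the two genotype groups.
tables = [(3, 0, 0, 3), (1, 1, 1, 1)]
p_values = [fisher_exact_two_sided(*t) for t in tables]
print(p_values, bonferroni(p_values))
```

With many HPO terms tested at once, a Bonferroni-style correction quickly pushes moderate nominal p values to 1.0, which would be consistent with the corrected column above if that correction is indeed the one used.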


TODO - finalize!