<H1>MAPK8IP3 genotype phenotype correlations</H1>

In [1]:
import genophenocorr

print(f"Using genophenocorr version {genophenocorr.__version__}")

Using genophenocorr version 0.1.1dev


## Setup

### Load HPO

We need the Human Phenotype Ontology (HPO) for the analysis.

In [2]:
import hpotk

store = hpotk.configure_ontology_store()
hpo = store.load_minimal_hpo(release='v2023-10-09')
print(f'Loaded HPO v{hpo.version}')

Loaded HPO v2023-10-09


### Load phenopackets

Load the phenopackets located in the `phenopackets` folder.

In [3]:
from genophenocorr.preprocessing import configure_caching_cohort_creator, load_phenopacket_folder

fpath_phenopackets = 'phenopackets'
cohort_creator = configure_caching_cohort_creator(hpo)
cohort = load_phenopacket_folder(fpath_phenopackets, cohort_creator)

Patients Created: 100%|██████████| 20/20 [00:00<00:00, 348.98it/s]
Validated under none policy
20 phenopacket(s) found at `phenopackets`
  patient #0
    phenotype-features
     ·No diseases found.
  patient #1
    phenotype-features
     ·No diseases found.
  patient #2
    phenotype-features
     ·No diseases found.
  patient #3
    phenotype-features
     ·No diseases found.
  patient #4
    phenotype-features
     errors:
     Terms should not contain both present Delayed ability to walk [HP:0031936] and its present or excluded ancestor Motor delay [HP:0001270]
     Terms should not contain both present Delayed ability to roll over [HP:0032989] and its present or excluded ancestor Motor delay [HP:0001270]
     ·No diseases found.
  patient #5
    phenotype-features
     ·No diseases found.
  patient #6
    phenotype-features
     errors:
     Terms should not contain both present Unsteady gait [HP:0002317] and its present or excluded ancestor Abnormality of movement [HP:0100022]
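
The errors in the log above flag patients annotated with both a term and one of its ancestors. As a rough illustration of that kind of check, here is a minimal sketch. It assumes a hand-written toy ancestor map built only from the term pairs named in the log, and `find_redundant_pairs` is a hypothetical helper, not part of `genophenocorr` or `hpotk`. A real check would query the loaded HPO graph instead.

```python
# Toy ancestor map built only from the term pairs named in the log above.
# This is an illustrative assumption; a real check would query the HPO graph.
TOY_ANCESTORS = {
    "HP:0031936": {"HP:0001270"},  # Delayed ability to walk -> Motor delay
    "HP:0032989": {"HP:0001270"},  # Delayed ability to roll over -> Motor delay
    "HP:0002317": {"HP:0100022"},  # Unsteady gait -> Abnormality of movement
}


def find_redundant_pairs(present_terms, annotated_terms, ancestors=TOY_ANCESTORS):
    """Return (term, ancestor) pairs where a present term co-occurs with an annotated ancestor."""
    annotated = set(annotated_terms)
    pairs = []
    for term in present_terms:
        # Terms missing from the toy map are treated as having no known ancestors.
        for ancestor in sorted(ancestors.get(term, ())):
            if ancestor in annotated:
                pairs.append((term, ancestor))
    return pairs


# Hypothetical annotation set resembling the one reported for patient #4.
present = ["HP:0031936", "HP:0032989", "HP:0001270"]
print(find_redundant_pairs(present, present))
```

Dropping the ancestor term (here `HP:0001270`) from the phenopacket would clear both reported pairs, since the more specific terms already imply it.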
  

### Pick a transcript

We use the transcript `NM_001318852.2`, which is the MANE transcript of the *MAPK8IP3* gene.

In [4]:
tx_id = 'NM_001318852.2'

## Explore cohort

Explore the cohort to decide which genotype-phenotype comparisons are worth testing.

In [5]:
from IPython.display import display, HTML
from genophenocorr.view import CohortViewable

viewer = CohortViewable(hpo=hpo, transcript_id=tx_id)
html = viewer.process(cohort=cohort)
display(HTML(html))

| HPO Term | ID | Annotation Count |
|---|---|---|
| Global developmental delay | HP:0001263 | 14 |
| Intellectual disability, moderate | HP:0002342 | 9 |
| Hypotonia | HP:0001252 | 9 |
| Intellectual disability, severe | HP:0010864 | 7 |
| Delayed ability to walk | HP:0031936 | 6 |
| Spastic diplegia | HP:0001264 | 6 |
| Motor delay | HP:0001270 | 5 |
| Thin upper lip vermilion | HP:0000219 | 5 |
| Thin corpus callosum | HP:0033725 | 5 |
| Cerebral atrophy | HP:0002059 | 4 |

| Variant | Variant name | Variant Count |
|---|---|---|
| 16_1762843_1762843_C_T | todo | 6 |
| 16_1767834_1767834_C_T | todo | 4 |
| 16_1760409_1760409_T_C | todo | 2 |
| 16_1766768_1766768_C_G | todo | 1 |
| 16_1706402_1706403_CG_C | todo | 1 |
| 16_1748705_1748705_G_A | todo | 1 |
| 16_1706450_1706450_C_G | todo | 1 |
| 16_1762388_1762388_G_A | todo | 1 |
| 16_1706418_1706418_G_T | todo | 1 |
| 1_2408780_2408780_C_T | todo | 1 |

| Disease | Annotation Count |
|---|---|

| Variant effect | Annotation Count |
|---|---|
| MISSENSE_VARIANT | 15 |
| STOP_GAINED | 3 |
| FRAMESHIFT_VARIANT | 1 |

## Configure the analysis

In [6]:

from genophenocorr.analysis import configure_cohort_analysis

analysis = configure_cohort_analysis(cohort, hpo)


## Correlation analysis for c.1735C>T

`NM_001318852.2:c.1735C>T` is the most common variant in our cohort (variant count 6). Below, we test whether carrying this variant is significantly associated with any of the annotated phenotypes.

For the purpose of the analysis, the variant is denoted by its key: `16_1762843_1762843_C_T`.
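
Judging from the keys listed in the cohort summary, a key appears to follow the pattern `chromosome_start_end_ref_alt`. The sketch below splits a key into those fields. The pattern is inferred from the listing rather than taken from the `genophenocorr` documentation, and `VariantKey` and `parse_variant_key` are hypothetical names introduced for illustration only.

```python
from typing import NamedTuple


class VariantKey(NamedTuple):
    """Fields of a variant key, assuming the pattern chrom_start_end_ref_alt."""
    chrom: str
    start: int
    end: int
    ref: str
    alt: str


def parse_variant_key(key: str) -> VariantKey:
    # Split from the right so that a contig name containing underscores stays intact.
    chrom, start, end, ref, alt = key.rsplit("_", 4)
    return VariantKey(chrom, int(start), int(end), ref, alt)


print(parse_variant_key("16_1762843_1762843_C_T"))
```

Keys for other variant types, such as large structural variants, may not fit this pattern, so the helper should be treated as a convenience for the keys shown here.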

Let's run the analysis and summarize the results.

In [7]:
from genophenocorr.analysis.predicate import PatientCategories

variant_key = '16_1762843_1762843_C_T'

by_variant = analysis.compare_by_variant_key(variant_key=variant_key)
by_variant.summarize(hpo, PatientCategories.YES)

| >=1 allele of the variant 16_1762843_1762843_C_T | Yes: Count | Yes: Percent | No: Count | No: Percent | p value | Corrected p value |
|---|---|---|---|---|---|---|
| Loss of ambulation [HP:0002505] | 3/3 | 100% | 0/7 | 0% | 0.008333 | 1.0 |
| Inability to walk [HP:0002540] | 3/3 | 100% | 0/7 | 0% | 0.008333 | 1.0 |
| Gait disturbance [HP:0001288] | 3/3 | 100% | 1/7 | 14% | 0.033333 | 1.0 |
| Abnormality of movement [HP:0100022] | 4/4 | 100% | 3/9 | 33% | 0.069930 | 1.0 |
| Seizure [HP:0001250] | 2/2 | 100% | 2/11 | 18% | 0.076923 | 1.0 |
| ... | ... | ... | ... | ... | ... | ... |
| Abnormality of the orbital region [HP:0000315] | 0/0 | 0% | 3/3 | 100% | 1.000000 | 1.0 |
| Ventriculomegaly [HP:0002119] | 0/0 | 0% | 1/1 | 100% | 1.000000 | 1.0 |
| Abnormal brainstem morphology [HP:0002363] | 2/2 | 100% | 0/0 | 0% | 1.000000 | 1.0 |
| Motor delay [HP:0001270] | 4/4 | 100% | 2/2 | 100% | 1.000000 | 1.0 |
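
The uncorrected p values in the table above are consistent with a two-sided Fisher exact test on each 2x2 table of variant status against phenotype status. The notebook does not state which test is used, so treat that as an assumption. The sketch below implements the test with the standard library only, and `fisher_exact_two_sided` is a hypothetical helper, not a `genophenocorr` function.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher exact p value for the 2x2 table [[a, b], [c, d]].

    Rows: variant present / absent. Columns: phenotype present / absent.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(k: int) -> float:
        # Hypergeometric probability of seeing k in the top-left cell.
        return comb(row1, k) * comb(row2, col1 - k) / denom

    observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    probs = [prob(k) for k in range(lowest, highest + 1)]
    # Sum every table that is no more likely than the observed one;
    # the small tolerance guards against floating-point ties.
    return sum(p for p in probs if p <= observed * (1 + 1e-7))


# Loss of ambulation: 3/3 with the variant, 0/7 without it.
print(round(fisher_exact_two_sided(3, 0, 0, 7), 6))
# Abnormality of movement: 4/4 with the variant, 3/9 without it.
print(round(fisher_exact_two_sided(4, 0, 3, 6), 6))
```

With counts this small, even a complete split such as 3/3 versus 0/7 cannot produce a p value below 1/120, which helps explain why none of the tests survive multiple-testing correction.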


TODO - finalize!