# Retinal Degeneration Associated With RPGRIP1


Data from [Beryozkin A, et al. Retinal Degeneration Associated With RPGRIP1: A Review of Natural History, Mutation Spectrum, and Genotype-Phenotype Correlation in 228 Patients](https://pubmed.ncbi.nlm.nih.gov/34722527)

In [1]:
import genophenocorr

print(f"Using genophenocorr version {genophenocorr.__version__}")

Using genophenocorr version 0.1.1dev


## Setup

### Load HPO

We use the HPO `v2023-10-09` release for this analysis.

In [2]:
import hpotk

store = hpotk.configure_ontology_store()
hpo = store.load_minimal_hpo(release='v2023-10-09')
print(f'Loaded HPO v{hpo.version}')

Loaded HPO v2023-10-09


### Load Phenopackets

We load the phenopacket JSON files from the `phenopackets` folder located next to the notebook.

In [3]:
from genophenocorr.preprocessing import configure_caching_cohort_creator, load_phenopacket_folder

fpath_phenopackets = 'phenopackets'
cohort_creator = configure_caching_cohort_creator(hpo)
cohort = load_phenopacket_folder(fpath_phenopackets, cohort_creator)

Patients Created: 100%|██████████| 229/229 [00:30<00:00,  7.63it/s]
Validated under none policy
229 phenopacket(s) found at `phenopackets`
  patient #0
    phenotype-features
     ·No diseases found.
  patient #1
    phenotype-features
     ·No diseases found.
  patient #2
    phenotype-features
     ·No diseases found.
  patient #3
    phenotype-features
     ·No diseases found.
  patient #4
    phenotype-features
     ·No diseases found.
  patient #5
    phenotype-features
     ·No diseases found.
  patient #6
    phenotype-features
     ·No diseases found.
  patient #7
    phenotype-features
     ·No diseases found.
  patient #8
    phenotype-features
     ·No diseases found.
  patient #9
    phenotype-features
     ·No diseases found.
  patient #10
    phenotype-features
     ·No diseases found.
  patient #11
    phenotype-features
     ·No diseases found.
  patient #12
    phenotype-features
     ·No diseases found.
  patient #13
    phenotype-features
     ·No diseases found.
  p
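Before loading, it can help to confirm how many phenopacket files the folder actually holds. The sketch below uses only the standard library. The helper name `count_phenopacket_files` is hypothetical and is not part of genophenocorr, and the sketch assumes that each phenopacket is stored as a `*.json` file directly inside the folder.

```python
from pathlib import Path


def count_phenopacket_files(folder: str) -> int:
    """Count the `*.json` files directly inside `folder` (hypothetical helper)."""
    # Assumption: one phenopacket per JSON file, with no nested sub-folders.
    return sum(1 for path in Path(folder).glob('*.json') if path.is_file())
```

For this notebook, `count_phenopacket_files('phenopackets')` would be expected to agree with the number of phenopackets reported by the loader.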

### Pick transcript

We choose the [MANE Select](https://www.ncbi.nlm.nih.gov/nuccore/NM_020366.4) transcript for *RPGRIP1*.

In [4]:
tx_id = 'NM_020366.4'

## Explore cohort

Summarize the cohort to guide the choice of genotype-phenotype analyses.


In [12]:
from IPython.display import display, HTML
from genophenocorr.view import CohortViewable

cv = CohortViewable(hpo=hpo, transcript_id=tx_id)
html = cv.process(cohort=cohort)

display(HTML(html))

| HPO Term | ID | Annotation Count |
|---|---|---|
| Very low visual acuity | HP:0032122 | 81 |
| Eye poking | HP:0001483 | 28 |
| Abnormal electroretinogram | HP:0000512 | 27 |
| Moderately reduced visual acuity | HP:0030515 | 24 |
| Moderate hypermetropia | HP:0031729 | 12 |
| Global developmental delay | HP:0001263 | 11 |
| Constriction of peripheral visual field | HP:0001133 | 10 |
| High hypermetropia | HP:0008499 | 9 |
| Severely reduced visual acuity | HP:0001141 | 6 |
| High myopia | HP:0011003 | 4 |

| Variant | Variant name | Variant Count |
|---|---|---|
| 14_21312457_21312458_GA_G | todo | 25 |
| 14_21325943_21325943_G_T | todo | 12 |
| 14_21345145_21345145_C_T | todo | 8 |
| 14_21302530_21302531_AG_A | todo | 8 |
| 14_21345139_21345146_CAAGGCCG_C | todo | 7 |
| 14_21325252_21325252_G_A | todo | 7 |
| 14_21327671_21327671_A_AT | todo | 6 |
| 14_21303542_21303542_C_T | todo | 5 |
| 14_21325265_21325265_A_G | todo | 5 |
| 14_21317724_21317724_C_T | todo | 5 |
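The variant keys listed above appear to pack five underscore-separated fields. The sketch below splits a key into those fields and applies a simple allele-length heuristic. It assumes the layout `<chrom>_<start>_<end>_<ref>_<alt>`, which is inferred from the keys shown here and not taken from the genophenocorr documentation. `VariantKey`, `parse_variant_key`, and `classify_by_allele_length` are hypothetical names.

```python
from typing import NamedTuple


class VariantKey(NamedTuple):
    chrom: str
    start: int
    end: int
    ref: str
    alt: str


def parse_variant_key(key: str) -> VariantKey:
    """Split a key such as '14_21312457_21312458_GA_G' into its fields."""
    # Assumed layout: <chrom>_<start>_<end>_<ref>_<alt>.
    # Keys with a different number of fields raise ValueError on unpacking.
    chrom, start, end, ref, alt = key.split('_')
    return VariantKey(chrom, int(start), int(end), ref, alt)


def classify_by_allele_length(key: VariantKey) -> str:
    """Rough label based only on the lengths of the ref and alt alleles."""
    if len(key.ref) == 1 and len(key.alt) == 1:
        return 'SNV'
    if len(key.ref) > len(key.alt):
        return 'deletion'
    if len(key.ref) < len(key.alt):
        return 'insertion'
    return 'other'


most_common = parse_variant_key('14_21312457_21312458_GA_G')
print(most_common, classify_by_allele_length(most_common))
```

The length heuristic is only a quick way to read the table. It is not a substitute for the variant effects that the library reports per transcript.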

| Disease | Annotation Count |
|---|---|

| Variant effect | Annotation Count |
|---|---|
| STOP_GAINED | 81 |
| FRAMESHIFT_VARIANT | 95 |
| SPLICE_DONOR_VARIANT | 19 |
| SPLICE_DONOR_5TH_BASE_VARIANT | 5 |
| CODING_SEQUENCE_VARIANT | 7 |
| INTRON_VARIANT | 10 |
| MISSENSE_VARIANT | 50 |
| SPLICE_REGION_VARIANT | 20 |
| SYNONYMOUS_VARIANT | 1 |
| SPLICE_ACCEPTOR_VARIANT | 12 |


## Configure the analysis

In [13]:
from genophenocorr.analysis import configure_cohort_analysis

analysis = configure_cohort_analysis(cohort, hpo)

## Run the analyses

Test for genotype-phenotype correlations between subjects with missense variants and all other subjects.

In [14]:
from genophenocorr.model import VariantEffect
from genophenocorr.analysis.predicate import PatientCategories

by_missense = analysis.compare_by_variant_effect(VariantEffect.MISSENSE_VARIANT, tx_id=tx_id)
by_missense.summarize(hpo, PatientCategories.YES)

| MISSENSE_VARIANT on NM_020366.4 | Yes: Count | Yes: Percent | No: Count | No: Percent | p value | Corrected p value |
|---|---|---|---|---|---|---|
| Eye poking [HP:0001483] | 3/5 | 60% | 25/34 | 74% | 0.608819 | 1.0 |
| Visual impairment [HP:0000505] | 24/24 | 100% | 90/90 | 100% | 1.0 | 1.0 |
| Sensory seeking [HP:4000079] | 3/3 | 100% | 25/25 | 100% | 1.0 | 1.0 |
| Restricted or repetitive behaviors or interests [HP:0031432] | 3/3 | 100% | 25/25 | 100% | 1.0 | 1.0 |
| Abnormal movements of face and head [HP:5200017] | 3/3 | 100% | 25/25 | 100% | 1.0 | 1.0 |
| Abnormality of the eye [HP:0000478] | 24/24 | 100% | 95/95 | 100% | 1.0 | 1.0 |
| Abnormality of vision [HP:0000504] | 24/24 | 100% | 90/90 | 100% | 1.0 | 1.0 |
| Self-injurious behavior [HP:0100716] | 3/3 | 100% | 25/25 | 100% | 1.0 | 1.0 |
| Hypermetropia [HP:0000540] | 4/4 | 100% | 20/20 | 100% | 1.0 | 1.0 |
| Myopia [HP:0000545] | 0/0 | 0% | 4/4 | 100% | 1.0 | 1.0 |


Test for genotype-phenotype correlations between subjects with frameshift variants and all other subjects.

In [15]:
by_frameshift = analysis.compare_by_variant_effect(VariantEffect.FRAMESHIFT_VARIANT, tx_id=tx_id)
by_frameshift.summarize(hpo, PatientCategories.YES)

| FRAMESHIFT_VARIANT on NM_020366.4 | Yes: Count | Yes: Percent | No: Count | No: Percent | p value | Corrected p value |
|---|---|---|---|---|---|---|
| Eye poking [HP:0001483] | 23/27 | 85% | 5/12 | 42% | 0.016983 | 0.628369 |
| Visual impairment [HP:0000505] | 48/48 | 100% | 66/66 | 100% | 1.0 | 1.0 |
| Sensory seeking [HP:4000079] | 23/23 | 100% | 5/5 | 100% | 1.0 | 1.0 |
| Restricted or repetitive behaviors or interests [HP:0031432] | 23/23 | 100% | 5/5 | 100% | 1.0 | 1.0 |
| Abnormal movements of face and head [HP:5200017] | 23/23 | 100% | 5/5 | 100% | 1.0 | 1.0 |
| Abnormality of the eye [HP:0000478] | 52/52 | 100% | 67/67 | 100% | 1.0 | 1.0 |
| Abnormality of vision [HP:0000504] | 48/48 | 100% | 66/66 | 100% | 1.0 | 1.0 |
| Self-injurious behavior [HP:0100716] | 23/23 | 100% | 5/5 | 100% | 1.0 | 1.0 |
| Hypermetropia [HP:0000540] | 12/12 | 100% | 12/12 | 100% | 1.0 | 1.0 |
| Myopia [HP:0000545] | 4/4 | 100% | 0/0 | 0% | 1.0 | 1.0 |
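The notebook does not state which statistical test produces the `p value` column. As an illustration only, the sketch below assumes a two-sided Fisher's exact test on the 2x2 table of genotype group versus phenotype status, and applies it to the Eye poking row of the frameshift comparison (23 of 27 subjects with a frameshift variant, 5 of 12 without). The function `fisher_exact_two_sided` is a hypothetical standard-library implementation, not the genophenocorr API.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(k: int) -> float:
        # Hypergeometric probability of observing k in the top-left cell
        # when the row and column totals are held fixed.
        return comb(row1, k) * comb(row2, col1 - k) / denom

    observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables that are no more likely than the
    # observed one; the relative tolerance guards against floating-point ties.
    total = sum(
        p for p in (prob(k) for k in range(lowest, highest + 1))
        if p <= observed * (1 + 1e-7)
    )
    return min(1.0, total)


# Rows: frameshift yes / no. Columns: Eye poking present / absent.
p_value = fisher_exact_two_sided(23, 27 - 23, 5, 12 - 5)
print(f'{p_value:.6f}')  # prints 0.016983
```

Under this assumption the sketch reproduces the nominal p value reported for Eye poking in the frameshift comparison, which suggests, but does not prove, that the analysis uses the same test.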


Test for genotype-phenotype correlations between subjects with at least one allele of a given variant and the other subjects.

In [16]:
variant_key = '14_21312457_21312458_GA_G'

by_var = analysis.compare_by_variant_key(variant_key)
by_var.summarize(hpo, PatientCategories.YES)

| >=1 allele of the variant 14_21312457_21312458_GA_G | Yes: Count | Yes: Percent | No: Count | No: Percent | p value | Corrected p value |
|---|---|---|---|---|---|---|
| Eye poking [HP:0001483] | 16/16 | 100% | 12/23 | 52% | 0.000919 | 0.03401 |
| Visual impairment [HP:0000505] | 17/17 | 100% | 97/97 | 100% | 1.0 | 1.0 |
| Sensory seeking [HP:4000079] | 16/16 | 100% | 12/12 | 100% | 1.0 | 1.0 |
| Restricted or repetitive behaviors or interests [HP:0031432] | 16/16 | 100% | 12/12 | 100% | 1.0 | 1.0 |
| Abnormal movements of face and head [HP:5200017] | 16/16 | 100% | 12/12 | 100% | 1.0 | 1.0 |
| Abnormality of the eye [HP:0000478] | 17/17 | 100% | 102/102 | 100% | 1.0 | 1.0 |
| Abnormality of vision [HP:0000504] | 17/17 | 100% | 97/97 | 100% | 1.0 | 1.0 |
| Self-injurious behavior [HP:0100716] | 16/16 | 100% | 12/12 | 100% | 1.0 | 1.0 |
| Hypermetropia [HP:0000540] | 3/3 | 100% | 21/21 | 100% | 1.0 | 1.0 |
| Myopia [HP:0000545] | 0/0 | 0% | 4/4 | 100% | 1.0 | 1.0 |
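The notebook does not say how the `Corrected p value` column is derived. One plausible reading, assumed here and not confirmed by the source, is a Bonferroni correction, where the corrected value is `min(1, p * n_tests)`. Under that assumption, the ratio of corrected to nominal p value estimates the number of tests. The sketch below applies this to the two Eye poking rows whose corrected value is below 1.

```python
# (nominal p, corrected p) pairs copied from the Eye poking rows
# of the frameshift and single-variant comparisons.
reported = {
    'FRAMESHIFT_VARIANT': (0.016983, 0.628369),
    '14_21312457_21312458_GA_G': (0.000919, 0.03401),
}

# Assumption: corrected = min(1, nominal * n_tests), i.e. Bonferroni.
implied_tests = {
    label: round(corrected / nominal)
    for label, (nominal, corrected) in reported.items()
}

for label, n_tests in implied_tests.items():
    print(f'{label}: about {n_tests} tests implied')
```

Both ratios round to the same integer, which is what a shared Bonferroni factor would produce. Rows with a corrected value of 1.0 carry no information here, because the cap hides the multiplier.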


TODO - finalize!