# TRAF7
Data taken from [Castilla-Vallmanya L, et al. (2020) Phenotypic spectrum and transcriptomic profile associated with germline variants in TRAF7. Genet Med. 22:1215-1226](https://pubmed.ncbi.nlm.nih.gov/32376980/).

In [1]:
import pandas as pd
pd.set_option('display.max_colwidth', None) # show entire column contents, important!
from IPython.display import display, HTML
from pyphetools.creation import *
from pyphetools.visualization import *
from pyphetools.validation import *
import pyphetools
print(f"Using pyphetools version {pyphetools.__version__}")

Using pyphetools version 0.9.71


In [2]:
PMID = "PMID:32376980"
title = "Phenotypic spectrum and transcriptomic profile associated with germline variants in TRAF7"
cite = Citation(pmid=PMID, title=title)
parser = HpoParser(hpo_json_file="../hp.json")
hpo_cr = parser.get_hpo_concept_recognizer()
hpo_version = parser.get_version()
hpo_ontology = parser.get_ontology()
metadata = MetaData(created_by="ORCID:0000-0002-5648-2155", citation=cite)
metadata.default_versions_with_hpo(version=hpo_version)
print(f"HPO version {hpo_version}")

HPO version 2024-03-06


In [3]:
df = pd.read_excel("input/TRAF7_S1.xlsx")

In [4]:
df.head(2)

Unnamed: 0,Patient,Year of birth,Gender,"Variant (cDNA, with reference to NM_032271.2)",Variant (protein). Recurrent variants are color-coded.,Inheritance,Pregnancy/Birth,Feeding difficulties,Growth,"Cognitive development, speech",...,"Limbs, extremities",Other skeletal,Cardiac,Hearing,"Eyes, vision",Urogenital,Other phenotypes,Previous molecular or metabolic investigations,Method used for TRAF7 variant identification,TRAF7 variant validated by Sanger sequencing
0,1,2010,F,c.981C>A,p.Asp327Glu,Unknown (mother was negative for the TRAF7 variant),Born at 39 wks,No,H and W = 90%ile,"Autism spectrum disorder, speech delay, learning difficulties",...,Normal,Normal,Normal,Normal,Exotropia,Renal US - normal,,"CMA showed a maternally-inherited 16p13.11 353kb deletion; this variant may be associated with neurodevelopmental disorders with incomplete penetrance. Metabolic testing normal. By ES, VUS: KIF4A, c.2927T>C, p.Val976Ala, heterozygous, inheritance unknown.",ES,Yes
1,2,2018,M,c.1089C>A,p.Asp363Glu,"Unknown (mother was negative for the TRAF7 variant, father unavailable for testing)","Born at 38 wks. Delivery was complicated by nuchal cord, patient required resuscitation and oxygen.",Yes - required NICU stay for feeding difficulty and to regulate medication. Dysphagia.,"At 7 mos, H = 10th %ile, W = 58th %ile, HC = 13th %ile","Global developmental delay; reportedly did not cry until 2 mos; at 8 mos he had a social smile, laughed, cooed, but did not babble",...,Bilateral clubfoot (appears to be familial - maternal grandfather and maternal aunt both with bilateral clubfoot),Normal,Normal,Normal,Normal,Normal,,"Comprehensive epilepsy gene panel: negative, heterozygous POLG c.3428A>G (p.Glu1143Gly) variant present - possible association with valproate-induced liver toxicity - not known to cause mitochondrial disorder; newborn screen (2018): abnormal, concerning for cystic fibrosis, subsequent sweat test normal; array CGH: normal male; exome sequencing/mitochondrial genome sequencing/deletion analysis: homoplasmic MT-ATP6 VUS: m.9133G>A (p.E203K), mother is also homoplasmic with no signs of mitochondrial disease; normal mito/metabolic labs",ES,Yes


In [5]:
generator = SimpleColumnMapperGenerator(df=df, observed="Yes", excluded="No", hpo_cr=hpo_cr)
column_mapper_list = generator.try_mapping_columns()
display(HTML(generator.to_html()))

Result,Columns
Mapped,Feeding difficulties; Epilepsy
Unmapped,"Patient; Year of birth ; Gender; Variant (cDNA, with reference to NM_032271.2); Variant (protein). Recurrent variants are color-coded.; Inheritance; Pregnancy/Birth; Growth; Cognitive development, speech; Motor development; Brain MRI; Dysmorphic craniofacial features; Oral cavity, teeth, pharynx, upper respiratory tract; Limbs, extremities; Other skeletal; Cardiac; Hearing; Eyes, vision; Urogenital; Other phenotypes; Previous molecular or metabolic investigations; Method used for TRAF7 variant identification; TRAF7 variant validated by Sanger sequencing"
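
The mapped/unmapped split above can be pictured with a small sketch. It rests on an assumption rather than on the pyphetools source: a column is treated as mappable when its header resolves to an HPO term, and each cell is then read as observed, excluded, or not assessed. `TOY_LABELS`, `split_columns`, and `cell_status` are hypothetical names; the toy lookup stands in for the real concept recognizer.

```python
# Toy lookup standing in for the HPO concept recognizer (assumption).
# The "epilepsy" entry assumes the header resolves to Seizure (HP:0001250).
TOY_LABELS = {
    "feeding difficulties": "HP:0011968",
    "epilepsy": "HP:0001250",
}


def split_columns(column_names, labels=TOY_LABELS):
    """Partition column headers into those that resolve to a term and those that do not."""
    mapped, unmapped = [], []
    for name in column_names:
        key = name.strip().lower()
        (mapped if key in labels else unmapped).append(name)
    return mapped, unmapped


def cell_status(cell, observed="Yes", excluded="No"):
    """Classify one cell; anything other than the two tokens is left unassessed."""
    if cell == observed:
        return "observed"
    if cell == excluded:
        return "excluded"
    return "not assessed"


mapped, unmapped = split_columns(
    ["Feeding difficulties", "Growth", "Epilepsy", "Year of birth "]
)
print(mapped, unmapped)
```

Under this reading, free-text cells in a mapped column are simply skipped by the simple mapper, which is why the notebook goes on to define explicit option dictionaries for those columns.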


In [6]:
# results = OptionColumnMapper.autoformat(df=df,
#                                         hpo_cr=hpo_cr,
#                                         omit_columns=generator.get_unmapped_columns())

In [7]:
pregnancy_birth_d = {
 'Born at 38 wks. Delivery was complicated by nuchal cord': 'Nuchal cord',
 'Polyhydramnios. Amniotic fluid leaking from 26/40 wks. Induced at 32/40 wks by dates': 'Polyhydramnios',
 'then ventilated for 48 hrs': 'Neonatal respiratory distress',
 'CPAP for further 12 days. In neonatal period': 'Neonatal respiratory distress',
 'hypotonia with very few spontaneous movements. Jaundiced for 4 wks.': 'Hypotonia',
 '9. Blue in face attributed to face up birth. Floppy. Admitted at 4 days of life for lethargy': 'Lethargy',
 'poor feeding': 'Feeding difficulties',
 'sepsis': 'Sepsis',
 'respiratory failure - intubated.': 'Respiratory failure',
 'Increased nuchal translucency': 'Increased nuchal translucency',
 'Born prematurely (29+5/7 wks)': 'Premature birth',
 'neonatal cerebral bleeding and subsequent hydrocephalus': 'Hydrocephalus',
 'Born prematurely (27 wks)': 'Premature birth',
 'neonatal feeding problems': 'Feeding difficulties',
 'vaginal delivery at 39 wks. Large anterior fontanelle. Apgar scores: 10': 'Wide anterior fontanel',
 'Suspected congenital heart defect on prenatal ultrasound; fetal echo done at 30+4/7 wks showed hypoplastic aortic arch': 'Hypoplastic aortic arch',
 'small left ventricle': 'Hypoplastic left heart',
 'mother 39 yrs and father 61 yrs at time of delivery. Decreased fetal movement compared to previous pregnancy. Full term NSVD. Breathing and feeding issues as newborn.': 'Decreased fetal movement',
 'Born at 35 wks by normal vaginal delivery. NICU for difficulty breathing': 'Dyspnea',
 'no suck or swallow reflex': 'Poor suck',
 'marked hypotonia.': 'Hypotonia',
 'natural conception. 18 wk ultrasound showed fetal ascites and dysplastic kidney. At birth': 'Fetal ascites',
 'respiratory distress': 'Neonatal respiratory distress',
 'in NICU for 2 mos (was intubated for 2 wks). Passed newborn metabolic and hearing screen.': 'Neonatal respiratory distress',
 'Intrauterine growth retardation. Early infancy: apneas and hypotonic.': 'Intrauterine growth retardation',
 'Single umbilical artery and short femur found at US scan. Term birth.': ['Short femur', "Single umbilical artery"],
 'Positive nuchal translucency screen': 'Increased nuchal translucency',
 'fetal hydronephrosis': 'Hydronephrosis',
 'Born at full term. NICU stay for 7 days for underdeveloped lung': 'Pulmonary hypoplasia',
 'patent ductus ateriosus': 'Patent ductus arteriosus',
 'failure to thrive': 'Failure to thrive',
 'jaundice': 'Jaundice',
 'Fetal US findings - right pelvic kidney': 'Pelvic kidney',
 'premature atrial contractions. Uncomplicated term delivery': 'Premature atrial contractions',}
pregnancy_birthMapper = OptionColumnMapper(column_name='Pregnancy/Birth',concept_recognizer=hpo_cr, option_d=pregnancy_birth_d)
column_mapper_list.append(pregnancy_birthMapper)
pregnancy_birthMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Nuchal cord (HP:0012498) (observed),1
1,Polyhydramnios (HP:0001561) (observed),1
2,Neonatal respiratory distress (HP:0002643) (observed),4
3,Hypotonia (HP:0001252) (observed),2
4,Lethargy (HP:0001254) (observed),1
5,Feeding difficulties (HP:0011968) (observed),2
6,Sepsis (HP:0100806) (observed),1
7,Respiratory failure (HP:0002878) (observed),1
8,Increased nuchal translucency (HP:0010880) (observed),4
9,Hydrocephalus (HP:0000238) (observed),1


In [8]:
feeding_d = {
 'Yes - required NICU stay for feeding difficulty and to regulate medication. Dysphagia.': ["Feeding difficulties",'Dysphagia'],
 'Poor feeding with reflux; all feeds by nasogastric tube from 6 wks. Thickened oral feeds from 6 mos.': 'Feeding difficulties',
 'Yes': 'Feeding difficulties',
 'needed nasogastric gavage feeds in newborn period for 5 days - received therapies. Eats normally at 12 mos.': 'Feeding difficulties',
 'particularly in infancy': 'Feeding difficulties',
 'Poor sucking at birth and failure to thrive': ['Poor suck', "Failure to thrive"],
 'GERD': 'Gastroesophageal reflux',
 'major in infancy': 'Feeding difficulties',
 'PEG at 1 yr': 'Feeding difficulties',
 'Yes early on; since improved': 'Feeding difficulties',
 'Swallowing study showing oral phase dysphagia': 'Dysphagia',
 'improved': 'Feeding difficulties',
 'gastrostomy tube': 'Feeding difficulties',
 'fundoplication': 'Feeding difficulties',
 'patulous esophagus': 'Feeding difficulties',
 'vomiting': 'Vomiting',
 'constipation. Feeding difficulties due to reduced coordination': ["Feeding difficulties",'Constipation'],
 'lower muscle tone and strength and oral aversion.': 'Oral aversion',
 'Neonatal feeding difficulties': 'Feeding difficulties',
 'Yes (neonatal)': 'Feeding difficulties',
 'Nasal regurgitation as a baby due to submucous cleft palate': 'Cleft palate',
 'milk allergy. Reflux managed with prevacid.': 'Cow milk allergy',
 'Gastrostomy tube at 4 wks': 'Feeding difficulties',
 'Yes (infancy); nasogastric tube feeding for 4 wks': 'Nasogastric tube feeding',
 'Yes was on gastrostomy tube feeds until age 6 yrs': 'Feeding difficulties',
 'severe oral inefficiency with liquids and solids': 'Feeding difficulties',
 'mild pharyngeal phase dysphagia with oral aversion': 'Oral aversion',
 'gastrostomy tube placement at 4 mos. Ongoing feeding issues at 4 yrs 4 mos but improved with treatment through intensive feeding program.': 'Feeding difficulties',
 'Failure to thrive': 'Failure to thrive',
 'tube feeding for three months': 'Tube feeding',
 'Swallowing difficulties': 'Dysphagia',
 'Yes early on': 'Feeding difficulties',
 'then improved. Later started to have difficulty swallowing certain textures.': 'Dysphagia',
 'decreased appetite & frequent gagging at 12 yrs': 'Poor appetite',
 'mild in neonatal period': 'Feeding difficulties',
 'Significant feeding difficulties; little desire to eat but will eat if fed; for the most part parents must feed him formula by syringe; tends to regurgitate food up to a few times per day; tried lansoprazole and cyproheptadine to little effect': 'Feeding difficulties',
 'Mild neonatal feeding difficulties': 'Feeding difficulties'}
excluded = {'No': 'Feeding difficulties'}
feedingMapper = OptionColumnMapper(column_name='Feeding difficulties',
                                   concept_recognizer=hpo_cr, option_d=feeding_d, excluded_d=excluded)
column_mapper_list.append(feedingMapper)
feedingMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Feeding difficulties (HP:0011968) (excluded),14
1,Feeding difficulties (HP:0011968) (observed),29
2,Dysphagia (HP:0002015) (observed),4
3,Poor suck (HP:0002033) (observed),1
4,Failure to thrive (HP:0001508) (observed),2
5,Gastroesophageal reflux (HP:0002020) (observed),3
6,Intestinal malrotation (HP:0002566) (observed),1
7,Vomiting (HP:0002013) (observed),1
8,Oral aversion (HP:0012523) (observed),2
9,Cleft palate (HP:0000175) (observed),1
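
How a cell ends up as observed or excluded in a preview like the one above can be sketched in a few lines. This is an illustration under stated assumptions, not the `OptionColumnMapper` implementation: it supposes that a cell matching an `excluded_d` key exactly yields excluded terms, and that otherwise every `option_d` key found as a substring of the cell contributes observed terms. The function `map_cell` is hypothetical.

```python
def map_cell(cell, option_d, excluded_d=None):
    """Return a sorted list of (label, status) pairs for one free-text cell."""
    excluded_d = excluded_d or {}
    text = cell.strip()

    # Exact match only: a substring test would let "No" fire on "Normal".
    if text in excluded_d:
        labels = excluded_d[text]
        labels = [labels] if isinstance(labels, str) else labels
        return sorted((label, "excluded") for label in labels)

    results = set()
    for key, labels in option_d.items():
        if key in text:
            labels = [labels] if isinstance(labels, str) else labels
            results.update((label, "observed") for label in labels)
    return sorted(results)


option_d = {
    "GERD": "Gastroesophageal reflux",
    "Poor sucking at birth and failure to thrive": ["Poor suck", "Failure to thrive"],
}
excluded_d = {"No": "Feeding difficulties"}

print(map_cell("No", option_d, excluded_d))
print(map_cell("GERD, vomiting", option_d, excluded_d))
```

Keeping excluded values in a separate dictionary makes the negative findings explicit, so a bare "No" is recorded as an excluded term instead of being dropped.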


In [9]:
growth_d = {
 'L = 48 cm. Failure to thrive. At 9 mos': 'Failure to thrive',
 'Short stature': 'Short stature',
 'OFC = 27 cm (each <10th %ile for gestational age). Failure to thrive. At 7 yrs': 'Failure to thrive',
 'short stature': 'Short stature',
 'low weight. Microcephaly.': ['Decreased body weight',"Microcephaly"],
 'Short: at age 8 yrs: 10 cm below the 2.5th %ile': 'Short stature',
 'H = 92 cm (-2.42 SD)': 'Short stature',
  'L = 77 cm (<<1%)': 'Short stature',
 'W = 9.56 kg (<3%)': 'Short stature',
 'HC = 35.5 cm. Poor growth; followed by Pediatric Endocrinology. At 5 yrs': 'Growth delay',
 'W = 14.5 kg (Z = -2.90)': 'Decreased body weight',
 'H = 98.6 cm (Z = -3.07).': 'Short stature',
  'L = 47 cm (SGA). Early growth: short stature and failure to thrive. At 10 yrs': 'Short stature',
 'OFC = 39.8 cm (<< 3rd %ile). At 2 yrs': 'Microcephaly',
 'W = 37 kg (-2.8 SD)': 'Decreased body weight',
 'W = 3175 g. Short stature post-natally.': 'Short stature',
 'HC = 36 cm. Postnatal onset growth deficiency. Linear growth tracking at -1.5 to -2 Z-scores since 2-3 mos of age.': 'Growth delay',
 'OFC = 33.7 cm. Relative macrocephaly.': 'Relative macrocephaly',
 'microcephaly': 'Microcephaly',
 'H = 142 cm (<3rd %ile)': 'Short stature',
 'W = 9.5 kg (<3rd %ile).': 'Decreased body weight',
 'Failure to thrive. At birth': 'Failure to thrive',
 'H = 97.5 cm (<3rd %ile)': 'Short stature',
 'W = 3.37 kg. Failure to thrive; at 26 mos': 'Failure to thrive',
 'W = 10 kg (Z = -2.5); at 34 mos': 'Decreased body weight',
 'OFC = 53 cm (‒2.5 SD).': 'Microcephaly'}
growthMapper = OptionColumnMapper(column_name='Growth',concept_recognizer=hpo_cr, option_d=growth_d)
column_mapper_list.append(growthMapper)
growthMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Failure to thrive (HP:0001508) (observed),4
1,Short stature (HP:0004322) (observed),14
2,Decreased body weight (HP:0004325) (observed),4
3,Macrocephaly (HP:0000256) (observed),1
4,Growth delay (HP:0001510) (observed),2
5,Microcephaly (HP:0000252) (observed),3
6,Relative macrocephaly (HP:0004482) (observed),1
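
Several keys in the growth dictionary justify a term by a standard-deviation or Z-score value in parentheses, for example `H = 92 cm (-2.42 SD)`. A small parser can make such calls reproducible. The sketch below assumes the common convention of flagging measurements more than 2 SD below the mean; the regular expression, the helpers `parse_score` and `below_cutoff`, and the `cutoff` parameter are illustrative choices, not taken from the paper or from pyphetools.

```python
import re

# A signed decimal; the sign may be ASCII '-', a figure dash (U+2012), or a minus sign (U+2212).
NUM = r"([-\u2012\u2212]?\d+(?:\.\d+)?)"
# Accepts "(Z = -3.07)" or "(-2.42 SD)"; plain percentiles such as "(<3%)" do not match.
SCORE_RE = re.compile(r"\(\s*(?:Z\s*=\s*" + NUM + r"|" + NUM + r"\s*SD)\s*\)")


def parse_score(text):
    """Return the SD/Z value found in parentheses, or None if there is none."""
    match = SCORE_RE.search(text)
    if match is None:
        return None
    raw = match.group(1) or match.group(2)
    return float(raw.replace("\u2012", "-").replace("\u2212", "-"))


def below_cutoff(text, cutoff=-2.0):
    """True when a score is present and lies below the cutoff (assumed -2 SD)."""
    score = parse_score(text)
    return score is not None and score < cutoff


for cell in ["H = 92 cm (-2.42 SD)", "H = 98.6 cm (Z = -3.07).", "W = 9.56 kg (<3%)"]:
    print(cell, parse_score(cell), below_cutoff(cell))
```

Normalizing the dash variants matters here because the supplemental table mixes an ASCII hyphen with a typographic dash in at least one entry.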


In [10]:
cognitive_d = {'Autism spectrum disorder': 'Autistic behavior',
 'speech delay': 'Delayed speech and language development',
 'learning difficulties': 'Specific learning disability',
 'Global developmental delay; reportedly did not cry until 2 mos; at 8 mos he had a social smile': 'Global developmental delay',
 'Psychomotor delay': 'Global developmental delay',
 'Learning disability': 'Specific learning disability',
 'bipolar disorder': 'Bipolar affective disorder',
 'very few words. 1:1 help needed at school. Poor social skills.': 'Delayed speech and language development',
 'Moderate developmental delay. Communicates with simple speech and sign language. Major anxiety issues.': 'Global developmental delay',
 'Severe ID': 'Intellectual disability, severe',
 'no language': 'Absent speech',
 'Developmental delay. At 19 mos had 3 words. At 25 mos had 15-20 words': 'Global developmental delay',
 'could count to 3-4 and could follow two-step commands. At 4 yrs could put 3 words together and still followed two-step commands; counted to 2 then skipped numbers.':  'Global developmental delay',
 'Developmental delay. Severe ID (IQ 20-35). Delayed receptive and expressive speech; speaks few words at 7 yrs.': 'Global developmental delay',
 'Moderate ID. At 3 yrs': 'Intellectual disability, moderate',
 '10 words. At 8.5 years': 'Delayed speech and language development',
 'recognizing some sight words': 'Delayed speech and language development',
 'learning difficulties': 'Specific learning disability',
 'can answer some simple questions': 'Delayed speech and language development',
 'Autism': 'Autism',
 'global developmental delay': 'Global developmental delay',
 'single words at 2.5 yrs. Diagnosed with autism at 4 yrs. No significant behavioural problems.': 'Autism',
 'Developmental delay': 'Global developmental delay',
 'especially expressive language': 'Expressive language delay',
 'Speech delay: first began speaking (with babbling consonants) at 24 mos and used two words other than momma/dada': 'Delayed speech and language development',
 'poor articulation': 'Speech articulation difficulties',
 '5-10 signs. Receiving speech-language therapy.': 'Delayed speech and language development',
 'Mild-moderate ID (IQ 55). Language delay: first words at 3 yrs': 'Delayed speech and language development',
 'autism spectrum disorder': 'Autistic behavior',
 'ADHD combined type': 'Attention deficit hyperactivity disorder',
 'mixed receptive-expressive language disorder': 'Language impairment',
 'significant expressive language delay': 'Expressive language delay',
 'non-verbal': 'Absent speech',
 'Global developmental delay': 'Global developmental delay',
 'Learning difficulties in primary school': 'Specific learning disability',
 'Language delay. Difficulties drawing. Special needs at school': 'Delayed speech and language development',
 'At 20 mos had 4-5 words. At 31 mos had 20 words': 'Delayed speech and language development',
 'no phrases and would point. At 4 yrs 2 mos: delayed development': 'Global developmental delay',
 'can say colors and numbers and can communicate his needs': 'Delayed speech and language development',
 'Moderate difficulties at school (dyspraxia)': 'Speech apraxia',
 'Major language delay': 'Delayed speech and language development',
 'non verbal': 'Absent speech',
 'Autism and ID': 'Autism',
 'no spoken words': 'Absent speech',
 'but uses signs and an assisted communication device': 'Delayed speech and language development',
 'Mild to moderate degree of ID. Delayed speech and language skills. At 4 yrs he was signing 3- to 4-word sentences and demonstrated improved verbalizations. Attends a Deaf and Hard of Hearing Preschool Program. Provided with early intervention services beginning in infancy with occupational and physical therapy. Speech therapy initiated at 14-15 mos. No concerns with social engagement.': 'Delayed speech and language development',
 'talked at 3 yrs': 'Delayed speech and language development',
 'behavioral issues including acting out': 'Atypical behavior',
 'Developmental delay. Poor articulation but communicates by simple speech and some sign language. Behavioral issues including acting out.': 'Global developmental delay',
 'Moderate ID; most difficulties in speaking. No associated behavioral disorders.': 'Intellectual disability, moderate',
 'Developmental delays': 'Global developmental delay',
 'stranger anxiety': 'Anxiety',
 'does not wave bye. At 12 mos: puts objects in container and takes them out. At 15 mos: feeds self (spitting out a lot of food)': 'Global developmental delay',
 'Mild to moderate ID. Delayed speech; at 13 yrs': ["Intellectual disability",'Delayed speech and language development'],
 'dysarthria': 'Dysarthria',
 'at special school.': 'Global developmental delay',
 "Developmental delay. At 6 mos babbled; at 10 mos two syllable words ('mama'); at 20 mos follows commands": 'Global developmental delay',
 'laughs;\nat 2 yrs still only says two syllable words. He does not have aggression.': 'Delayed speech and language development',
 'Language delay. Drooling.': 'Delayed speech and language development',
 'developmental delays.': 'Delayed speech and language development',
 'Developmental delay. At 25 yrs': 'Global developmental delay',
 'IQ in the 60s. Ability to work at department store in processing and shipping. Able to engage in meaningful conversations.': 'Intellectual disability, mild',
 'Initially had good language development and was starting to use two-word phrases by 18-24 mos': 'Delayed speech and language development',
 'subsequent loss of language skills': 'Developmental regression',
 'eye contact and pointing contributed her diagnosis of autism spectrum disorder. At 12 yrs could speak in sentences but used language mainly to label and request and was very prompt-dependent': 'Autistic behavior',
 'self-injurious behaviors.': 'Self-injurious behavior',
 'Moderate ID. Had little speech at 4 yrs. Attended special school for education and works in a supported work environment.': 'Intellectual disability, moderate',
 'delayed speech': 'Delayed speech and language development',
 'ID. Calm': 'Intellectual disability',
 'Began babbling at 9 mos but no words at 26 mos; meets criteria for autism spectrum disorder': 'Autistic behavior',
 'IQ of 58 (mild ID). Delayed speech and language skills: first words at 37-38 mos. Attended special Preschool and School programs. Provided with early intervention services in infancy (physical and occupational therapy). Speech therapy initiated at 18-24 mos. No major social problems. At 10 yrs language skills were significantly better. Changes in behavior were observed at 13-14 yrs: fatigue': 'Intellectual disability, mild'}
cognitiveMapper = OptionColumnMapper(column_name='Cognitive development, speech',
                                     concept_recognizer=hpo_cr, option_d=cognitive_d)
column_mapper_list.append(cognitiveMapper)
cognitiveMapper.preview_column(df)
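
Hand-written mapping dictionaries of this size are easy to get subtly wrong: a Python dict literal silently keeps only the last value when a key is repeated. The sketch below uses the standard `ast` module to list repeated string keys in a dict literal. `find_duplicate_keys` and the sample source are made up for the example and are not part of pyphetools.

```python
import ast
from collections import Counter


def find_duplicate_keys(source):
    """Return string keys that occur more than once within any dict literal in `source`."""
    duplicates = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Dict):
            # Keep only literal string keys; `**` unpacking shows up as None and is skipped.
            keys = [k.value for k in node.keys
                    if isinstance(k, ast.Constant) and isinstance(k.value, str)]
            duplicates.extend(k for k, n in Counter(keys).items() if n > 1)
    return duplicates


# Hypothetical snippet with one repeated key
sample = """
d = {
    'learning difficulties': 'A',
    'Autism': 'B',
    'learning difficulties': 'C',
}
"""
print(find_duplicate_keys(sample))
```

Running such a check on the cell source before executing it catches repeated keys that would otherwise go unnoticed, since the interpreter raises no warning.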

In [11]:
motor_d = {
 'but unable to sit independently. He was able to push himself around on his back. He was just beginning to reach for objects with both hands. Diffuse hypotonia.': ["Motor delay", 'Hypotonia'],
 'Delayed motor skills': 'Delayed gross motor development',
 'Started walking at 3 yrs': 'Delayed ability to walk',
 'Hypotonia. Sat at 13 mos': 'Hypotonia',
 'walked at 30 mos.': 'Delayed ability to walk',
 'Sat at 12 mos': 'Delayed ability to sit',
 'crawled at 18 mos. Walking supported at 1 yr 6 mos and walking just after 2 yrs.': 'Delayed ability to walk',
 'Hypotonic at birth. Sat at 9 mos. Pulled to stand at 10 mos. Cruising at 11 mos. Still receives physical therapy twice per week at 12 mos.': 'Hypotonia',
 'Global delay. Walked at 2 yrs 2 mos.': 'Delayed ability to walk',
 'Global delay': 'Global developmental delay',
 'non-ambualant. Truncal hypotonia.': 'Axial hypotonia',
 'Muscle hypotonia. Delayed gross and fine motor development. Began to walk without support at 4.5 yrs.': ["Motor delay", 'Hypotonia'],
 'Weak voice': 'Weak voice',
 'delayed motor development the first 1-2 yrs': 'Motor delay',
 'Needed therapy in the past for help with fine motor skills such as self-feeding. At 14 yrs: gait: walks independently with forward flexion of the trunk at hips but can get more upright when prompted.': 'Motor delay',
 'Delayed ambulatory development. Hypotonia (decreased muscle tone in both upper and lower limbs).': 'Hypotonia',
 'Delayed motor skills.  Walked at 18 mos.': 'Delayed gross motor development',
 'Hypotonia. At 28 mos': 'Hypotonia',
 'Hypotonia': 'Hypotonia',
 'head lag at 6 mos': 'Persistent head lag',
 'walked at 20 mos': 'Delayed ability to walk',
 'Gross motor delay: sat at 14 mos': 'Delayed gross motor development',
 'cruise and pull to a stand at 24 mos but could only walk with assistance': 'Delayed ability to walk',
 'Gross and fine motor skill delay: sitting at 10 mos; at 13 mos': 'Motor delay',
 'no crawling': 'Delayed ability to crawl',
 'and difficulty with pincer grasp due to finger deformities. Walking at 2.5 yrs': 'Delayed ability to walk',
 'receiving OT and PT. Mild early hypotonia.': 'Hypotonia',
 'Delayed: independent walking at 2 yrs': 'Delayed ability to walk',
 'Hypotonia. Cerebral palsy': ["Hypotonia",'Cerebral palsy'],
 'Walked at 26 mos. Oral motricity problems. Synkinesis. Dyspraxia.': ["Delayed ability to walk",'Synkinesis'],
 'Walked at 22 mos. Synkinesis.': 'Synkinesis',
 'climb stairs and kick a ball. At 4 yrs 2 mos: delayed development': 'Global developmental delay',
 'walked at 4 yrs': 'Delayed ability to walk',
 'has trouble with balance': 'Gait imbalance',
 'does not run unassisted': 'Difficulty running',
 'uses walker for running': 'Difficulty running',
 'Global delays. Walked at 22-23 mos.': 'Global developmental delay',
 'Hypertonia. Motor delay. Sat at 16 mos': 'Hypertonia',
 'stood with help at 24 mos. Walked unsupported at 4 yrs. Oral motricity problems.': 'Delayed ability to walk',
 'Hypotonia. At 9 mos: pincer grasp': 'Hypotonia',
 'Muscular hypotonia. Walking at 11 mos.': 'Hypotonia',
 'rolling over; 12 mos: sat alone; 20 mos: attempting to pull to stand. At 2 yrs still unable to crawl or walk on his own.': 'Delayed ability to walk',
 'Walked at 28 mos': 'Delayed ability to walk',
 'fine motor development in the 2-3.5 mos range and gross motor skills in the 2-3 mos range. At 8 mos': 'Delayed fine motor development',
 'Developmental delay - borderline': 'Global developmental delay',
 'Delayed motor development': 'Motor delay',
 'walked at 19 months. At 12 yrs gross motor skills were at the level of a 4 yo and fine motor skills at the level of a 3 yo.': 'Delayed gross motor development',
 'Hypotonic in infancy': 'Hypotonia',
 'Mildly delayed gross motor milestones; more significant delays in fine motor milestones (pincer grasp at 16 mos)': 'Motor delay',
 'Sitting at 8-9 mos. Independent walking at 18 mos. Muscular hypotonia.': 'Hypotonia'}
excluded = {'Normal': 'Motor delay'}
motorMapper = OptionColumnMapper(column_name='Motor development',concept_recognizer=hpo_cr, option_d=motor_d, excluded_d=excluded)
column_mapper_list.append(motorMapper)
motorMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Motor delay (HP:0001270) (excluded),1
1,Motor delay (HP:0001270) (observed),6
2,Delayed gross motor development (HP:0002194) (observed),4
3,Delayed ability to walk (HP:0031936) (observed),12
4,Hypotonia (HP:0001252) (observed),12
5,Delayed ability to sit (HP:0025336) (observed),1
6,Global developmental delay (HP:0001263) (observed),4
7,Axial hypotonia (HP:0008936) (observed),1
8,Weak voice (HP:0001621) (observed),1
9,Persistent head lag (HP:0032988) (observed),1


In [12]:
epilepsy_d = {'Staring episodes': 'Generalized non-motor (absence) seizure',
 'Multifocal seizures with onset on first day postnatally': 'Multifocal seizures',
 'treated with phenobarbital. Breakthrough seizures occurred at 2 mos and were treated with higher doses of phenobarbital.  Developed infantile spasms (with hypsarrhythmia)  at 3 mos': 'Hypsarrhythmia',

 'Seizures since the age of 10 yrs. Mainly grand mal seizures': 'Bilateral tonic-clonic seizure',
 'rarely petit mal episodes. Now well controlled with Carbamazepine and Gabapentin.': 'Generalized non-motor (absence) seizure',
 'Convulsions at 7 mos.  Reflux anoxic seizures at 10 yrs.': 'Seizure',
 'West syndrome/infantile spasms': 'Epileptic spasm',
 'Nocturnal myoclonus; normal EEG': 'Myoclonus',
 'Seizures': 'Seizure',
 'Yes': 'Seizure',
 'from birth to 18 mos and was treated with phenobarbital until age 4 yrs': 'Seizure',
 'no seizures since 18 mos of age': 'Seizure',
 'Yes (last convulsion at 10 yrs)': 'Seizure',
}
excluded = {'no seizures': 'Seizure', 'No': 'Seizure'}
epilepsyMapper = OptionColumnMapper(column_name='Epilepsy',concept_recognizer=hpo_cr, option_d=epilepsy_d, excluded_d=excluded)
column_mapper_list.append(epilepsyMapper)
epilepsyMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Generalized non-motor (absence) seizure (HP:0002121) (observed),2
1,Seizure (HP:0001250) (observed),8
2,Multifocal seizures (HP:0031165) (observed),1
3,Hypsarrhythmia (HP:0002521) (observed),1
4,Seizure (HP:0001250) (excluded),33
5,Bilateral tonic-clonic seizure (HP:0002069) (observed),1
6,Epileptic spasm (HP:0011097) (observed),1
7,Infantile spasms (HP:0012469) (observed),1
8,Myoclonus (HP:0001336) (observed),1
9,Encephalopathy (HP:0001298) (observed),1


In [13]:
brain_mri_d = {
 'At 1 mo: thin corpus callosum': 'Thin corpus callosum',
 'ventriculomegaly': 'Ventriculomegaly',
 'periventricular leukomalacia': 'Periventricular leukomalacia',
 'myelinization delayed': 'Delayed CNS myelination',
 'Periventricular leukomalacia': 'Periventricular leukomalacia',
 'Somewhat thin corpus callosum': 'Thin corpus callosum',
 'bilateral subdural hematomas': 'Subdural hemorrhage',
 'suggestive of subacute to early chronic in age. Also evidence of chronic subarachnoid hemorrhage overlying the left cerebral hemisphere surface. Brain parenchyma within normal limits for age.': 'Subarachnoid hemorrhage',
 'Cerebellar vermis hypoplasia': 'Cerebellar vermis hypoplasia',
 'Choroid plexus cyst on neonatal ultrasound': 'Choroid plexus cyst',
 'small focus of FLAIR hyperintensity in the right frontal lobe is nonspecific but unchanged compared to 2012. Age-advanced cerebral volume loss': 'Reduced cerebral white matter volume',
 'with ventricles and sulci enlarged for age': 'Ventriculomegaly',
 'Mildly dilated lateral and 3rd ventricles with abnormality in the positioning of the hippocampi bilaterally. Brain MRS: normal.': 'Ventriculomegaly',
 'Chiari malformation': 'Chiari malformation',
 'congenital obstructive hydrocephalus': 'Hydrocephalus',
 'Ventriculomegaly involving lateral and third ventricles.  Asymmetric decreased caliber of the cerebral aqueduct at the level of inferior tectal plate.  S/p ventriculoperitoneal shunt.': 'Ventriculomegaly',
 'Bicoronal craniosynostosis': 'Craniosynostosis',
 'Choroid cysts': 'Choroid plexus cyst',
 'prominence of lateral ventricles': 'Lateral ventricle dilatation',
 'small arachnoid cyst in left temporal fossa': 'Arachnoid cyst',
 'mild cerebellar atrophy': 'Cerebellar atrophy',
 'mild ventricular dilation': 'Ventriculomegaly',
 'peri- and juxtaventricular white matter hyperintensities': 'Periventricular white matter hyperintensities',
 'dilated Virchow Robin spaces': 'Dilation of Virchow-Robin spaces',
 'mild vermis atrophy': 'Cerebellar vermis atrophy',
 'Prominence of the ventricular system': 'Ventriculomegaly',
 'Mildly enlarged ventricles': 'Ventriculomegaly',
 'Chiari 1 malformation (cerebellar tonsils 7.2 mm below foramen magnum) with hydrocephalus. Bilateral middle ear and mastoid effusions. Underwent repeat MRI status post right frontal burr hole for third ventriculostomy': ['Chiari type I malformation','Hydrocephalus'],
 'At 5 mos: simplified gyral pattern': 'Simplified gyral pattern',
 'globally diminished white matter': 'Reduced cerebral white matter volume',
 'delayed myelination': 'Delayed CNS myelination',
 'mild ventriculomegaly of lateral ventricles': 'Lateral ventricle dilatation',
 'mild basilar invagination': 'Basilar invagination',
 'Severe congenital hydrocephalus': 'Hydrocephalus',
 'Chiari type I malformation': 'Chiari type I malformation',}
brain_mriMapper = OptionColumnMapper(column_name='Brain MRI',concept_recognizer=hpo_cr, option_d=brain_mri_d)
column_mapper_list.append(brain_mriMapper)
brain_mriMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Thin corpus callosum (HP:0033725) (observed),2
1,Ventriculomegaly (HP:0002119) (observed),8
2,Periventricular leukomalacia (HP:0006970) (observed),3
3,Delayed CNS myelination (HP:0002188) (observed),2
4,Subdural hemorrhage (HP:0100309) (observed),1
5,Subarachnoid hemorrhage (HP:0002138) (observed),1
6,Cerebellar vermis hypoplasia (HP:0001320) (observed),1
7,Choroid plexus cyst (HP:0002190) (observed),2
8,Reduced cerebral white matter volume (HP:0034295) (observed),2
9,Chiari malformation (HP:0002308) (observed),1


In [14]:
dysmorphic_d = {
 'Stork bites on back of the neck and forehead': 'Nevus flammeus nuchae',
 'Slight trigonocephaly': 'Trigonocephaly',
 'plagiocephaly': 'Plagiocephaly',
 'short palpebral fissures': 'Short palpebral fissure',
 'right ptosis': 'Ptosis',
 'short nose': 'Short nose',
 'long philtrum': 'Long philtrum',
 'thin upper lip': 'Thin upper lip vermilion',
 'low set ears': 'Low-set ears',
 'Synophrys': 'Synophrys',
 'round face': 'Round face',
 'prominent nasal bridge': 'Prominent nasal bridge',
 'smooth philtrum': 'Smooth philtrum',
 'hypoplastic ala nasi': 'Underdeveloped nasal alae',
 'overbite': 'Increased overbite',
 'Blepharophimosis': 'Blepharophimosis',
 'epicanthus': 'Epicanthus',
 'telecanthus': 'Telecanthus',
 'bulbous nasal tip': 'Bulbous nose',
 'overfolded helices': 'Overfolded helix',
 #'lateral flaring of eyebrows': 'PLACEHOLDER',
 'Neonatal: large anterior fontanelle': 'Wide anterior fontanel',
 'slightly low set ears': 'Low-set ears',
 'arched eyebrows': 'Highly arched eyebrow',
 'puffy eyes': 'Periorbital fullness',
 'epicanthic folds. At 10 mos: prominent forehead': 'Prominent forehead',
 'brachycephaly': 'Brachycephaly',
 'blepharophimosis': 'Blepharophimosis',
 'bilateral ptosis': 'Bilateral ptosis',
 'epicanthic folds': 'Epicanthus',
 'short upturned nose': 'Anteverted nares',
 'carp shaped mouth': 'U-Shaped upper lip vermilion',
 'small jaw.': 'Micrognathia',
 'Upslanted and short palpebral fissures. Probable telecanthus. Wide nasal bridge and tip. Lowset': 'Short palpebral fissure',
 'posteriorly rotated ears. Mild torticollis in newborn period.': 'Torticollis',
 'Ptosis': 'Ptosis',
 'epicanthal folds': 'Epicanthus',
 'upturned nose': 'Anteverted nares',
 'tented upper lip': 'Tented upper lip vermilion',
 'Hypertelorism': 'Hypertelorism',
 'Exam at 19 mos: bilateral epicanthal folds': 'Epicanthus',
 'borderline low-set ears': 'Low-set ears',
 'posteriorly rotated pinnae.  Exam at 4 yrs: brachycephalic': 'Posteriorly rotated ears',
 'posteriorly rotated ears.': 'Posteriorly rotated ears',
 'Dolichocephaly': 'Dolichocephaly',
 'expressionless face': 'Mask-like facies',
 'narrow forehead': 'Narrow forehead',
 'retrognathia': 'Retrognathia',
 'narrow palpebral fissures': 'Narrow palpebral fissure',
 'hypertelorism': 'Hypertelorism',
 'blepharoptosis. Low-set': 'Ptosis',
 'protruding ears with bilateral post-auricular pits.': 'Protruding ear',
 'Palpebral fissures: short': 'Short palpebral fissure',
 'ptosis (upslanting with right epicanthal fold in neonatal period). Wide and low nasal bridge. Red rash on both cheeks. Decreased facial expression. Trigonocephaly with bifrontal narrowing (neonatal: third fontanelle). Hypoplastic supraorbital ridges. Short anteverted nares. Downturned corners of mouth.': 'Trigonocephaly',
 'frontal bossing': 'Frontal bossing',
 'low-set ears with protruding helices': 'Low-set ears',
 "Widow's peak": "Widow's peak",
 'tall and broad forehead': 'Broad forehead',
 'flat nasal bridge': 'Depressed nasal bridge',
 'anteverted nares': 'Anteverted nares',
 'low set and posteriorly rotated ears': 'Posteriorly rotated ears',
 'pointed chin': 'Pointed chin',
 'ptosis/blepharophimosis': 'Ptosis',
 #'abnormal eyebrows with peak in the middle': 'PLACEHOLDER',
 #'protruding lower jaw and lip': 'PLACEHOLDER',
 'Trigonocephaly': 'Trigonocephaly',
 'ocular hypertelorism': 'Hypertelorism',
 'down-sloping palpebral fissures': 'Downslanted palpebral fissures',
 'mandibular hypoplasia. Simple': 'Micrognathia',
 'low-set and posteriorly rotated ears. Two hair whorls.': 'Posteriorly rotated ears',
 'Macrocephaly and dolichocephaly': 'Dolichocephaly',
 'delayed closure of anterior fontanelle': 'Delayed closure of the anterior fontanelle',
 'depressed nasal bridge': 'Depressed nasal bridge',
 'large forehead with prominent metopic ridge': 'Prominent metopic ridge',
 'shallow orbits': 'Shallow orbits',
 'small palpebral fissures': 'Narrow palpebral fissure',
 'ptosis': 'Ptosis',
 'small jaw': 'Micrognathia',
 'Blephariphimosis': 'Blepharophimosis',
 'broad forehead': 'Broad forehead',
 'thin lips': 'Thin vermilion border',
 'Sparse eyebrows': 'Sparse eyebrow',
 'narrow pointed nose': 'Narrow nose',
 'flat philtrum': 'Smooth philtrum',
 'Prominent metopic suture': 'Prominent metopic ridge',
 'malar flattening': 'Malar flattening',
 'relative hypertelorism by comparison to head size with normal interpupillary distance for age': 'Hypertelorism',
 'high-arched eyebrows': 'Highly arched eyebrow',
 'protruding low set ears with cupped earlobes': 'Low-set ears',
 'prominent nasal bridge with long nose': 'Long nose',
 'Bicoronal craniosynostosis requiring cranial vault reconstruction surgeries (turricephalic and brachycephalic)': 'Craniosynostosis',
 'prominent brow': 'Prominent supraorbital ridges',
 'hyperteloric': 'Hypertelorism',
 'downslanting papebral fissures': 'Downslanted palpebral fissures',
 'wide nasal bridge. External drainage of intracranial vessels.': 'Wide nasal bridge',
 'obstructed left tearduct': 'Nasolacrimal duct obstruction',
 'flat occiput': 'Flat occiput',
 'slight trigonocephaly': 'Trigonocephaly',
 #'impressions in the skull above the lateral eyebrows': 'PLACEHOLDER',
 'broad nasal septum': 'Thick nasal septum',
 'Macrocephaly': 'Macrocephaly',
 'short and down slanting palpebral fissures': 'Downslanted palpebral fissures',
 'widely-spaced eyes': 'Hypertelorism',
 'protuberant and low set ears': 'Low-set ears',
 'simplified helices': 'Simple ear',
 'narrow nasal bridge': 'Narrow nasal bridge',
 'mild micrognathia': 'Micrognathia',
 'prominent nasal bridge and glabella': 'Prominent nasal bridge',
 #'ridging of sutures': 'PLACEHOLDER',
 #'small cleft between tragus and anti-tragus': 'PLACEHOLDER',
 'prominent forehead': 'Prominent forehead',
 'blephariphimosis': 'Blepharophimosis',
 'low-set posteriorly rotated ears with abnormal helices': 'Abnormal helix morphology',
 'microretrognathia. Mild sagittal synostosis.': 'Microretrognathia',
 'Narrow upslanting palpebral fissures': 'Upslanted palpebral fissure',
 'micrognathia': 'Micrognathia',
 'bilateral epicanthal folds': 'Epicanthus',
 'posteriorly rotated ears': 'Posteriorly rotated ears',
 'Dolicocephaly with bitemporal narrowing. Prominence of midline forehead. Midface retraction with hypoplastic maxilla and prognathia': 'Mandibular prognathia',
 'upturned nose with long philtrum. Macroglossia. Small deep-set eyes with upslanting narrow palpebral fissures. Hypertelorism. Bilateral residual ptosis even after surgical repair. Long ears.': 'Long philtrum',
 'bilateral epicanthal folds with telecanthus': 'Epicanthus',
 'mild ptosis': 'Ptosis',
 #'bilateral prominence of the parietal areas': 'PLACEHOLDER',
 'broad and depressed nasal bridge': 'Depressed nasal bridge',
 'hypoplastic orbital ridges': 'Underdeveloped supraorbital ridges',
 'upslanting palpebral fissures': 'Upslanted palpebral fissure',
 'prominent eyes': 'Proptosis',
 'low-set ears': 'Low-set ears',
 'bulbous nasal tip. Dysmorphisms were more prominent in early childhood.': 'Bulbous nose',
 'Congenital bilateral ptosis and blepharophimosis': 'Congenital bilateral ptosis',
 'bitemporal narrowing': 'Narrow forehead',
 'prominent ears': 'Protruding ear',
 'small mouth': 'Narrow mouth',
 'broad nasal tip': 'Broad nasal tip',
 'microretrognathia': 'Microretrognathia',
 'downslanting palpebral fissures': 'Downslanted palpebral fissures',
 'hypoplastic nasal alae': 'Underdeveloped nasal alae',
 'broad nasal root': 'Wide nasal bridge',
 'Triangular skull shape': 'Trigonocephaly',
 'several hair whorls': 'Abnormal hair whorl',
 'prominent veins': 'Prominent superficial veins',
 'sparse eyebrows and lashes': 'Sparse eyebrow',
 'small nasal bridge with prominent nasal tip': 'Prominent nasal tip',
 'Telecanthus (22 mm from medial to lateral canthus of each eye; 32 mm between medial canthi)': 'Telecanthus',
 'high forehead': 'High forehead',
 'cupped and low set ears with anteverted lobes': 'Low-set ears',
 'Bilateral ptosis (sling surgery). Interrupted eyebrows. Sparse eyelashes laterally. Palpebral fissures small and not well formed medially (telecanthus). Broad nasal bridge and tip. Epicanthic folds. Low-set': 'Epicanthus',
 'posteriorly rotated ears. Relatively thin lips. Mildly downturned corners of mouth. Small anterior fontanelle at 5 mos. Prominent forehead.': 'Downturned corners of mouth',
 'Prominent nose': 'Prominent nose',
 'asymmetric facies': 'Facial asymmetry',
 'small philtrum': 'Hypoplastic philtrum',
 'mild hypertelorism': 'Hypertelorism',
 'ptosis (surgery)': 'Ptosis',
 'Somewhat prominent forehead': 'Prominent forehead',
 'wide nasal bridge': 'Wide nasal bridge',
 #'visible vein across nasal bridge': 'PLACEHOLDER',
 'prominent nasal tip': 'Prominent nasal tip',
 'mild bilateral ptosis': 'Bilateral ptosis',
 'distichiasis. Features noted in infancy: small chin': 'Short chin',
 'low set and posteriorly-rotated ears.': 'Posteriorly rotated ears',
 'down slanting palpebral fissures': 'Downslanted palpebral fissures',
 'facial asymmetry': 'Facial asymmetry',
 'anteverted ears': 'Anteverted ears',
 'left pre-auricular ear tag': 'Preauricular skin tag',
 'synophrys': 'Synophrys',
 'bi-temporal narrowing': 'Narrow forehead',
 'Metopic ridge': 'Prominent metopic ridge',
 'craniosynostosis (lambdoid and sagittal)': 'Craniosynostosis',
 'high arched eyebrows': 'Highly arched eyebrow',
 'low posterior hairline': 'Low posterior hairline',
 'low columella': 'Low hanging columella',
 'Brachycephaly': 'Brachycephaly'}
dysmorphicMapper = OptionColumnMapper(column_name='Dysmorphic craniofacial features',
                                      concept_recognizer=hpo_cr, option_d=dysmorphic_d)
column_mapper_list.append(dysmorphicMapper)
dysmorphicMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Nevus flammeus nuchae (HP:0007616) (observed),1
1,Trigonocephaly (HP:0000243) (observed),8
2,Plagiocephaly (HP:0001357) (observed),3
3,Short palpebral fissure (HP:0012745) (observed),7
4,Ptosis (HP:0000508) (observed),11
...,...,...
79,Short chin (HP:0000331) (observed),1
80,Anteverted ears (HP:0040080) (observed),1
81,Preauricular skin tag (HP:0000384) (observed),1
82,Low posterior hairline (HP:0002162) (observed),1
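
Because the option dictionary above is long and contains many keys that differ only in capitalization or a trailing period (for example 'Ptosis' and 'ptosis'), a quick consistency check can help catch entries where near-identical phrases were mapped to different labels. The following is a minimal sketch, not part of pyphetools; `find_case_conflicts` and the sample dictionary (including its deliberately inconsistent last entry) are hypothetical names and data used only for illustration.

```python
from collections import defaultdict

def find_case_conflicts(option_d):
    """Report groups of keys that differ only by case, spacing, or a
    trailing period but are mapped to different labels."""
    groups = defaultdict(dict)
    for key, value in option_d.items():
        normalized = " ".join(key.casefold().split()).rstrip(".")
        groups[normalized][key] = value
    conflicts = {}
    for normalized, variants in groups.items():
        # Values may be a single label or a list of labels; make them hashable.
        distinct = {tuple(v) if isinstance(v, list) else (v,) for v in variants.values()}
        if len(distinct) > 1:
            conflicts[normalized] = variants
    return conflicts

# Hypothetical sample; the last entry is intentionally inconsistent.
sample_d = {
    'Ptosis': 'Ptosis',
    'ptosis': 'Ptosis',
    'bilateral ptosis': 'Bilateral ptosis',
    'Bilateral ptosis': 'Ptosis',
}
conflicts = find_case_conflicts(sample_d)
print(conflicts)
```

In the notebook itself, the same function could be called on `dysmorphic_d` before the mapper is constructed; an empty result means no such conflicts were found.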


In [15]:
oral_cavity_d = {
 'Widely spaced teeth on lower jaw': 'Widely spaced teeth',
 'Agenesis of premolar teeth. Nasal speech.': 'Agenesis of premolar',
 'Submucous cleft palate (operated). Left vocal cord paresis.': 'Cleft palate',
 'High-arched palate': 'High palate',
 'widely-spaced teeth. Velopharyngeal insufficiency until 2.5 yrs.': 'Widely spaced teeth',
 'associated with premature birth': 'Premature birth',
 'Small teeth': 'Microdontia',
 'Hypoplastic dental enamel and peg-shaped canines': 'Conical tooth',
 'Small conical tooth with supernumerary teeth in the primary dentition.': 'Conical tooth',
 'Glottis surgery and supraglottoplasty for tracheomalacia': 'Tracheomalacia',
 'hypernasality': 'Hypernasal speech',
 'Submucous cleft palate': 'Submucous cleft hard palate',
 'Cleft palate (operated)': 'Cleft palate',
 'Velopharyngeal insufficiency': 'Velopharyngeal insufficiency',
 'Delayed tooth eruption - first tooth at 14 mos': 'Delayed eruption of teeth',
 'Nasal speech due to immobility of the palate': 'Hypernasal speech',
 'velopharyngeal insufficiency': 'Velopharyngeal insufficiency',
 'Buccal-alveolar frenula': 'Accessory oral frenulum',
 'Malpositioned teeth and diastema between upper incisors': 'Tooth malposition',
 'Velopharyngeal insufficiecy': 'Velopharyngeal insufficiency',
 'nasal voice': 'Hypernasal speech',
 'History of narrow palate (s/p palate expander)': 'Narrow palate',
 'Choanal stenosis': 'Choanal stenosis',
 'Velopharyngeal insufficiency (surgery)': 'Velopharyngeal insufficiency'}
oral_cavityMapper = OptionColumnMapper(column_name='Oral cavity, teeth, pharynx, upper respiratory tract',
                                       concept_recognizer=hpo_cr, option_d=oral_cavity_d)
column_mapper_list.append(oral_cavityMapper)
oral_cavityMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Widely spaced teeth (HP:0000687) (observed),2
1,Agenesis of premolar (HP:0011051) (observed),1
2,Cleft palate (HP:0000175) (observed),2
3,High palate (HP:0000218) (observed),4
4,Premature birth (HP:0001622) (observed),1
5,Microdontia (HP:0000691) (observed),2
6,Conical tooth (HP:0000698) (observed),2
7,Tracheomalacia (HP:0002779) (observed),1
8,Hypernasal speech (HP:0001611) (observed),3
9,Submucous cleft hard palate (HP:0000176) (observed),2


In [16]:
limbs_d = {
 'Bilateral clubfoot (appears to be familial - maternal grandfather and maternal aunt both with bilateral clubfoot)': 'Bilateral talipes equinovarus',
 'Congenital hip dysplasia requiring hip replacement bilaterally': 'Hip dysplasia',
 'Clinodactyly Vth fingers': 'Clinodactyly',
 'subluxation of patella': 'Patellar subluxation',
 'Wide sandal gap toes I-II and long toe I. Marked palmar creases with wrinkled skin. Lateral deviation of digit II. Digit V clinodactyly. Prominent proximal interphalangeal joints. Overlapping toes.': 'Clinodactyly',
 'Bilateral mild talipes equinovarus': 'Talipes equinovarus',
 'Spastic extremities': 'Appendicular spasticity',
 'Toe II overlaps III': 'Overlapping toe',
 'flat feet': 'Pes planus',
 #'Toeing-out gait': 'PLACEHOLDER',
 'very low tone': 'Hypotonia',
 'hyperextension at knees': 'Genu recurvatum',
 'hyperlax joints': 'Joint hypermobility',
 #'tendency to roll into planovalgus position of ankles': 'PLACEHOLDER',
 'Large hands. Syndactyly': 'Syndactyly',
 'most prominent between fingers II and III and between III and IV. Has severe out-toeing bilaterally (right greater than left) which comes from the hip retroversion and severe pes planus with tight heelcords.': 'Pes planus',
 'Camptodactyly of all digits with limited extension of the 1st phalangeal joint on all digits': 'Camptodactyly',
 'but more prominent on II and III bilaterally. Limitation of elbow extension bilaterally. Congenital bilateral radial head dislocations. Lower limb X-rays: mildly abnormal epiphyses.': 'Limited elbow extension',
 'Restriction of joint motion and flexion deformity in upper limbs and knee. Flexion deformity with radial deviation of the fingers. Partial cutenous II-III syndactyly of both hands. Bilateral clinodactyly of finger V. Bilateral brachyphalangy of finger II; confirmed hypoplasia of the middle phalanx by X-ray. Wide sandal gap I-II toes. Metatarsus adductus.': 'Syndactyly',
 'Adducted thumbs': 'Adducted thumb',
 'deep palmar creases': 'Deep palmar crease',
 'Leg length discrepancy (R > L)': 'Lower limb asymmetry',
 'long fingers': 'Long fingers',
 'webbing between digits II-IV': 'Syndactyly',
 'long toes I': 'Long toe',
 #'deeply set toenails': 'PLACEHOLDER',
 'Bilateral radial clinodactyly of digit II of the hands': 'Clinodactyly of the 2nd finger',
 'middle phalanges of digits II-V delta shaped': 'Triangular shaped phalanges of the hand',
 'mild bowing of proximal ulnas': 'Ulnar bowing',
 'mild bowing and shortening of tibias': 'Tibial bowing',
 #'patellar dimples': 'PLACEHOLDER',
 'bilateral pes planus': 'Pes planus',
 'syndactyly of fingers. Bone age delayed by 2 yrs at 6.5 yrs chronological.': 'Syndactyly',
 'Upper: bilateral shortened index fingers with delta phalanges and ulnar deviation': 'Triangular shaped phalanges of the hand',
 'requiring surgical correction. Hypoplastic proximal phalanx of the thumb and middle phalanx of the middle fingers. Bilateral camptodactyly  and brachydactyly of middle fingers. Altered palmar creases and broad nails on thumb. \nLower: bilateral pes planus': 'Bilateral camptodactyly',
 'fat pads on medial surface of the soles of the feet': 'Prominent subcalcaneal fat pad',
 'and hallux valgus deformities of the toes. Curly right third toe': 'Hallux valgus',
 'mild left third toe curl. II-III syndactyly on right foot': 'Syndactyly',
 'Hypermobile somewhat broad thumbs': 'Broad thumb',
 #'interphalangeal contractures': 'PLACEHOLDER',
 'IInd toes overriding IIIrd': 'Overlapping toe',
 #'uplifted nails of halluces': 'PLACEHOLDER',
 'pedes planovalgi': 'Pes planus',
 'Bilateral mild camptodactyly of fingers II-V': 'Camptodactyly',
 'persistent fetal fingertip pads': 'Prominent fingertip pads',
 'Bilateral hamstring contractures': 'Hamstring contractures',
 'adductor and ankle spasticity': 'Spasticity',
 'metatarsus adductus': 'Metatarsus adductus',
 'Progressive brachymetatarsy of IVth toes': 'Short fourth metatarsal',
 'brachydactyly of fingers': 'Brachydactyly',
 'unilateral camptodactyly of Vth finger': 'Camptodactyly',
 'Metatarsus varus': 'Metatarsus adductus',
 'Long and slender fingers. Clinodactyly.': 'Clinodactyly',
 'Camptodactyly of toes II and fingers II and V with limited joint motion. Sandal gap. Pes planus.': 'Sandal gap',
 'Very flat and narrow feet. Feet rotated laterally. Long first toes.': 'Pes planus',
 'Some laxity to the knees and a skew foot appearance. Hockey stick palmar crease pattern.': 'Skewfoot',
 'camptodactyly': 'Camptodactyly',
 'pes valgus': 'Pes valgus',
 'overlapping toes': 'Overlapping toe',
 'Digit III brachydactyly': 'Brachydactyly',
 'Slender fingers': 'Slender finger',
 #'unilateral radial deviation of distal phalanx of finger II': 'PLACEHOLDER',
 'unilateral camptodactyly of finger V': 'Camptodactyly of finger',
 #'leg length asymmetry': 'PLACEHOLDER',
 'broad toes with mild camptodactyly': 'Camptodactyly',
 'sandal gaps': 'Sandal gap',
 'luxation of left elbow twice': 'Elbow dislocation',
 'Overriding toes. Brachymesophalangy of fingers II and V.': ['Overlapping toe', 'Short middle phalanx of finger'],
 'Anomalous palmar creases with additional creases bilaterally. Overlapping toes (IV on III) and shorter toes IV and V. Limbs are hard to flex': 'Overlapping toe',
 'Webbing of finger joints': 'Syndactyly',
 'pes planus': 'Pes planus',
 'overlapping toes (II-III)': 'Overlapping toe',
 #'limited rotation of elbows': 'PLACEHOLDER',
 #'contractures of distal phalanges': 'PLACEHOLDER',
 #'Long and slightly broad toes I': 'PLACEHOLDER',
 'slightly shallow palmar creases': 'Decreased palmar creases',
 #'Progressive contractures of digits and toes': 'PLACEHOLDER',
 #'lateral deviation of fingers': 'PLACEHOLDER',
 'left single palmar crease': 'Single transverse palmar crease',
 'adducted left thumb (requiring surgery)': 'Adducted thumb',
# 'restricted supination at the left elbow': 'PLACEHOLDER',
 'syndactyly of fingers II-III': 'Syndactyly',
 #'Thumb laxity (metacarpophalangeal joint)': 'PLACEHOLDER',
 'joint contractures of elbow and lower limbs': 'Elbow contracture',
 'Pes planus': 'Pes planus'}
limbsMapper = OptionColumnMapper(column_name='Limbs, extremities',concept_recognizer=hpo_cr, option_d=limbs_d)
column_mapper_list.append(limbsMapper)
limbsMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Bilateral talipes equinovarus (HP:0001776) (observed),1
1,Hip dysplasia (HP:0001385) (observed),1
2,Clinodactyly (HP:0030084) (observed),3
3,Patellar subluxation (HP:0010499) (observed),1
4,Talipes equinovarus (HP:0001762) (observed),1
5,Appendicular spasticity (HP:0034353) (observed),1
6,Overlapping toe (HP:0001845) (observed),8
7,Pes planus (HP:0001763) (observed),9
8,Hypotonia (HP:0001252) (observed),1
9,Genu recurvatum (HP:0002816) (observed),1
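
Values in these dictionaries are either a single label or a list of labels, so any helper that inspects them has to handle both shapes. The sketch below tallies how many dictionary keys point to each label. Note that this counts keys, not patients, so it is not expected to reproduce the `count` column shown above; `label_key_counts` is a hypothetical helper and the sample is a small excerpt of the entries in `limbs_d`.

```python
from collections import Counter

def label_key_counts(option_d):
    """Count how many keys map to each label, accepting str or list values."""
    counts = Counter()
    for value in option_d.values():
        labels = value if isinstance(value, list) else [value]
        counts.update(labels)
    return counts

# Small excerpt for illustration.
sample_d = {
    'flat feet': 'Pes planus',
    'pes planus': 'Pes planus',
    'Overriding toes. Brachymesophalangy of fingers II and V.':
        ['Overlapping toe', 'Short middle phalanx of finger'],
    'overlapping toes': 'Overlapping toe',
}
counts = label_key_counts(sample_d)
for label, n in counts.most_common():
    print(f"{label}: {n} key(s)")
```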


In [17]:
other_skeletal_d = {
 'Short neck': 'Short neck',
 'pectus carinatum': 'Pectus carinatum',
 'Pectus carinatum': 'Pectus carinatum',
 'Short neck. Neonatal: small chest. Slim build.': 'Thoracic hypoplasia',
 'Neonatal: narrow chest (circumference 34 cm)': 'Narrow chest',
 'webbed neck. Exaggerated lumbar lordosis. Grade 2 spondylolisthesis L5/S1.': ['Webbed neck', 'Lumbar hyperlordosis', 'Spondylolisthesis'],
 'Spine: congenital progressive thoracolumbar kyphosis (Th12-L1) associated with hemivertebra L1. Operated twice (at 4 and 6 yrs); spondylodesis performed. J-shaped sella turcica. Short neck. Small chest with pectus carinatum': ['Thoracolumbar kyphosis', 'Hemivertebrae', 'J-shaped sella turcica', 'Short neck', 'Pectus carinatum'],
 'clavicle hypoplasia': 'Short clavicles',
 'slightly curved ribs. Delayed bone age: estimated to be 4 yrs at chronologic age 6 yrs.': 'Delayed skeletal maturation',
 'Craniocervical junction stenosis with secondary hydrocephalus': 'Hydrocephalus',
 'T11-T12 hemivertebrae': 'Hemivertebrae',
 '11 pairs ribs': '11 pairs of ribs',
 'short neck': 'Short neck',
 'protruding sternum (possibly due to sternotomy). Delayed bone age.': 'Delayed skeletal maturation',
 'Narrow thorax and pectus carinatum': 'Pectus carinatum',
 'broad posterior ribs': 'Broad ribs',
 'flattened acetabular angles': 'Flat acetabular roof',
 '52 degree kyphosis and 14 degree right convexity scoliosis at the thoracolumbar junction. Narrow asymmetrical chest with prominence of right ribs. Short neck.': 'Short neck',
 'Broad thorax': 'Shield chest',
 'mild scoliosis': 'Scoliosis',
 'Thoracic scoliosis': 'Thoracic scoliosis',
 'with irregularity of the anterior margin of these vertebral bodies; anterior protrusion of the lower margin of the sternum compatible with a pectus carinatum deformity. Delayed bone age: bone age is estimated to be 3 yrs at chronologic age 5 yrs 6 mos; bone mineralization and appearance of the growth plates is normal. Short neck.': 'Pectus carinatum',
 'Barrel-shaped thorax. Spondylolisthesis L5-S1 with pain': 'Spondylolisthesis',
 'paraparesis and urinary incontinence at 10 yrs. Operated at 12 yrs: decompression and osteosynthesis': 'Urinary incontinence',
 'posterior fixation of L4-L5 and S1. Kyphosis with narrow cervical canal (C2-C3). Altered MEP and SSEP': 'Kyphosis',
 'Short and broad neck': 'Broad neck',
 'Short neck with webbing both laterally and anteriorly. Barrel shaped chest. Scoliosis. Compressed cervical spinal cord.': 'Short neck',
 'J shaped sella turcica': 'J-shaped sella turcica',
 'beaking of the L1 vertebral body': 'Anterior beaking of lumbar vertebrae',
 'slender ribs with prominent sternum. By MRI': 'Thin ribs',
 'Narrow spinal canal': 'Spinal canal stenosis',
 'bifid sacral bone': 'Bifid sacrum',
 'fusion of 4th and 5th left ribs': 'Rib fusion',
 'Mild pectus carinatum': 'Pectus carinatum',
 #'anteverted shoulders': 'PLACEHOLDER',
 #'Conus medullaris terminates at the level of the inferior endplate of L3': 'PLACEHOLDER',
 'Pectus carinatum. Short': 'Pectus carinatum',
 'webbed neck. Severe osteoporosis discovered after broken femur in 20s. Spinal fusion of L4-L5.': 'Vertebral fusion',
 'Craniocervical junction anomaly noted at 12 yrs: marked stenosis at the level of the foramen magnum': 'Small foramen magnum',
 'C1 laminectomy and C2 occipital spine fusion. Short neck.': 'Short neck',
 'cervical stenosis': 'Cervical spinal canal stenosis',
 'thoracic syrinx. At 12 years': 'Syringomyelia',
 'developed clinical signs of myelopathy and underwent cervical spine decompression.': 'Myelopathy'}
other_skeletalMapper = OptionColumnMapper(column_name='Other skeletal',concept_recognizer=hpo_cr, option_d=other_skeletal_d)
column_mapper_list.append(other_skeletalMapper)
other_skeletalMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Short neck (HP:0000470) (observed),20
1,Pectus carinatum (HP:0000768) (observed),15
2,Thoracic hypoplasia (HP:0005257) (observed),2
3,Narrow chest (HP:0000774) (observed),1
4,Webbed neck (HP:0000465) (observed),1
5,Lumbar hyperlordosis (HP:0002938) (observed),1
6,Spondylolisthesis (HP:0003302) (observed),2
7,Thoracolumbar kyphosis (HP:0005619) (observed),1
8,Hemivertebrae (HP:0002937) (observed),2
9,J-shaped sella turcica (HP:0002680) (observed),2
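
The commented-out `PLACEHOLDER` lines mark phrases for which no HPO term was chosen. To see which fragments of a given cell are left without a mapping, one can split the cell text and look each piece up in the dictionary. This is only a sketch under the assumption that cells are split on commas, which is what the shape of the keys above suggests; `unmapped_fragments` is a hypothetical helper and the sample cell is made up from phrases that occur in this column.

```python
def unmapped_fragments(cell_text, option_d, delimiter=","):
    """Return the delimiter-separated fragments of a cell that have no dictionary entry."""
    fragments = [part.strip() for part in cell_text.split(delimiter)]
    return [fragment for fragment in fragments if fragment and fragment not in option_d]

# Excerpt of the mapping and an illustrative (not actual) cell value.
sample_d = {
    'Short neck': 'Short neck',
    'pectus carinatum': 'Pectus carinatum',
}
cell = "Short neck, pectus carinatum, anteverted shoulders"
missing = unmapped_fragments(cell, sample_d)
print(missing)
```

Fragments reported this way are either intentionally skipped (as with the placeholders) or candidates for a new dictionary entry.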


In [18]:
cardiac_d = {
 'Atrioventricular canal defect (surgically repaired)': 'Atrioventricular canal defect',
 'Patent ductus arteriosus (ligated) and atrial septal defect. A later echocardiogram showed intact septum.': ['Atrial septal defect', 'Patent ductus arteriosus'],
 'Patent ductus arteriosus': 'Patent ductus arteriosus',
 'Echocardiogram: tiny inferior muscular ventricular septal defect with left to right shunt': 'Muscular ventricular septal defect',
 'echocardiogram showed persistent ductus arteriosus with no haemodynamic repercussion (surgery not required) and mild supravalvular pulmonary stenosis': 'Patent ductus arteriosus',
 'Persistent small patent ductus arteriosus': 'Patent ductus arteriosus',
 'Bicuspid aortic valve with mild ascending aorta dilation': 'Ascending tubular aorta aneurysm',
 'large patent ductus arteriosus with a left to right shunt': 'Patent ductus arteriosus',
 'underwent uncomplicated surgery at 10 days. Anomalous origin of the right coronary artery.': 'Abnormal coronary artery origin',
 'Atrial septal defect detected at birth and at 7 yrs is still persistent with diameter of 2 mm': 'Atrial septal defect',
 'Patent ductus arteriosus causing cardiac compromise and requiring surgical ligation': 'Patent ductus arteriosus',
 'small atrial septal defect': 'Atrial septal defect',
 'bicuspid aortic valve': 'Bicuspid aortic valve',
 #'Prenatal anomalies (see Pregnancy/Birth). Shone complex': 'PLACEHOLDER',
 'with mild to moderately hypoplastic transverse arch': 'Hypoplastic aortic arch',
 'large patent ductus arteriosus': 'Patent ductus arteriosus',
 'hypoplastic aortic and mitral valves and bicuspid aortic valve.': 'Bicuspid aortic valve',
 'Patent ductus arteriosus surgically closed': 'Patent ductus arteriosus',
 'patent foramen ovale': 'Patent foramen ovale',
 'aberrant right subclavian artery': 'Arteria lusoria',
 'Small ventricular septal defect (closed spontaneously)': 'Ventricular septal defect',
 'small-moderate patent ductus arteriosus requiring interventional closure at 2.5 yrs': 'Patent ductus arteriosus',
 'Patient had a patent ductus arteriosus and bicuspid aortic valve': 'Patent ductus arteriosus',
 'Premature atherosclerosis': 'Precocious atherosclerosis',
 'ischemic cardiopathy at 41 yrs': 'Precocious atherosclerosis',
 'Atrial septal defect': 'Atrial septal defect',
 'echocardiogram identified a patent ductus arteriosus (subsequently repaired) with mild left atrial and ventricular dilation without evidence of coarctation of the aorta; his EEG was normal.': 'Patent ductus arteriosus',
 'Heart echography: mild mitral insufficiency': 'Mitral regurgitation',
 'Ventricular septal defect with cleft mitral valve': 'Ventricular septal defect',
 'patent ductus arteriosus': 'Patent ductus arteriosus',
 'History of patent ductus arteriosus that closed spontaneously. Persistent left superior vena cava with drainage to the coronary sinus.   Echocardiogram at 2 yrs revealed increased trabeculations in left ventricular apex and lateral wall. Most recent echocardiogram at 4 yrs 4 mos: stable increased trabeculations in left ventricular apex and lateral wall. Left ventricular size and systolic function are normal. Subjectively': 'Persistent left superior vena cava',
 'Atrial septal defect that closed spontaneously': 'Atrial septal defect',
 'atrial septal defect': 'Atrial septal defect',
 'Patent ductus arteriosus requiring surgical coiling to close': 'Patent ductus arteriosus',
 'with subsequent difficulty to remove the coil and thus requiring open heart surgery. The patent ductus arteriosus is currently (at 2 yrs) patent at about 1 mm.': 'Patent ductus arteriosus',
 'Patent ductus arteriosus requiring surgery': 'Patent ductus arteriosus',
 'Isolated left-sided superior vena cava': 'Persistent left superior vena cava',
 'small patent ductus arteriosus': 'Patent ductus arteriosus',
 'ventricular septal defect': 'Ventricular septal defect',
 'aortic aneurysm': 'Aortic aneurysm',
 'primum atrial septal defect': 'Primum atrial septal defect',
 'cleft mitral valve - s/p repair': 'Cleft anterior mitral valve leaflet',
 'Ventricular septal defects': 'Ventricular septal defect',
 'Patent ductus arteriosus (surgery at 9 yrs)': 'Patent ductus arteriosus',
 'Patent ductus arteriosus (s/p ligation at 7 mos)': 'Patent ductus arteriosus'}
cardiacMapper = OptionColumnMapper(column_name='Cardiac',concept_recognizer=hpo_cr, option_d=cardiac_d)
column_mapper_list.append(cardiacMapper)
cardiacMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Atrioventricular canal defect (HP:0006695) (observed),1
1,Atrial septal defect (HP:0001631) (observed),7
2,Patent ductus arteriosus (HP:0001643) (observed),24
3,Muscular ventricular septal defect (HP:0011623) (observed),1
4,Ascending tubular aorta aneurysm (HP:0004970) (observed),1
5,Abnormal coronary artery origin (HP:0011636) (observed),1
6,Bicuspid aortic valve (HP:0001647) (observed),4
7,Hypoplastic aortic arch (HP:0012304) (observed),1
8,Patent foramen ovale (HP:0001655) (observed),1
9,Arteria lusoria (HP:0031014) (observed),1
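
Free-text cells sometimes mention a finding only to exclude it, and every entry in an option dictionary is recorded as observed. A crude screen for such cases is to flag keys in which the first word of the mapped label follows a negation cue. The sketch below is a heuristic under stated assumptions: the cue list, the 40-character window, and the helper name `flag_negated_labels` are all hypothetical, and the two flagged sample entries are invented for illustration. It will miss many phrasings and can produce false positives, so hits are prompts for manual review only.

```python
import re

# Hypothetical, deliberately short list of negation cues (longest first).
NEGATION_CUES = ("without evidence of", "no evidence of", "without", "absent", "no")

def flag_negated_labels(option_d, window=40):
    """Flag (key, label) pairs where the label's first word appears shortly after a negation cue."""
    cues = "|".join(re.escape(cue) for cue in NEGATION_CUES)
    flagged = []
    for key, value in option_d.items():
        labels = value if isinstance(value, list) else [value]
        text = key.casefold()
        for label in labels:
            head = re.escape(label.casefold().split()[0])
            # Cue, then up to `window` characters within the same clause, then the head word.
            pattern = rf"\b(?:{cues})\b[^.;]{{0,{window}}}?\b{head}"
            if re.search(pattern, text):
                flagged.append((key, label))
    return flagged

# Invented entries: the last two should be flagged.
sample_d = {
    'mild scoliosis': 'Scoliosis',
    'kyphosis without evidence of scoliosis': 'Scoliosis',
    'no hearing loss': 'Hearing impairment',
}
flagged = flag_negated_labels(sample_d)
for key, label in flagged:
    print(f"review: {key!r} -> {label!r}")
```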


In [19]:
hearing_d = {
 'Conductive deafness': 'Conductive hearing impairment',
 'chronic otitis media': 'Chronic otitis media',
 'Conductive hearing loss with hearing aids': 'Conductive hearing impairment',
 'Bilateral hearing loss': 'Hearing impairment',
 'Failed newborn and subequent hearing screens. Middle ear effusion diagnosed and hearing deemed normal after placement of pressure equalizing tubes. Chronic otitis media. Bilateral mastoid opacification could be clinically correlated.': 'Chronic otitis media',
 'Mild to moderate conductive loss in the left ear': 'Conductive hearing impairment',
 'Normal. Small external auditory canals.': 'Stenosis of the external auditory canal',
 'Stable hearing loss (sensorineural)': 'Hearing impairment',
 'bilateral hearing aids.': 'Hearing impairment',
 'Bilateral congenital hearing loss': 'Hearing impairment',
 'wears hearing aids': 'Hearing impairment',
 'Moderate-severe bilateral conductive hearing impairment suspected secondary to ossicular chain dysfunction; wears aids.': 'Bilateral conductive hearing impairment',
 'Chronic fluid due to eustachian tube dysfunction.  Required four sets of tympanostomy tubes.  Audiogram at 5 yrs: mild conductive hearing loss rising to within normal limits  in R ear and moderate conductive hearing loss rising to within normal limits in L ear.': 'Conductive hearing impairment',
 #'Bilateral myringotomy tube placement': 'PLACEHOLDER',
 'bilateral mild to moderate conductive hearing loss rising to normal with hearing aids': 'Conductive hearing impairment',
 'Failed newborn hearing screen. Bilateral mild falling to moderate sensorineural hearing loss; hearing aids recommended.': 'Sensorineural hearing impairment',
 'Hearing loss with hearing aids until age 5 yrs': 'Hearing impairment',
 'Severe rising to mild hearing loss bilaterally. Unmasked bone conduction testing  suggests a mixed component for at least one ear. Wears binaural behind-the-ear hearing aids.': 'Hearing impairment',
 'Hearing loss requiring hearing aids': 'Hearing impairment',
 'Hearing loss of 30 dB at mean': 'Hearing impairment',
 'predominant on low frequencies without need for hearing devices': 'Hearing impairment',
 'Hearing loss': 'Hearing impairment',
 'hearing devices. Cholesteatoma removed.': ['Cholesteatoma', 'Hearing impairment'],
 'Failed newborn hearing screen. ABR results suggest normal hearing for 2000-4000 Hz in right ear and mild to moderate hearing loss for speech spectrum in left ear.': 'Hearing impairment',
 'Mixed hearing loss': 'Mixed hearing impairment',
 'bilaterally diagnosed at age 2 yrs. Wears hearing aids. History of pressure equalizing tubes.': 'Hearing impairment',
 'Conductive hearing loss requiring hearing aids': 'Conductive hearing impairment',
 'Suspected hearing loss': 'Hearing impairment',
}
hearingMapper = OptionColumnMapper(column_name='Hearing',concept_recognizer=hpo_cr, option_d=hearing_d)
column_mapper_list.append(hearingMapper)
hearingMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Conductive hearing impairment (HP:0000405) (observed),6
1,Chronic otitis media (HP:0000389) (observed),2
2,Hearing impairment (HP:0000365) (observed),14
3,Stenosis of the external auditory canal (HP:0000402) (observed),1
4,Bilateral conductive hearing impairment (HP:0008513) (observed),1
5,Sensorineural hearing impairment (HP:0000407) (observed),1
6,Cholesteatoma (HP:0009797) (observed),1
7,Mixed hearing impairment (HP:0000410) (observed),1


In [20]:
eyes_d = {'Exotropia': 'Exotropia',
 'Myopia': 'Myopia',
 'Poor vision in right eye due to dense amblyopia. Left partial 3rd nerve palsy with limited upgaze': 'Visual impairment',
 'Hypermetropia': 'Hypermetropia',
 'Cerebral visual impairment': 'Cerebral visual impairment',
 'convergent strabismus': 'Esodeviation',
 'nystagmus': 'Nystagmus',
 'Strabismus': 'Strabismus',
 'amblyopia': 'Amblyopia',
 'high myopia': 'High myopia',
 #'Myelinated nerve fibres (right side only)': 'PLACEHOLDER',
 #'elevated optic discs': 'PLACEHOLDER',
 #'absent optic cup': 'PLACEHOLDER',
 'Had esotropia secondary to bilateral 6th nerve palsies from hydrocephalus': 'Hydrocephalus',
 'resolved. Hyperopia.': 'Hypermetropia',
 'Minor myopia in one eye; other normal refraction. Strabismus surgery.': 'Myopia',
 'Bilateral optic disc pallor': 'Optic disc pallor',
 'cortical visual impairment (R 20/80': 'Cerebral visual impairment',
 'large angle left exotropia (secondary to craniosynostosis)': 'Craniosynostosis',
 'Brushfield spots': 'Brushfield spots',
 'Vision is good. Strabismus.': 'Strabismus',
 'Strabismus. Normal vision.': 'Strabismus',
 'Abnormal VER': 'Abnormality of visual evoked potentials',
 'Incomplete coloboma of the right optic nerve': 'Coloboma',
 'exotropia': 'Exotropia',
 'Divergent strabismus': 'Exodeviation',
 'Myopia and astigmatism': 'Astigmatism',
 'Mild myopia': 'Mild myopia',
 'Astigmatism': 'Astigmatism',
 'wears glasses for myopia': 'Myopia',
 'Mild myopia bilaterally': 'Mild myopia',
 #'slightly anomalous optic nerves': 'PLACEHOLDER',
 'Convergent right strabismus': 'Strabismus',
 'Optic atrophy': 'Optic atrophy'}
eyesMapper = OptionColumnMapper(column_name='Eyes, vision',concept_recognizer=hpo_cr, option_d=eyes_d)
column_mapper_list.append(eyesMapper)
eyesMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Exotropia (HP:0000577) (observed),2
1,Myopia (HP:0000545) (observed),3
2,Visual impairment (HP:0000505) (observed),1
3,Hypermetropia (HP:0000540) (observed),2
4,Cerebral visual impairment (HP:0100704) (observed),2
5,Esodeviation (HP:0020045) (observed),1
6,Nystagmus (HP:0000639) (observed),1
7,Strabismus (HP:0000486) (observed),4
8,Amblyopia (HP:0000646) (observed),1
9,High myopia (HP:0011003) (observed),1
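
The option dictionaries in this notebook key on text fragments rather than on whole cells. A minimal sketch of that idea, assuming plain case-sensitive substring matching; `map_cell` and the sample cell are hypothetical and are not pyphetools code, and the library's actual matching rules may differ:

```python
# Hypothetical stand-in for fragment-based mapping; not the pyphetools implementation.
def map_cell(cell, option_d):
    """Return the sorted, de-duplicated labels whose key occurs in the cell text."""
    labels = {label for fragment, label in option_d.items() if fragment in cell}
    return sorted(labels)

# A few entries copied from eyes_d.
sample_d = {
    'Myopia': 'Myopia',
    'nystagmus': 'Nystagmus',
    'Strabismus': 'Strabismus',
    'high myopia': 'High myopia',
}

# Invented cell text, for illustration only.
cell = "Strabismus, nystagmus, high myopia"
labels = map_cell(cell, sample_d)
print(labels)
```

Under this assumption the capitalised key `'Myopia'` does not match the lower-case text, which would explain why `eyes_d` carries both `'Exotropia'` and `'exotropia'` as separate keys.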


In [21]:
urogenital_d = {
 #'Well-differentiated endometrioid adenocarcinoma at 36 yrs. Normal  liver': 'PLACEHOLDER',
 'Undescended testis': 'Cryptorchidism',
 'Unilateral undescended testis': 'Cryptorchidism',
 'Hypoplastic labia minora': 'Hypoplastic labia minora',
 'Undescended testes; s/p orchiopexy at 13 yrs': 'Cryptorchidism',
 'Right hydrocele testis': 'Hydrocele testis',
 'Hypospadias. Fetal hydronephrosis.': 'Hydronephrosis',
 'Early renal ultrasound showed bilateral small cortical renal cysts that later resolved. Normal renal corticomedullary differentiation': 'Renal cyst',
 'mild left pelviectasis.': 'Dilatation of the renal pelvis',
 'Bilateral undescended testes (operated)': 'Cryptorchidism',
 'Right undescended testicle': 'Cryptorchidism',
 'Left pelvic kidney': 'Pelvic kidney',
 'Multicystic-dysplastic kidney with compensatory hypertrophy of opposite kidney': 'Multicystic kidney dysplasia',
 'Bilateral mild hydronephrosis': 'Hydronephrosis',
 'possible stage I chronic renal disease versus renal dysplasia': 'Renal dysplasia',
 'malrotated right kidney.': 'Renal malrotation',
 'Clitoris hypertrophy': 'Clitoral hypertrophy',
 'Fetal hydronephrosis': 'Hydronephrosis',
 'Unilateral pelvic kidney': 'Pelvic kidney',
 'Cryptorchidism': 'Cryptorchidism',
 'Pelvic right kidney': 'Pelvic kidney',
 'Hypospadias': 'Hypospadias',
 'pelvic kidney': 'Pelvic kidney',
 'Hypospadias (mild)': 'Hypospadias',
 'unilateral cryptorchidism': 'Unilateral cryptorchidism',
 'Varicocele on the left side of scrotum': 'Varicocele'}
urogenitalMapper = OptionColumnMapper(column_name='Urogenital',concept_recognizer=hpo_cr, option_d=urogenital_d)
column_mapper_list.append(urogenitalMapper)
urogenitalMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Cryptorchidism (HP:0000028) (observed),6
1,Hypoplastic labia minora (HP:0000064) (observed),1
2,Hydrocele testis (HP:0000034) (observed),1
3,Hydronephrosis (HP:0000126) (observed),3
4,Renal cyst (HP:0000107) (observed),1
5,Dilatation of the renal pelvis (HP:0010946) (observed),1
6,Pelvic kidney (HP:0000125) (observed),4
7,Multicystic kidney dysplasia (HP:0000003) (observed),1
8,Renal dysplasia (HP:0000110) (observed),1
9,Renal malrotation (HP:0004712) (observed),1


In [22]:
other_phenotypes_d = {
 'Inguinal hernia': 'Inguinal hernia',
 'Obesity (BMI=39)': 'Obesity',
 'hirsutism on face and arms': 'Hirsutism',
 'long history of migraines. Carpal tunnel syndrome at the age of 30 yrs.': 'Constrictive median neuropathy',
 'Progressive alopecia of the vertex at 18 yrs': 'Progressive alopecia',
 'Cutis marmorata. Keratosis pilaris.': 'Cutis marmorata',
 'Umbilical hernia': 'Umbilical hernia',
 'Anemia. Low ferritin.': 'Anemia',
 'excess nuchal skin': 'Thickened nuchal skin fold',
 'Prominent scalp veins': 'Prominent scalp veins',
 'probably due to lipodystrophy': 'Lipodystrophy',
 'wide-spaced nipples. Bilateral lower extremity edema at 13 yrs': 'Wide intermamillary distance',
 'Recurrent pneumonia. Chest X-ray: suspected eventration of the diaphragm.': 'Recurrent pneumonia',
 'Prominent abdomen with umbilical hernia. Localized skin pigmentation disorder with hypopigmentation and hyperpigmentation (abdomen and back). Fine hairs.': 'Hypopigmentation of the skin',
 'Inverted nipples. Pseudotumor cerebri s/p lumboperitoneal shunt placement. Keratosis pilaris on his back with eczema of fingers and near the tips. Enuresis.': 'Inverted nipples',
 'At birth: excess nuchal skin posteriorly': 'Thickened nuchal skin fold',
 'loose skin': 'Cutis laxa',
 'Hypoglycemia. Inverted nipples.': 'Inverted nipples',
 'Inguinal hernia s/p repair': 'Inguinal hernia',
 'Cutis marmorata': 'Cutis marmorata',
 'diastasis recti': 'Diastasis recti',
 'hyperreflexia': 'Hyperreflexia',
 'hiatal hernia': 'Hiatus hernia',
 'Premature balding. Inverted nipples. Edema of lower limbs.': 'Inverted nipples',
 'Poor sweating. Dry skin. Keratosis pilaris.': 'Hypohidrosis',
 'Poor sweating': 'Hypohidrosis',
 'Hypothyroidism': 'Hypothyroidism',
 'Inverted nipples': 'Inverted nipples',
 'Hypertrichosis lower back. Restrictive lung disease due to left hemidiaphragm elevation and scoliosis. Poor sleep.': 'Restrictive ventilatory defect',
 'Fair complexion for his family with an ongoing mottled  appearance to the skin. Wide and low-placed nipples.': 'Wide intermamillary distance',
 'Lymphedema in the legs': 'Lymphedema',
 'supernumerary nipple': 'Supernumerary nipple',
 'inverted nipples': 'Inverted nipples',
 'Recurrent infections': 'Recurrent infections',
 'Right inguinal hernia': 'Inguinal hernia',
 'Weak cry. Small umbilical hernia.': 'Weak cry',
 'Frequent upper respiratory infections': 'Recurrent upper respiratory tract infections',
 #"High pain threshold versus doesn't communicate pain": 'PLACEHOLDER',
 'Inguinal hernia (requiring surgery)': 'Inguinal hernia',
 'Cervical syringomyelic cavity': 'Syringomyelia',
 'Frequent infections': 'Recurrent infections',
 'atopic dermatitis': 'Atopic dermatitis'}
other_phenotypesMapper = OptionColumnMapper(column_name='Other phenotypes',concept_recognizer=hpo_cr, option_d=other_phenotypes_d)
column_mapper_list.append(other_phenotypesMapper)
other_phenotypesMapper.preview_column(df)

Unnamed: 0,mapping,count
0,Inguinal hernia (HP:0000023) (observed),7
1,Obesity (HP:0001513) (observed),1
2,Hirsutism (HP:0001007) (observed),1
3,Constrictive median neuropathy (HP:0012185) (observed),1
4,Progressive alopecia (HP:0002287) (observed),1
5,Cutis marmorata (HP:0000965) (observed),2
6,Umbilical hernia (HP:0001537) (observed),1
7,Anemia (HP:0001903) (observed),1
8,Thickened nuchal skin fold (HP:0000474) (observed),2
9,Prominent scalp veins (HP:0001043) (observed),1


In [23]:
traf7_id = "HGNC:20456"
# The spreadsheet header cites NM_032271.2; variants are encoded here against NM_032271.3.
traf7_transcript = "NM_032271.3"
vman = VariantManager(df=df, individual_column_name="Patient", allele_1_column_name="Variant (cDNA, with reference to NM_032271.2)",
                      gene_id=traf7_id, gene_symbol="TRAF7", transcript=traf7_transcript)

In [24]:
var_d = vman.get_variant_d()
print(f"Encoded {len(var_d)} unique variants")
vman.to_summary()

Encoded 23 unique variants


Unnamed: 0,status,count,alleles
0,mapped,23,"c.1936G>C , c.1555C>T, c.1964G>A, c.1873C>G, c.1673C>T, c.1851C>G, c.1494G>T, c.1885A>C, c.1850T>C, c.1570C>T, c.1708C>G, c.1089C>A, c.1204C>G, c.1223G>A, c.1148A>C, c.1975G>T, c.1097C>T, c.1673C>A, c.1109C>T, c.981C>A, c.1211T>A, c.1849T>C, c.1328T>G"
1,unmapped,0,
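
The allele list in the summary contains one entry with a trailing space (`c.1936G>C `). When preparing a variant column, it can help to check for stray whitespace and for strings that are not simple substitutions. A minimal standard-library sketch; the regular expression covers only `c.<position><ref>><alt>` substitutions, and the helper name `check_alleles` and the malformed sample value are assumptions made for illustration:

```python
import re

# Matches simple coding substitutions such as c.981C>A; other HGVS forms are not covered.
SUBSTITUTION = re.compile(r"^c\.\d+[ACGT]>[ACGT]$")

def check_alleles(alleles):
    """Split alleles into (clean, needs_review) after stripping whitespace."""
    clean, needs_review = [], []
    for raw in alleles:
        allele = raw.strip()
        if SUBSTITUTION.match(allele):
            clean.append(allele)
        else:
            needs_review.append(allele)
    return clean, needs_review

# The last value is deliberately malformed to show the review bucket.
clean, needs_review = check_alleles(["c.1936G>C ", "c.1555C>T", "c.981C>A", "981C>A"])
print(clean)
print(needs_review)
```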


In [25]:
varMapper = VariantColumnMapper(variant_d=var_d,
                               variant_column_name="Variant (cDNA, with reference to NM_032271.2)",
                               default_genotype="heterozygous")

In [26]:
sexMapper = SexColumnMapper(male_symbol="M", female_symbol="F", column_name="Gender")
#sexMapper.preview_column(df)
# Approximate age at publication (2020); only the year of birth is available.
df["Age in years"] = 2020 - df['Year of birth ']
ageMapper = AgeColumnMapper.by_year('Age in years')
#ageMapper.preview_column(df)
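
The age column is derived by subtracting the year of birth from 2020, the publication year of the source article, and the phenopacket table in this notebook shows ages as ISO 8601 durations such as `P10Y`. The arithmetic can be sketched as follows; `age_as_iso8601` is a hypothetical helper, not a pyphetools function:

```python
REFERENCE_YEAR = 2020  # publication year of the source article

def age_as_iso8601(year_of_birth, reference_year=REFERENCE_YEAR):
    """Return an approximate age as an ISO 8601 duration in whole years."""
    age = reference_year - year_of_birth
    if age < 0:
        raise ValueError(f"year of birth {year_of_birth} is after {reference_year}")
    return f"P{age}Y"

# Patient 1 in the spreadsheet was born in 2010.
print(age_as_iso8601(2010))
```

Because only the birth year is known, an age computed this way can be off by up to one year.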

In [28]:
encoder = CohortEncoder(df=df, 
                        hpo_cr=hpo_cr, 
                        column_mapper_list=column_mapper_list, 
                        individual_column_name="Patient", 
                        age_at_last_encounter_mapper=ageMapper, 
                        sexmapper=sexMapper,
                        variant_mapper=varMapper, 
                        metadata=metadata)
# OMIM:618164: Cardiac, facial, and digital anomalies with developmental delay
disease = Disease(disease_id='OMIM:618164', disease_label='Cardiac, facial, and digital anomalies with developmental delay')
encoder.set_disease(disease)


In [29]:
individuals = encoder.get_individuals()

In [30]:
cvalidator = CohortValidator(cohort=individuals, ontology=hpo_ontology, min_hpo=1, allelic_requirement=AllelicRequirement.MONO_ALLELIC)
qc = QcVisualizer(cohort_validator=cvalidator)
display(HTML(qc.to_summary_html()))

Level,Error category,Count
WARNING,REDUNDANT,15
INFORMATION,NOT_MEASURED,43
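
The `REDUNDANT` warnings appear to flag individuals annotated with both a term and one of its ancestors, for example `Hearing impairment` together with `Conductive hearing impairment`. A minimal sketch of such a check, assuming a toy two-edge hierarchy in place of the full HPO graph; the `PARENTS` table and the function names are hypothetical, and the real validator may apply additional rules:

```python
# Toy child -> parent edges; the real HPO graph is far larger and allows multiple parents.
PARENTS = {
    "HP:0000405": "HP:0000365",  # Conductive hearing impairment -> Hearing impairment
    "HP:0001488": "HP:0000508",  # Bilateral ptosis -> Ptosis
}

def ancestors(term_id):
    """Collect every ancestor of term_id by walking parent links."""
    found = set()
    while term_id in PARENTS:
        term_id = PARENTS[term_id]
        found.add(term_id)
    return found

def drop_redundant(term_ids):
    """Keep only terms that are not an ancestor of another term in the list."""
    implied = set()
    for term_id in term_ids:
        implied |= ancestors(term_id)
    return [t for t in term_ids if t not in implied]

kept = drop_redundant(["HP:0000365", "HP:0000405", "HP:0000508"])
print(kept)
```

Here `HP:0000365` is dropped because the more specific `HP:0000405` already implies it, while `HP:0000508` stays because its child term is not in the list.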


In [31]:
individuals = cvalidator.get_error_free_individual_list()
table = PhenopacketTable(individual_list=individuals, metadata=metadata)
display(HTML(table.to_html()))

Individual,Disease,Genotype,Phenotypic features
1 (FEMALE; P10Y),"Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.981C>A (heterozygous),Autistic behavior (HP:0000729); Delayed speech and language development (HP:0000750); Specific learning disability (HP:0001328); Generalized non-motor (absence) seizure (HP:0002121); Exotropia (HP:0000577); excluded: Motor delay (HP:0001270); excluded: Feeding difficulties (HP:0011968)
2 (MALE; P2Y),"Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1089C>A (heterozygous),Nuchal cord (HP:0012498); Feeding difficulties (HP:0011968); Dysphagia (HP:0002015); Global developmental delay (HP:0001263); Motor delay (HP:0001270); Multifocal seizures (HP:0031165); Hypsarrhythmia (HP:0002521); Nevus flammeus nuchae (HP:0007616); Bilateral talipes equinovarus (HP:0001776)
3 (MALE; P4Y),"Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1097C>T (heterozygous),Failure to thrive (HP:0001508); Global developmental delay (HP:0001263); Delayed gross motor development (HP:0002194); Thin corpus callosum (HP:0033725); Ventriculomegaly (HP:0002119); Periventricular leukomalacia (HP:0006970); Delayed CNS myelination (HP:0002188); Trigonocephaly (HP:0000243); Plagiocephaly (HP:0001357); Short palpebral fissure (HP:0012745); Ptosis (HP:0000508); Short nose (HP:0003196); Long philtrum (HP:0000343); Thin upper lip vermilion (HP:0000219); Low-set ears (HP:0000369); Short neck (HP:0000470); Pectus carinatum (HP:0000768); Inguinal hernia (HP:0000023); excluded: Seizure (HP:0001250); excluded: Feeding difficulties (HP:0011968)
4 (FEMALE; P40Y),"Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1109C>T (heterozygous),Short stature (HP:0004322); Specific learning disability (HP:0001328); Bipolar affective disorder (HP:0007302); Delayed ability to walk (HP:0031936); Bilateral tonic-clonic seizure (HP:0002069); Generalized non-motor (absence) seizure (HP:0002121); Synophrys (HP:0000664); Round face (HP:0000311); Prominent nasal bridge (HP:0000426); Smooth philtrum (HP:0000319); Underdeveloped nasal alae (HP:0000430); Increased overbite (HP:0011094); Widely spaced teeth (HP:0000687); Hip dysplasia (HP:0001385); Short neck (HP:0000470); Myopia (HP:0000545); Obesity (HP:0001513); Hirsutism (HP:0001007); Constrictive median neuropathy (HP:0012185); excluded: Feeding difficulties (HP:0011968)
5 (FEMALE; P22Y),"Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1148A>C (heterozygous),Hypotonia (HP:0001252); Delayed ability to walk (HP:0031936); Blepharophimosis (HP:0000581); Epicanthus (HP:0000286); Telecanthus (HP:0000506); Bulbous nose (HP:0000414); Overfolded helix (HP:0000396); Agenesis of premolar (HP:0011051); Clinodactyly (HP:0030084); Patellar subluxation (HP:0010499); Pectus carinatum (HP:0000768); Atrioventricular canal defect (HP:0006695); Conductive hearing impairment (HP:0000405); Chronic otitis media (HP:0000389); Progressive alopecia (HP:0002287); excluded: Seizure (HP:0001250)
"6 (= patient DDD4K.01539 in DDD study, Nature, 2017) (FEMALE; P12Y)","Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1204C>G (heterozygous),Atrial septal defect (HP:0001631); Feeding difficulties (HP:0011968); Patent ductus arteriosus (HP:0001643); Delayed ability to walk (HP:0031936); Periorbital fullness (HP:0000629); Anteverted nares (HP:0000463); Cutis marmorata (HP:0000965); Cleft palate (HP:0000175); Neonatal respiratory distress (HP:0002643); Clinodactyly (HP:0030084); Epicanthus (HP:0000286); Blepharophimosis (HP:0000581); Visual impairment (HP:0000505); Seizure (HP:0001250); Micrognathia (HP:0000347); Delayed speech and language development (HP:0000750); Wide anterior fontanel (HP:0000260); U-Shaped upper lip vermilion (HP:0010806); Polyhydramnios (HP:0001561); Bilateral ptosis (HP:0001488); Conductive hearing impairment (HP:0000405); Brachycephaly (HP:0000248); Delayed ability to sit (HP:0025336); Highly arched eyebrow (HP:0002553); Thoracic hypoplasia (HP:0005257); Prominent forehead (HP:0011220); Low-set ears (HP:0000369); Hypotonia (HP:0001252)
7 (FEMALE; P4Y),"Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1204C>G (heterozygous),Short neck (HP:0000470); Feeding difficulties (HP:0011968); Lethargy (HP:0001254); Torticollis (HP:0000473); Short palpebral fissure (HP:0012745); Hypotonia (HP:0001252); Sepsis (HP:0100806); Respiratory failure (HP:0002878); Talipes equinovarus (HP:0001762); excluded: Seizure (HP:0001250)
"8 (= patient DDD4K.03113 in DDD study, Nature, 2017) (MALE; P13Y)","Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1211T>A (heterozygous),Increased nuchal translucency (HP:0010880); Global developmental delay (HP:0001263); Delayed ability to walk (HP:0031936); Periventricular leukomalacia (HP:0006970); Ptosis (HP:0000508); Epicanthus (HP:0000286); Blepharophimosis (HP:0000581); Anteverted nares (HP:0000463); Tented upper lip vermilion (HP:0010804); Short neck (HP:0000470); Patent ductus arteriosus (HP:0001643); Hearing impairment (HP:0000365); Hypermetropia (HP:0000540); Cryptorchidism (HP:0000028); Umbilical hernia (HP:0001537); excluded: Seizure (HP:0001250); excluded: Feeding difficulties (HP:0011968)
9 (MALE; P9Y),"Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1223G>A (heterozygous),"Patent ductus arteriosus (HP:0001643); Feeding difficulties (HP:0011968); Inguinal hernia (HP:0000023); Cerebral visual impairment (HP:0100704); Axial hypotonia (HP:0008936); Thin corpus callosum (HP:0033725); Epicanthus (HP:0000286); Blepharophimosis (HP:0000581); Esodeviation (HP:0020045); Absent speech (HP:0001344); Appendicular spasticity (HP:0034353); Cryptorchidism (HP:0000028); Infantile spasms (HP:0012469); Global developmental delay (HP:0001263); Nystagmus (HP:0000639); Hydrocephalus (HP:0000238); Hypertelorism (HP:0000316); Low-set ears (HP:0000369); Intellectual disability, severe (HP:0010864)"
10 (MALE; P9Y),"Cardiac, facial, and digital anomalies with developmental delay (OMIM:618164)",NM_032271.3:c.1223G>A (heterozygous),Posteriorly rotated ears (HP:0000358); Short palpebral fissure (HP:0012745); Low-set ears (HP:0000369); Global developmental delay (HP:0001263); Telecanthus (HP:0000506); Subarachnoid hemorrhage (HP:0002138); Epicanthus (HP:0000286); Muscular ventricular septal defect (HP:0011623); Chronic otitis media (HP:0000389); Anemia (HP:0001903); Subdural hemorrhage (HP:0100309); excluded: Seizure (HP:0001250); excluded: Feeding difficulties (HP:0011968)


In [32]:
Individual.output_individuals_as_phenopackets(individual_list=individuals,
                                              metadata=metadata)

We output 45 GA4GH phenopackets to the directory phenopackets
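
Each output file is a GA4GH phenopacket in JSON form. As a rough illustration of the structure, the sketch below hand-builds a heavily reduced record for individual 1 using field names from the Phenopacket Schema v2. The `id` value and the selection of features are assumptions; the files written by pyphetools contain further sections such as metadata, disease, and variant interpretations.

```python
import json

# Hand-written, reduced example; not the exact output of pyphetools.
phenopacket = {
    "id": "PMID_32376980_1",  # hypothetical identifier
    "subject": {
        "id": "1",
        "sex": "FEMALE",
        "timeAtLastEncounter": {"age": {"iso8601duration": "P10Y"}},
    },
    "phenotypicFeatures": [
        {"type": {"id": "HP:0000577", "label": "Exotropia"}},
        # Explicitly excluded findings carry excluded=true.
        {"type": {"id": "HP:0001270", "label": "Motor delay"}, "excluded": True},
    ],
}

text = json.dumps(phenopacket, indent=2)
print(text)
```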
