First you need to link your Google Drive to the notebook in order to access the files needed for this module.

Run the cell below and follow instructions to mount the drive.

In [1]:
from google.colab import drive
drive.mount('/content/drive')

Mounted at /content/drive


## Installing Biopython

At the beginning of each module, we will install **Biopython**. Biopython is a large open-source application programming interface (API) used in both bioinformatics software development and in everyday scripts for common bioinformatics tasks. It contains several packages that you will need to import which will allow you to run the analyses required for this project. 

REF:
* Cock, P. J., Antao, T., Chang, J. T., Chapman, B. A., Cox, C. J., Dalke, A., Friedberg, I., Hamelryck, T., Kauff, F., Wilczynski, B., & de Hoon, M. J. (2009). Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics (Oxford, England), 25(11), 1422–1423. https://doi.org/10.1093/bioinformatics/btp163


In [2]:
!pip install biopython

Collecting biopython
  Downloading biopython-1.79-cp37-cp37m-manylinux_2_5_x86_64.manylinux1_x86_64.whl (2.3 MB)
[K     |████████████████████████████████| 2.3 MB 26.5 MB/s 
Installing collected packages: biopython
Successfully installed biopython-1.79


# Investigating the biological impact of the mutation and its possible role in human disease
For this section, your research will focus on investigating the biological impact of the mutation you are studying. To do this, you will use the OMIM and KEGG databases.

## OMIM Search for information on genetic diseases

The **OMIM** (Online Mendelian Inheritance of Man) database contains short, referenced reviews about genetic loci and genetic diseases. It
can be a very useful resource for finding out what type of research has been done on a gene or a disease.

REF:
* http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM

## Install and import the necessary packages:

The **romim package** was created to query the OMIM database but it runs in R. 

**R** is another programming language so you will need to install **rpy2** to run R code in Google Colab.

**Methods** and **remotes** are R packages that help us both install the package and use the functions in the code.

The **XML** package will be used to read the results that you obtain from your database searches. 

REF:
* https://github.com/davetang/romim

In [3]:
# Install the rpy2 interface to run R code
%load_ext rpy2.ipython

In [4]:
%%R # This must precede all R code in Colab, to allow R code to run 

# Installing the main package
# Note how different it is from Python code
remotes::install_github('davetang/romim')

# Import the library associated with the package
library(romim)

# Intalling several packages
install.packages('XML')
install.packages('methods')
install.packages("remotes")

# Press 1 and ENTER when prompted

R[write to console]: Downloading GitHub repo davetang/romim@HEAD

R[write to console]: Installing 1 packages: XML

R[write to console]: Installing package into ‘/usr/local/lib/R/site-library’
(as ‘lib’ is unspecified)

R[write to console]: trying URL 'https://cran.rstudio.com/src/contrib/XML_3.99-0.9.tar.gz'

R[write to console]: Content type 'application/x-gzip'
R[write to console]:  length 970514 bytes (947 KB)

R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write

* checking for file ‘/tmp/Rtmp7vz4zn/remotes3d41acd31e/davetang-romim-594c6b0/DESCRIPTION’ ... OK
* preparing ‘romim’:
* checking DESCRIPTION meta-information ... OK
* checking for LF line-endings in source and make files and shell scripts
* checking for empty or unneeded directories
Omitted ‘LazyData’ from DESCRIPTION
* building ‘romim_1.0.1.tar.gz’



R[write to console]: Installing package into ‘/usr/local/lib/R/site-library’
(as ‘lib’ is unspecified)

R[write to console]: Loading required package: XML

R[write to console]: 
Attaching package: ‘XML’


R[write to console]: The following object is masked from ‘package:tools’:

    toHTML


R[write to console]: Loading required package: magrittr

R[write to console]: Loading required package: xml2

R[write to console]: Installing package into ‘/usr/local/lib/R/site-library’
(as ‘lib’ is unspecified)

R[write to console]: trying URL 'https://cran.rstudio.com/src/contrib/XML_3.99-0.9.tar.gz'

R[write to console]: Content type 'application/x-gzip'
R[write to console]:  length 970514 bytes (947 KB)

R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console]: =
R[write to console

## Obtaining the ID number (called mim number) associated with the Lesch-Nyhan syndrome entry in OMIM

In [5]:
%%R # This must precede all R code in Colab, to allow R code to run 

# To access OMIM, we will use this key which will work as our password to access the database
set_key('4PUvWRqSSD2BuprIVAP_VQ') 

# First lets get a list of the entries associated with KRAS2
# Write HPRT1 in the parenthesis below to create a list of the entries
my_list <- gene_to_omim('####')

# Now lets obtain the mim number (with get_omim) and list it with our entries
my_list_omim <- sapply(my_list, get_omim)

# This will append the title of the entry to the list
sapply(my_list_omim, get_title)

                            300323                             300322 
"HYPERURICEMIA, HPRT-RELATED; HRH"        "LESCH-NYHAN SYNDROME; LNS" 


Write down the ID number for 'LESCH-NYHAN SYNDROME; LNS' from the results above. (ID precedes the entry title)

Answer here

1. What syndrome (other than LSC) is also associated with reduced activity of HPRT1?


Answer here

## Using OMIM to obtain more information about the disease
This time you will search the OMIM 'LESCH-NYHAN SYNDROME; LNS' entry for information.

The function 'get_omim' helps you do just that since we can set certain arguments to 'TRUE' and obtain specific information about the entry.

Run the next cell to see a list of Arguments that you can access.

In [None]:
%%R
help(get_omim)

In [6]:
# Search OMIM again to obtain description information
%%R
set_key('KEY HERE')

# Using mim number to get the entry, set 'text' argument to true
omim_result <- get_omim(###, text = ###)

# Save the xml to a file called molgen.xml
saveXML(omim_result, file="NAME THIS FILE.xml")

[1] "NAME THIS FILE.xml"


In [7]:
#@title Load the results by providing the file name in this form (include file extension .xml)

# MAKING RESULTS LOOK GOOD
import xml.etree.ElementTree as ET
import csv
import pandas as pd

file_name = "NAME THIS FILE.xml" #@param {type:"string"}

tree = ET.parse(file_name)
root = tree.getroot()
 
Ref_data4 = open('refdata4.csv', 'w')
 
csvwriter = csv.writer(Ref_data4)
allele_head = []

 
count = 0
for member in root.findall('.//textSection'):
    allele = []
    ref_list = []

    if count == 0:
      des = member.find('.//textSectionTitle').tag
      allele_head.append(des)
      
      mut = member.find('.//textSectionContent').tag
      allele_head.append(mut)
     
      csvwriter.writerow(allele_head)
      count = count + 1
       
    des = member.find('.//textSectionTitle').text
    allele.append(des)
    
    mut = member.find('.//textSectionContent').text
    allele.append(mut)

    csvwriter.writerow(allele)
  
Ref_data4.close()

data4= pd.read_csv("refdata4.csv")
pd.set_option('display.max_colwidth',10000)

data4

Unnamed: 0,textSectionTitle,textSectionContent
0,Text,"A number sign (#) is used with this entry because Lesch-Nyhan syndrome (LNS) is caused by mutation in the HPRT gene ({308000}), which encodes hypoxanthine guanine phosphoribosyltransferase, on chromosome Xq26."
1,Clinical Features,"The features of the Lesch-Nyhan syndrome are mental retardation, spastic cerebral palsy, choreoathetosis, uric acid urinary stones, and self-destructive biting of fingers and lips. Megaloblastic anemia has been found in some patients ({41:van der Zee et al., 1968}).\n\nVirtually complete deficiency of HPRT residual activity (less than 1.5%) is associated with the Lesch-Nyhan syndrome, whereas partial deficiency (at least 8%) is associated with HPRT-related hyperuricemia (HRH; {300323}), also known as Kelley-Seegmiller syndrome. LNS is characterized by abnormal metabolic and neurologic manifestations. In contrast, Kelley-Seegmiller syndrome is usually associated only with the clinical manifestations of excessive purine production. Renal stones, uric acid nephropathy, and renal obstruction are often the presenting symptoms of Kelley-Seegmiller syndrome, but rarely of LNS. After puberty, the hyperuricemia in Kelley-Seegmiller syndrome may cause gout. A third group of patients, with 1.5 to 8% of HPRT activity, is associated with a neurologic variant of LNS, with uric acid overproduction and neurologic disability that varies from minor clumsiness to debilitating extrapyramidal and pyramidal motor dysfunction ({20:Jinnah and Friedmann, 2001}).\n\n{2:Bakay et al. (1979)} restudied a patient with HPRT deficiency, choreoathetosis, spasticity, dysarthria, and hyperuricemia, but normal intelligence and no self-mutilation. (A maternal uncle had been identically affected.) Although HPRT deficiency seemed to be complete, cultured fibroblasts had some capacity for metabolism of hypoxanthine and guanine. {32:Page et al. (1987)} described 2 brothers and 2 of their maternal uncles who had HPRT deficiency as the cause of mild mental retardation, spastic gait, and pyramidal tract sign. They were, furthermore, short of stature with proximally placed thumbs and fifth finger clinodactyly. Activity of the enzyme was virtually zero in lysates of red cells or hair roots, but in intact fibroblasts the level of activity was 7.5% of normal. Kinetic studies also demonstrated differences. A sister of the brothers was, by enzyme assay, heterozygous. One of the affected uncles had advanced tophaceous gout by age 32 years.\n\n<Subhead> Clinical Variability\n\n{18:Hladnik et al. (2008)} reported a family in which 5 individuals carrying the same splice site mutation in the HPRT gene showed marked phenotypic variability resulting from HPRT deficiency. One patient had classic Lesch-Nyhan syndrome with delayed development, spasticity, dystonia, and self-injurious behavior. Two patients had an intermediate phenotype with mild cognitive and learning difficulties, dystonia, and increased uric acid, but no self-injurious behavior, and 2 had mild spasticity, gout, and normal IQ. {18:Hladnik et al. (2008)} postulated that each individual had various expression of the mutant and wildtype transcript, and emphasized that individuals with the same genotype may not necessarily have the identical phenotype.\n\n{36:Sarafoglou et al. (2010)} reported a 3-generation family in which 3 individuals carrying the same missense mutation in the HPRT1 gene showed phenotypic variability. The proband presented at age 14.5 months with increased uric acid levels and later showed mildly delayed development. His cousin was diagnosed at age 26 months, and had mild generalized hypotonia, delayed motor development, focal dystonia of the lower limbs, and mild developmental impairment with speech delay. The boys' 65-year-old grandfather was more severely affected, with borderline cognitive function, severe dyslexia, spasticity, and flexion contractures leading to motor impairment. He had a long history of gout, nephrolithiasis, and progressive renal dysfunction. Medical history revealed that his symptoms had been attributed to cerebral palsy due to perinatal asphyxia. Enzymatic studies of cultured fibroblasts showed decreased activity in the proband, more severely decreased activity in the cousin, and the most severely decreased activity in the grandfather, consistent with their phenotypes. Cells from the grandfather grew more slowly than those from the grandchildren and appeared less robust.\n\n{24:Madeo et al. (2019)} described clinical features in 101 French and Italian patients with HPRT mutations, including 66 with LNS, 22 with the LNS neurologic variant (which the authors called HPRT-related neurologic dysfunction; HND), and 13 with HRH. The clinical manifestations at disease onset were nonspecific, but orange discoloration in the diaper was reported in 22% of patients. Neurologic involvement was overall more severe in LNS than in HND. The median age of presentation of involuntary movements and self-injurious behavior in LNS was 1.0 and 3 years, respectively. At least one epileptic episode was reported in approximately 13% of patients with LNS and HND. Of the approximately 40% of LNS patients and 30% of HND patients who had an MRI, a quarter of the LNS patients had variable abnormalities, the most frequent being mild to moderate brain atrophy, and all HND patients had normal findings. Language delay was common in LNS and HND, but the majority of HND patients could speak in complex sentences whereas the majority of LNS patients could speak in simple sentences or speak single words. Language comprehension was generally preserved. Approximately 60% of patients with HND had cognitive impairment. Renal disease, seen in approximately 66% of patients, occurred with a median onset age of 1.1 years. Six patients (2 with LNS, 2 with HND and 2 with HRH) developed chronic renal failure. In 3 cases, renal failure was a consequence of chronic lithiasis, and in 2 patients renal failure was secondary to focal segmental glomerulosclerosis and nephrotic syndrome. Two patients underwent kidney transplantation. Gout, seen in approximately 26% of patients, occurred with a median age of onset of 18 years and was more frequent in HND and HRH than in LNS."
2,Biochemical Features,"A 200-fold increase in the conversion of C(14)-labeled glycine to uric acid was observed by {29:Nyhan et al. (1965)}. {37:Seegmiller et al. (1967)} demonstrated deficiency in the enzyme hypoxanthine-guanine phosphoribosyltransferase (HPRT). That the enzyme deficiency resulted in excessive purine synthesis suggested that the enzyme (or the product of its function) normally plays a controlling role in purine metabolism. Resistance to 8-azaguanine in cultured diploid human fibroblasts was induced by x-ray in pioneer experiments ({1:Albertini and DeMars, 1973}). Mutation in the HPRT gene is the basis for this resistance. Lesch-Nyhan cells are resistant to 8-azaguanine. {40:Upchurch et al. (1975)} found a normal amount of cross-reacting material in 1 of 12 patients with HPRT deficiency. The others had less than 3% of the normal amount. {13:Ghangas and Milman (1975)} confirmed this by another method. {43:Wilson et al. (1986)} analyzed cell lines of 24 patients with HPRT deficiency at the levels of residual protein, mRNA, and DNA. At least 16 patients had unique mutations of the HPRT gene. Most cell lines had normal quantities of mRNA but undetectable quantities of enzyme. Eight of the patients retained significant quantities of structurally altered but functionally abnormal HPRT enzyme variants. A minority of patients lacked both enzyme and mRNA.\n\n{10:Fu et al. (2015)} created fibroblast cultures for 21 healthy controls and 36 patients with a broad spectrum of disease severity, including Lesch-Nyhan syndrome, related to HPRT deficiency. The authors assessed hypoxanthine recycling, guanine recycling, steady-state purine pools, and de novo purine synthesis. There was a strong correlation between disease severity and either hypoxanthine or guanine recycling. Intracellular purines were normal in the HPRT-deficient fibroblasts, but purine wasting was evident as increased purine metabolites excreted from cells. The normal intracellular purines in the HPRT-deficient fibroblasts were likely due in part to a compensatory increase in purine synthesis, as demonstrated by a significant increase in purinosomes. However, the increase in purine synthesis did not appear to correlate with disease severity.\n\nThere is variable disease severity in patients with Lesch-Nyhan syndrome, with an inverse relationship between HPRT1 enzyme activity measured in intact cells and clinical severity. Patients with classic Lesch-Nyhan disease, the most severe and frequent form, have the lowest HPRT enzyme activity (less than 1.5% of normal) in intact cultured fibroblasts. Patients with partial HPRT deficiency, designated as Lesch-Nyhan variants, have HPRT1 enzyme activity ranging from 1.5 to 8.0%. Individuals with an intermediate variant form known as the 'neurologic variant' are neurologically indistinguishable from patients with Lesch-Nyhan disease, but they do not have self-injurious behaviors and intelligence is normal or near-normal. The least-affected patients with the variant form have residual HPRT1 enzyme activity exceeding 8%; their only manifestations are attributed to hyperuricemia, and include gout, hematuria, and nephrolithiasis (summary by {36:Sarafoglou et al., 2010})."
3,Clinical Management,"{24:Madeo et al. (2019)} reported the results of treatment in 67 French and Italian patients with LNS (51 patients) or the LNS neurologic variant (16 patients). Although none of the drugs used for neurologic involvement and behavioral abnormalities were fully effective, some therapies were associated with limited clinical improvement. Botulinum toxin and baclofen were partially beneficial for the control of dystonia. Gabapentin was partially efficacious for dystonia and self-injurious behavior in the 13 patients who received it. Among the 12 patients with LNS who were treated with levodopa, one half reported a partial benefit for dystonia and behavior. The average age of patients who improved with levodopa was 3.2 years, whereas the average age of the patients who did not improve was 9.0 years, suggesting improved efficacy with earlier administration. Tetrabenazine was associated with improvement of dystonia in most of the 6 patients to whom it was administered. {24:Madeo et al. (2019)} also described results of treatment for renal and joint symptoms in these patients as well as in 9 patients with HRH. Fifty-four patients had improvement with allopurinol treatment, but allopurinol was interrupted in 9.3% of cases due to lack of efficacy or appearance of xanthine stones. Febuxostat was the most frequent alternative therapy. Half of patients received urinary alkalinizing agents. Three patients with persistent uric acid lithiasis and 1 patient with severe gout who were not responsive to standard treatment responded to treatment with recombinant urate oxidase."
4,Inheritance,"X-linkage was first suggested by {19:Hoefnagel et al. (1965)} and was supported by a rapidly accumulated series of families with deficiency of HPRT. {34:Rosenbloom et al. (1967)} and {25:Migeon et al. (1968)} demonstrated 2 populations of fibroblasts, as regards the relevant enzyme activity, in heterozygous females, thus providing support both for X-linkage and for the Lyon hypothesis. Studies using human-mouse somatic cell hybrids indicate, by reasoning similar to that used for locating the thymidine kinase locus to chromosome 17 ({188300}), that the HPRT locus is on the X chromosome ({27:Nabholz et al., 1969}). Mosaicism can be demonstrated by study of hair roots in women heterozygous for the Lesch-Nyhan syndrome ({38:Silvers et al., 1972}). {8:Francke et al. (1976)} studied the frequency of new mutations among affected males. The Lesch-Nyhan syndrome is particularly favorable for this purpose because no affected males reproduce, the diagnosis is unequivocal and cases come readily to attention, and particularly because heterozygosity can be demonstrated in females by the existence of 2 populations of cultured fibroblasts. There were few new mutations, contrary to the expected one-third. On the other hand, about one-half of heterozygous females were new mutations, as is predicted by theory. The finding may indicate a higher frequency of mutation in males than in females. Another possibility is the role of somatic and half-chromatid mutations ({12:Gartler and Francke, 1975}). New mutation cases of heterozygous females had elevated parental age. {42:Vogel (1977)} reviewed the evidence concerning hemophilia and the Lesch-Nyhan syndrome leading to the conclusion that the mutation rate is higher in males than in females. Evidence that the mutation rate for the Lesch-Nyhan disease may be higher in males than in females was reviewed by {8:Francke et al. (1976)} and criticized by {26:Morton and Lalouel (1977)}. {9:Francke et al. (1977)} answered the criticism. {39:Strauss et al. (1980)} showed that females heterozygous for the Lesch-Nyhan mutation have 2 populations of peripheral blood lymphocytes with regard to sensitivity to 6-thioguanine inhibition of tritiated thymidine incorporation following phytohemagglutinin stimulation. {17:Henderson et al. (1969)} concluded that the locus for HPRT is closely linked to the Xg ({314700}) locus; {16:Greene et al. (1970)} concluded, however, that the HPRT and Xg loci 'are sufficient distance from each other on the human X chromosome that linkage cannot be detected.' {28:Nyhan et al. (1970)} observed a sibship in which both HPRT deficiency and G6PD deficiency ({300908}) were segregating and found 2 of 4 recombinants. {28:Nyhan et al. (1970)} also found that heterozygotes had normal levels of HPRT in red cells. They interpreted this as indicating a selective advantage of G6PD-normal over G6PD-deficient cells. (In adrenoleukodystrophy ({300100}), it is the mutant cell that enjoys the selective advantage.)\n\n{45:Yukawa et al. (1992)} described a seemingly typical case of Lesch-Nyhan syndrome in a female with a normal karyotype. The parents were nonconsanguineous. In addition to unusual lyonization, uniparental disomy is a possible explanation."
5,Pathogenesis,"<Subhead> Pathogenesis of Mental Retardation and Self-injurious Behavior\n\n{44:Wong et al. (1996)} discussed 3 lines of evidence that had suggested that HPRT deficiency is associated with abnormal dopamine (DA) function in LNS: (1) an autopsy study of 3 LNS subjects demonstrated a marked reduction in the DA content and in the activity of DNA-synthesizing enzymes in the caudate and putamen ({23:Lloyd et al., 1981}); (2) when neonatal rats are depleted of DA with the neurotoxin 6-hydroxydopamine, self-injurious behavior, similar to that seen in LNS, occurred when the rats were challenged with 3,4-dihydroxyphenylalanine (L-DOPA) as adults ({4:Breese et al., 1990}); and (3) in an HPRT-deficient mutant mouse strain, there is a reduction of striatal tyrosine hydroxylase and in the number of striatal dopamine transporters ({21:Jinnah et al., 1994}). To establish that DA deficiency is present in LNS, {44:Wong et al. (1996)} used a ligand that binds to DA transporters to estimate the density of DA-containing neurons in the caudate and putamen of 6 subjects with classic LNS. They made comparisons with 10 control subjects and 3 patients with Rett syndrome ({312750}). Depending on the method of analysis, a 50 to 63% reduction of the binding to DA transporters in the caudate and a 64 to 75% reduction in the putamen of LNS patients was observed compared to the normal control group; similar reductions were found between Rett syndrome and LNS patients. Volumetric magnetic resonance imaging studies detected a 30% reduction in the caudate volume of LNS patients. To ensure that a reduction in the caudate volume would not confound the results, {44:Wong et al. (1996)} performed a rigorous partial volume correction of the caudate time activity curve. This correction resulted in an even greater decrease in the caudate-cerebellar ratio in LNS patients when contrasted to controls.\n\n{7:Ernst et al. (1996)} concluded that patients with Lesch-Nyhan disease have abnormally few dopaminergic nerve terminals and cell bodies. The abnormality involves all dopaminergic pathways and is not restricted to the basal ganglia. These dopaminergic deficits are pervasive and appear to be developmental in origin, which suggested that they contribute to the characteristic neuropsychiatric manifestations of the disease. These studies were done with positron-emission tomography (PET) with the tracer fluorodopa-F18. This tracer, an analog of dopa, is a large, neutral amino acid that is transported into presynaptic neurons, where it is converted by the enzyme dopa decarboxylase ({107930}) into fluorodopamine F18, which subsequently enters catecholamine-storage vesicles. Hence, data obtained with the use of fluorodopa-F18 and PET reflect dopa decarboxylase activity and dopamine-storage processes. In an accompanying editorial, {30:Nyhan and Wong (1996)} commented on the new findings and reviewed the normal function of HPRT with a diagram.\n\n{5:Ceballos-Picot et al. (2009)} demonstrated that HPRT deficiency influences early developmental processes controlling the dopaminergic phenotype. Microarray methods and quantitative PCR were applied to 10 different HPRT-deficient sublines derived from the hybrid MN9D cell line, derived from somatic fusion of embryonic mouse primary midbrain dopaminergic neurons with a mouse neuroblastoma line. There were consistent increases in mRNAs for engrailed-1 (EN1; {131290}) and -2 (EN2; {131310}), transcription factors known to play a role in the specification and survival of dopamine neurons. The increases in mRNAs were accompanied by increases in engrailed proteins, and restoration of HPRT reverted engrailed expression towards normal levels. The functional relevance of the abnormal developmental molecular signature of the HPRT-deficient MN9D cells was evident in impoverished neurite outgrowth when the cells were forced to differentiate chemically. These abnormalities were also seen in HPRT-deficient sublines from the SK-N-BE(2)-M17 human neuroblastoma line, and overexpression of engrailed was documented in primary fibroblasts from patients with Lesch-Nyhan disease. {5:Ceballos-Picot et al. (2009)} concluded that HPRT deficiency may affect dopaminergic neurons by influencing early developmental mechanisms.\n\n{6:Cristini et al. (2010)} examined the effect of HPRT deficiency on the differentiation of neurons in human neural stem cells (NSCs) isolated from human Lesch-Nyhan disease fetal brain. LNS NSCs demonstrated aberrant expression of several transcription factors and DA markers, and HPRT-deficient dopaminergic neurons demonstrated a striking deficit in neurite outgrowth. Exposure of the LNS NSCs to retinoic acid medium elicited the generation of dopaminergic neurons. The authors concluded that neurogenesis is aberrant in LNS NSCs and suggested a role for HPRT in neurodevelopment."
6,Diagnosis,"<Subhead> Prenatal Diagnosis\n\n{11:Fujimoto et al. (1968)} presented evidence that the disease can be recognized in the fetus well before 20 weeks, i.e., within the limit for elective abortion. The method used was an autoradiographic test for HPRT activity, applied to cells obtained by amniocentesis. {3:Boyle et al. (1970)} made the prenatal diagnosis and performed therapeutic abortion. {14:Gibbs et al. (1984)} showed that by ultramicroassay of HPRT it is possible to diagnose the Lesch-Nyhan syndrome on the basis of chorionic villi sampled at 8-9 weeks of gestation.\n\n{15:Graham et al. (1996)} investigated 15 pregnancies at risk for Lesch-Nyhan syndrome between 8 and 17 weeks' gestation by measurement of HPRT and APRT ({102600}) enzyme activities in chorionic villus samples (cultured and uncultured) or in cultured amniotic fluid cells. Ten pregnancies had normal enzyme levels and a normal outcome, while a further 2 predicted to be normal miscarried later in the pregnancy. Three pregnancies had low levels of residual HPRT activity in chorionic villi. Comparable levels of residual activity in the index case in 2 pregnancies and in cells from the abortus in the third case confirmed that the pregnancies were indeed affected."
7,Molecular Genetics,"For a discussion of the molecular defects involved in Lesch-Nyhan syndrome, see the HPRT1 gene ({308000})."
8,Genotype/Phenotype Correlations,"{24:Madeo et al. (2019)} described the association between HPRT mutations and clinical findings in 76 French and Italian patients, including 52 with LNS from 47 families, 19 with the LNS neurologic variant from 12 families, and 12 with HPRT-related hyperuricemia (HRH) from 6 families. They found that overall, patients with missense mutations had significantly later onset of disease and less frequent neurologic manifestations or self-injurious behavior compared to patients with other mutation types including nonsense, deletion, splicing, or complex gene rearrangements. Mutation type was not associated with the development of gout or nephropathy. Intrafamilial phenotypic variability was seen in a few families."
9,History,{22:Lesch and Nyhan (1964)} described the disorder that bears their names on the basis of 2 brothers. {31:Nyhan (1997)} gave an account of the recognition of the syndrome as an inborn error of purine metabolism.\n\n{33:Preston (2007)} provided a popular description of the discovery of the disorder and what the study of a rare disorder such as this can tell us about human behavior.


## Answer the following questions:##
Input your answer in the cell below each question and press SHIFT+ENTER.

2. Besides having high levels of urate, what observations are important for diagnosing Lesch-Nyhan Syndrome (LNS)? (read under "Clinical Features")


Answer here

3. Changes in the levels of what neurotransmitter is thought to lead to the neurological problems of Lesch-Nyhan Syndrome (read under “Pathogenesis”)?


Answer here

4. What type of inheritance pattern is seen with HPRT1? (See under "Inheritance")

Answer here

5. Are males or females more likely to get Lesch-Nyhan disease?  Please explain your answer.


Answer here

## Obtaining the ID number (called mim number) associated with the HPRT1 gene entry in OMIM

In [10]:
%%R # This must precede all R code in Colab, to allow R code to run 

# To access OMIM, we will use this key which will work as our password to access the database
set_key('4PUvWRqSSD2BuprIVAP_VQ') 

# Write HPRT1 in the parenthesis below to obtain the entry
gene_to_omim('HPRT1', show_query=TRUE)


R[write to console]: https://api.omim.org/api/entry/search?search=gene_symbol:HPRT1&include=geneMap&apiKey=4PUvWRqSSD2BuprIVAP_VQ



[1] "300323" "300322"


Now, the link above is a file with our information. Let's download and parse it.

In [14]:
%%R
# Saving the file:

# Paste the url between the quotes below
url <- '#####'

# Set destination (write the file path and a file name followed by .xml)
destfile <- '/content/drive/MyDrive/Colab_Notebooks/hprt_project_files/NAME THIS FILE.xml'

# Download the file
download.file(url, destfile)


R[write to console]: trying URL 'https://api.omim.org/api/entry/search?search=gene_symbol:HPRT1&include=geneMap&apiKey=4PUvWRqSSD2BuprIVAP_VQ'

R[write to console]: downloaded 2086 bytes




In [15]:
%%R
# Find the file in your Drive and copy and paste the path below
# Parse the file for readability
result <- xmlParse(file = 'WHERE IS THE FILE?')

# Read the file
read <- read_xml('WHERE IS THE FILE?' )

# Find the mim number
num <- xml_find_all(read, ".//mimNumber")

# Display the number
num

{xml_nodeset (4)}
[1] <mimNumber>308000</mimNumber>
[2] <mimNumber>308000</mimNumber>
[3] <mimNumber>308000</mimNumber>
[4] <mimNumber>308000</mimNumber>


Write down the mim number for the entry from the results above.

Answer here

## Using OMIM to obtain more information about the gene
This time you will search the OMIM 'Casp1' entry for information.

The function 'get_omim' helps you do just that since we can set certain arguments to 'TRUE' and obtain specific information about the entry.

Run the next cell to see a list of Arguments that you can access.

In [None]:
%%R
help(get_omim)

### Read information about allelic variants 
An allele is a variant of a gene were the DNA sequence differs between two or more variants. 

Allelic variation describes the presence or number of different allele forms at a particular locus (locus or loci = place) on a chromosome.

REF:  
* https://warwick.ac.uk/fac/sci/lifesci/research/vegin/geneticimprovement/diversitycollection/allelicvariation/


In [None]:
# Search OMIM again but with HGPRT1 mim Number to obtain description info
%%R
set_key('KEY HERE')

# Set allelicVariantList to TRUE
omim_result <- get_omim(####, ### = ###)

saveXML(omim_result, file='NAME THIS FILE.xml')

### Display the results in the form of a table

In [None]:
#@title Load the results by providing the file name in this form (include file extension .xml)

# MAKING RESULTS LOOK GOOD
import xml.etree.ElementTree as ET
import csv
import pandas as pd

file_name = "" #@param {type:"string"}

tree = ET.parse(file_name)
root = tree.getroot()
 
Ref_data4 = open('refdata4.csv', 'w')
 
csvwriter = csv.writer(Ref_data4)
allele_head = []

 
count = 0
for member in root.findall('.//allelicVariant'):
    allele = []
    ref_list = []

    if count == 0:
      des = member.find('.//mutations').tag
      allele_head.append(des)
      
      mut = member.find('.//text').tag
      allele_head.append(mut)
     
      csvwriter.writerow(allele_head)
      count = count + 1
       
    ementa = getattr(member.find('.//mutations'), 'text', None)
    allele.append(ementa)
    
    #des = member.find('.//mutations').text
    #allele.append(des)
    
    ementa2 = getattr(member.find('.//text'), 'text', None)
    allele.append(ementa2)

    #mut = member.find('.//text').text
    #allele.append(mut)

    csvwriter.writerow(allele)
  
Ref_data4.close()

data4= pd.read_csv("refdata4.csv")
pd.set_option('display.max_colwidth',10000)

pd.set_option('display.max_rows', None)
pd.set_option('display.max_columns', None)
pd.set_option('display.width', None)
pd.set_option('display.max_colwidth', -1)
data4



## Answer the following questions:##
Input your answer in the cell below each question and press SHIFT+ENTER.

Find the entry for your patient’s mutation.

6. Why are there two different ways of numbering Asp 193(4)?


Answer here

7. Write the first author and year for two articles that describe patients with the D193(4)N mutation.


Answer here

## Find references related to the  D193(4)N mutation


Now set referenceList to TRUE

In [None]:
%%R
set_key('KEY HERE') # The key must be added before every request

# Using mim number to get the article list
# Write the mim number inside the parenthesis and set 'referenceList' to 'TRUE'
omim_result <- get_omim(###, ### = ###)

# Save the results as an XML file
saveXML(omim_result, file='NAME THIS FILE.xml') # Write 'references_omim_phen.xml' as the file name

# File name will display was output for this cell


### Display the results in the form of a table

In [None]:
#@title Load the results by providing the file name in this form (include file extension .xml)

# MAKING RESULTS LOOK GOOD
import xml.etree.ElementTree as ET
import csv
import pandas as pd

file_name = '' #@param {type:"string"}


tree = ET.parse(file_name)
root = tree.getroot()
 
Ref_data = open('refdata.csv', 'w')
 
csvwriter = csv.writer(Ref_data)
authors_head = []

 
count = 0
for member in root.findall('.//reference'):
    authors = []
    ref_list = []
    if count == 0:
        author = member.find('.//authors').tag
        authors_head.append(author)

        title = member.find('.//title').tag
        authors_head.append(title)

        source = member.find('.//source').tag
        authors_head.append(source)
        
        pubmed = member.find('.//pubmedID').tag
        authors_head.append(pubmed)
       
        csvwriter.writerow(authors_head)
        count = count + 1
    
    author = member.find('.//authors').text
    authors.append(author) 
    title = member.find('.//title').text
    authors.append(title)
    source = member.find('.//source').text
    authors.append(source)

    #pubmed = member.find('.//pubmedID')
    ementa = getattr(member.find('.//pubmedID'), 'text', None)
    authors.append(ementa)
         
    csvwriter.writerow(authors)
 
 
Ref_data.close()

data= pd.read_csv("refdata.csv")
data

Write down the PubMed ID of the references you selected.

Answer here

## Reading the abstract from one of the articles

In [None]:
from Bio import Entrez
# Searching for the abstract in the database Pubmed
Entrez.email = 'YOUR EMAIL HERE'

query7 = Entrez.efetch(db='####', id = '####', rettype = '####', retmode = 'text')
# Hint for rettype (retrieval type): you want to retrieve the abstract

# Reading the query and printing it
print(query7.read())

# Closing the query
query7.close()

## Answer the following questions:
Input your answer in the cell below each question and press SHIFT+ENTER.

1. What assay is being described in the Gibbs et al. (1990) article?

Answer here