# Summer Co-Op 2020: Wasserman Lab  
### Cis-Regulatory Mechanisms of Escape from X-Chromosome Inactivation  
*Centre for Molecular Medicine and Therapeutics (CMMT)*  
*Department of Medical Genetics at the University of British Columbia (UBC)*

# Glossary:
> **1. Cis-Regulatory Elements (CREs):** Regions of non-coding DNA that regulate the transcription of neighbouring genes (e.g. *promoters, enhancers, silencers, and operators*).  
> **2. Transcription Factors (TFs):** Proteins that bind to specific DNA sequences via DNA-binding domains and regulate transcription.  
> **3. X-Chromosome Inactivation:** Process in which one X-chromosome in each female somatic cell is randomly chosen for inactivation and most of its genes are silenced.  
> **4. ACE2:** Gene that encodes angiotensin-converting enzyme 2, which acts as a functional receptor for the spike glycoprotein of the human coronavirus SARS-CoV-2.  

# Background:
## X-Chromosome Inactivation and Escape:  
> During early mammalian development, each female somatic cell randomly inactivates one of its X-chromosomes (Xi) to balance gene expression between the sexes. While most genes on the inactivated X-chromosome are silenced, 15–25% are known to escape X-inactivation (termed escapees). These genes therefore retain dosage differences between males and females and are thought to contribute to sex-dependent phenotypic variability.  
> The initial silencing of ChrX is governed mainly by XIST (X-inactive specific transcript), a long non-coding RNA (lncRNA) located at the X-inactivation center (XIC). Together with neighbouring ncRNAs (e.g., FTX and JPX), XIST begins the process of X-inactivation.  
> The XIST gene is transcribed exclusively from Xi, not from the active X-chromosome (Xa), and its RNA products act in cis by coating the chromosome within a restricted chromosomal territory. XIC genes recruit the assembly of silent chromatin on the chromosome from which they are expressed, which results in irreversible heterochromatinization.  
> Various methods are used to identify escapees. For example, loci on Xi with low methylation levels have been proposed as indicators of escapee genes. Furthermore, escapees are enriched on the p-arm, which comprises evolutionarily young segments that diverged more recently from ChrY.
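
The methylation-based indicator described above can be sketched in Python. This is only a minimal illustration under stated assumptions, not the method of any referenced study: the gene labels, the beta values, and the 0.3 cutoff are all hypothetical and chosen just to show the idea of flagging loci with low methylation on Xi.

```python
from statistics import mean

# Hypothetical cutoff (an assumption for illustration only): loci whose mean
# methylation on Xi falls below this value are flagged as candidate escapees.
LOW_METHYLATION_CUTOFF = 0.3


def candidate_escapees(xi_methylation, cutoff=LOW_METHYLATION_CUTOFF):
    """Return the genes whose mean Xi methylation is below the cutoff."""
    flagged = []
    for gene, beta_values in xi_methylation.items():
        # Genes without any measurement are skipped rather than guessed.
        if beta_values and mean(beta_values) < cutoff:
            flagged.append(gene)
    return sorted(flagged)


# Made-up beta values (0 = unmethylated, 1 = fully methylated) for placeholder genes.
example = {
    "GENE_A": [0.05, 0.10, 0.08],
    "GENE_B": [0.85, 0.90, 0.78],
    "GENE_C": [0.20, 0.35, 0.25],
}
print(candidate_escapees(example))
```

In practice a single cutoff is a simplification; the point is only that low methylation on Xi is treated as evidence for escape, not as proof.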

## Eukaryotic Gene Regulation:
> Gene regulation is the process of controlling which genes in a cell's DNA are expressed and what functional products are ultimately made. Different cell types have different sets of active genes, even though almost all cells in the body contain the same DNA. Cells of the same type might also express different genes depending on their environment and internal state.  
> Eukaryotic gene expression can be regulated at many levels:  
1. Chromatin accessibility  
2. Transcription  
3. RNA processing  
4. RNA stability  
5. Translation  
6. Protein activity  
>
> Transcription factors are proteins that regulate the transcription of genes by facilitating or hindering the binding of RNA polymerase to the **promoter** of a gene. Each transcription factor binds DNA at a specific target sequence. 
1. Some transcription factors activate transcription and are called **activators**. Clusters of activator binding sites located far from the gene they regulate are known as **enhancers**.  
2. Other transcription factors repress transcription and are called **repressors**. Clusters of repressor binding sites located far from the gene they regulate are known as **silencers**.  
3. **Tissue-specific enhancers/silencers** control a gene's expression in a certain part of the body.  
![Transcription Factors of Eukaryotic Cells](https://upload.wikimedia.org/wikipedia/commons/8/80/Transcription_Factors.svg)  
![Eukaryotic Protein-Coding Gene](https://upload.wikimedia.org/wikipedia/commons/5/54/Gene_structure_eukaryote_2_annotated.svg)  
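
Because a transcription factor recognises a specific target sequence, candidate binding sites can be located by scanning DNA for a consensus motif. The sketch below is a simplified illustration: it uses the E-box consensus `CANNTG` as an example motif, the toy sequence is made up, and only the given strand is scanned. Real analyses typically rely on position weight matrices rather than exact consensus matching.

```python
import re

# IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def find_sites(sequence, consensus):
    """Return 0-based start positions where the consensus matches the sequence."""
    pattern = "".join(IUPAC[base] for base in consensus.upper())
    # The lookahead reports overlapping matches as well as disjoint ones.
    return [m.start() for m in re.finditer(f"(?=({pattern}))", sequence.upper())]


# Toy sequence (made up) containing two E-box-like sites.
toy_sequence = "GGCACGTGTTACAGCTGAA"
print(find_sites(toy_sequence, "CANNTG"))  # [2, 11]
```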

## Genomic Databases and Analysis Tools:
>***The UCSC Genome Browser:*** A web-based viewer for genome sequence data and annotations.  
>1. GeneHancer track from GeneCards: Relates regulatory elements (enhancers and promoters) to their inferred interactions with target genes, based on ENCODE, Ensembl, FANTOM5, VISTA, dbSUPER, EPDNew, and UCNEbase.  
>2. ENCODE Transcription Factor Binding Site Peaks and Clusters track sets: Show regions of transcription factor binding based on a large collection of ChIP-seq experiments performed by the ENCODE project. The tracks are organized into two types: the **Peaks** set contains the underlying ChIP-seq peaks and can optionally be filtered by cell type or transcription factor, while the **Clusters** track provides a summary display of the occupancy regions of each factor.  
>
> Many non-coding regions of the DNA host a variety of *cis*-regulatory regions that control gene expression. The **ENCODE** project aims to identify all functional elements in the human genome.
> The activity and expression of protein-coding genes can be modulated by the regulome, a variety of DNA elements such as promoters, transcriptional regulatory sequences, and regions of chromatin structure and histone modification. The primary assays used in ENCODE are ChIP-seq, DNase I hypersensitivity, RNA-seq, and assays of DNA methylation.  
> Conserved non-coding sequences often contain regulatory regions. Similarly, DNase I hypersensitivity analysis specifies regions likely to contain regulatory elements. However, the most commonly used method for identifying transcription factor binding sites is chromatin immunoprecipitation (ChIP).
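
Filtering ChIP-seq peaks by genomic region, transcription factor, or cell type, as the Peaks track allows, can be sketched as follows. This is an illustrative assumption-laden example rather than the browser's implementation: the `Peak` record, the factor and cell-type labels, and every peak coordinate are hypothetical, and all coordinates are treated as 1-based and inclusive. The query interval is the ACE2 region that appears in the Ensembl link in the ACE2 section of these notes.

```python
from typing import NamedTuple


class Peak(NamedTuple):
    """Hypothetical record for one ChIP-seq peak."""
    chrom: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive
    factor: str
    cell_type: str


def overlapping_peaks(peaks, chrom, start, end, factor=None, cell_type=None):
    """Return peaks overlapping chrom:start-end, optionally filtered by factor or cell type."""
    hits = []
    for peak in peaks:
        # Skip peaks on another chromosome or lying entirely outside the interval.
        if peak.chrom != chrom or peak.end < start or peak.start > end:
            continue
        if factor is not None and peak.factor != factor:
            continue
        if cell_type is not None and peak.cell_type != cell_type:
            continue
        hits.append(peak)
    return hits


# Made-up peaks; the labels TF_A, TF_B, cell_1, and cell_2 are placeholders.
peaks = [
    Peak("chrX", 15560900, 15561100, "TF_A", "cell_1"),
    Peak("chrX", 15590000, 15590250, "TF_B", "cell_1"),
    Peak("chrX", 15700000, 15700300, "TF_A", "cell_2"),
    Peak("chr1", 15590000, 15590250, "TF_A", "cell_1"),
]

for hit in overlapping_peaks(peaks, "chrX", 15561033, 15602148, factor="TF_A"):
    print(hit)
```

Real peak files are often in BED format, which uses 0-based, half-open intervals, so coordinates would need converting before a comparison like this one.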

## ACE2 Gene:
> The protein encoded by this gene belongs to the angiotensin-converting enzyme family of dipeptidyl carboxydipeptidases. It is an important regulator of cardiovascular and renal function and acts as a receptor for the spike glycoprotein of the human coronaviruses SARS and HCoV-NL63.  
>
> **Location**: Xp22.2  
1. HGNC: [Symbol report for ACE2](https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/HGNC:13557)  
2. Ensembl: [ENSG00000130234](https://uswest.ensembl.org/Homo_sapiens/Gene/Summary?g=ENSG00000130234;r=X:15561033-15602148)  
3. GeneCards: [GC0XM015494](https://www.genecards.org/cgi-bin/carddisp.pl?gene=ACE2&keywords=gh0xj015579#summaries)  
4. NCBI: [Gene ID: 59272](https://www.ncbi.nlm.nih.gov/gene/59272)  
5. Alliance of Genome Resources: [HGNC: 13557](https://www.alliancegenome.org/gene/HGNC:13557)  
6. UCSC: [Human Gene ACE2 (ENST00000427411.1) Description and Page Index](http://genome.cse.ucsc.edu/cgi-bin/hgGene?org=Human&hgg_chrom=none&hgg_type=knownGene&hgg_gene=uc004cxb.2)  
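
The Ensembl link above embeds the gene's region as `X:15561033-15602148`. A small helper for turning such a region string into numeric coordinates might look like the sketch below; the function name `parse_region` is hypothetical, and the coordinates are assumed to be 1-based and inclusive, as displayed by Ensembl.

```python
import re


def parse_region(region):
    """Split 'chrom:start-end' into (chrom, start, end) with integer coordinates."""
    # An optional 'chr' prefix and thousands separators in the numbers are tolerated.
    match = re.fullmatch(r"(?:chr)?(\w+):([\d,]+)-([\d,]+)", region.strip())
    if match is None:
        raise ValueError(f"Unrecognised region string: {region!r}")
    chrom = match.group(1)
    start = int(match.group(2).replace(",", ""))
    end = int(match.group(3).replace(",", ""))
    if start > end:
        raise ValueError(f"Start is greater than end in region: {region!r}")
    return chrom, start, end


chrom, start, end = parse_region("X:15561033-15602148")
# With inclusive coordinates the span is end - start + 1 bases.
print(chrom, start, end, end - start + 1)
```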

## Coronavirus Disease 2019 (COVID-19):
> Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infects cells when its spike (S) glycoprotein binds angiotensin-converting enzyme 2 (ACE2) on host cells.  
> Symptoms of this infectious respiratory disease include fever, cough, sputum production, fatigue, shortness of breath, and loss of smell and taste. Some progress to viral pneumonia, acute respiratory distress syndrome (ARDS), multi-organ failure, or cytokine storm.  
> The lungs are the organs most affected by COVID-19, since the virus enters host cells via angiotensin-converting enzyme 2 (ACE2), which is most abundant in type II alveolar cells of the lungs. The virus also affects gastrointestinal organs, the cardiovascular system, and the kidneys. COVID-19 is associated with vasocontractile responses, thrombosis, and decreased oxygenation.  
> Although SARS-CoV-2 has a tropism for ACE2-expressing epithelial cells of the respiratory tract, patients with severe COVID-19 have symptoms of systemic hyperinflammation. Infected patients have elevated levels of IL-2, IL-7, IL-6, granulocyte-macrophage colony-stimulating factor (GM-CSF), interferon-γ inducible protein 10 (IP-10), monocyte chemoattractant protein 1 (MCP-1), macrophage inflammatory protein 1-α (MIP-1α), and tumour necrosis factor-α (TNF-α). Classic serum biomarkers of cytokine release syndrome (CRS) include elevated C-reactive protein (CRP), lactate dehydrogenase (LDH), D-dimer, and ferritin.  

# References:
1. [Human genes escaping X-inactivation revealed by single cell expression data](https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-019-5507-6)  
2. [Carolyn J. Brown, PhD](https://medgen.med.ubc.ca/person/carolyn-brown/)
3. [Overview: Eukaryotic gene regulation](https://www.khanacademy.org/science/biology/gene-regulation/gene-regulation-in-eukaryotes/a/overview-of-eukaryotic-gene-regulation)  
4. [Transcription factors](https://www.khanacademy.org/science/biology/gene-regulation/gene-regulation-in-eukaryotes/a/eukaryotic-transcription-factors)  
5. [Transcription factor](https://en.wikipedia.org/wiki/Transcription_factor)  
6. [ACE2 Gene (Protein Coding)](https://www.genecards.org/cgi-bin/carddisp.pl?gene=ACE2#summaries)  
7. [The sequence of human ACE2 is suboptimal for binding the S spike protein of SARS coronavirus 2](https://www.biorxiv.org/content/10.1101/2020.03.16.994236v1)  
8. [Coronavirus disease 2019](https://en.wikipedia.org/wiki/Coronavirus_disease_2019)  
9. [ENCODE](https://en.wikipedia.org/wiki/ENCODE)  
10. [Teams Scour ACE2 Sequence, Expression Data to Search for SARS-CoV-2 Infection Clues](https://www.genomeweb.com/sequencing/teams-scour-ace2-sequence-expression-data-search-sars-cov-2-infection-clues#.Xq2p1ahKiwf)  
11. [COVID-19 Pandemic Resources at UCSC](https://genome.ucsc.edu/covid19.html)  

# UCSC Genome Browser
**ACE2 (Homo sapiens angiotensin I converting enzyme 2 (ACE2), transcript variant 2, mRNA. (from RefSeq NM_021804))**
### UCSC Default Tracks:
![Default UCSC Tracks for ACE2](attachment:hgt_genome_53def_33c420-1.png)  
##### GeneHancers around ACE2 on UCSC Golden Path with GeneCards custom track:
![hgt_genome_59d0f_341b60-1.png](attachment:hgt_genome_59d0f_341b60-1.png)  
###### Interactions around ACE2: ![FULL1-1.png](attachment:FULL1-1.png)


# Tutorials:
### Genomic Analysis:  
###### Tutorials:
• [UCSC Genome Browser](http://genome.ucsc.edu/training.html)  
• [Ensembl](http://www.ensembl.org/info/using/website/)  
• [ENCODE](https://www.genome.gov/event-calendar/ENCODE-2015-Research-Applications-Users-Meeting)  
• [Entrez Map Viewer](http://www.ncbi.nlm.nih.gov/projects/mapview/static/MapViewerHelp.html)
###### Research Articles: 
1. [Web-based resources for clinical bioinformatics](https://link-springer-com.ezproxy.library.ubc.ca/protocol/10.1007%2F978-1-60327-148-6_17)  
2. [The UCSC Genome Browser: What Every Molecular Biologist Should Know](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4142428/)  

### JupyterLab:
1. [Notebook](https://jupyterlab.readthedocs.io/en/stable/developer/notebook.html)  
2. Markdown:  
    a. [Markdown Basics](https://markdown-guide.readthedocs.io/en/latest/basics.html)  
    b. [Basic Syntax](https://www.markdownguide.org/basic-syntax/)  
    
### Python:
1. [The Python Tutorial](https://docs.python.org/3.8/tutorial/)  