# Cell Types Project
*Comparing gene expression profiles and electrophysiological features of excitatory and inhibitory neurons.*

## Team Member Names & Contributions
Group 13  
Joanna A16980082  
Lorin A16982359

*Fill in your 3-4 sentence abstract here*

## Research Question

We aim to compare inhibitory and excitatory neurons across several modalities. Namely, we analyze differences in gene expression profiles and electrophysiological features.\
For the gene expression data, we focus on cells that highly express GABA- and glutamate-related genes as proxies for inhibitory and excitatory cells, respectively.\
Here we:
1) Compare the brain regions where these two neurotransmitters are most prominent by finding where their related genes are most highly expressed;
2) Identify which other genes are most highly expressed alongside the GABA- and glutamate-related genes, to establish how gene expression differs between inhibitory and excitatory cells.

## Background and Prior Work

*Fill in your background and prior work here (~500 words). You are welcome to use additional subheadings. You should also include a paragraph describing each dataset and how you'll be using them.* 

Depression is a complex psychiatric disorder that significantly affects mood, cognition, and behavior. The condition is often associated with disruptions in neurotransmitter systems, particularly the balance between excitatory (glutamate) and inhibitory (GABA) signaling. Investigating the molecular and electrophysiological profiles of neurons and glial cells can provide insights into the mechanisms underlying depression and identify potential therapeutic targets.

Glutamate and GABA are the brain’s primary excitatory and inhibitory neurotransmitters, respectively. The glutamate-GABA balance is essential for maintaining proper neural network activity. In depression, dysregulation of **glutamate** receptors, such as NMDA and AMPA receptors, can lead to excitotoxicity and impair synaptic plasticity. Reduced **GABAergic** signaling has been linked to increased stress responses and impaired inhibitory control in neural circuits.

These disruptions are particularly evident in brain regions such as the hippocampus and prefrontal cortex, which are critical for emotion regulation and cognitive function.

Electrophysiological properties, such as membrane potential and firing rates, provide functional insights into the activity of neurons in depressive states. By analyzing these markers in conjunction with gene expression data, we can identify potential molecular pathways that mediate depression-related changes in brain function.

### Datasets

1. Brain Region-Specific Gene Expression Data
   - Source: `brainarea_vs_genes_exp_w_reannotations.tsv`
   - Description: This dataset contains gene expression profiles across multiple brain regions, allowing for the identification of region-specific patterns of glutamate and GABA pathway-related genes.

2. Electrophysiology Data
   - Source: Allen Institute Cell Types Data
   - Description: Provides electrophysiological properties of human brain cells, including firing rates, membrane potentials, and upstroke-to-downstroke ratios. These features can be linked to functional changes in excitatory and inhibitory signaling.

Prior studies have shown that increased glutamate activity and decreased GABAergic signaling are hallmarks of depression. For example, ketamine’s antidepressant effects are thought to involve modulation of NMDA receptor activity. In addition, neuroinflammation has been implicated in the dysregulation of glutamate and GABA systems, with cytokines like IL-6 and TNF-alpha affecting synaptic transmission.

### References (include links):
(1) Hu, Y.T., Tan, Z.L., Hirjak, D., et al. Brain-wide changes in excitation-inhibition balance of major depressive disorder: a systematic review of topographic patterns of GABA- and glutamatergic alterations. Mol Psychiatry 28, 3257–3266 (2023). https://doi.org/10.1038/s41380-023-02193-x

(2) Lener, M.S., et al. Glutamate and Gamma-Aminobutyric Acid Systems in the Pathophysiology of Major Depression and Antidepressant Response to Ketamine. Biological Psychiatry (2016). https://doi.org/10.1016/j.biopsych.2016.05.005

## Hypothesis


*Fill in your hypotheses here*

## Setup
*Are there packages that need to be imported, or datasets that need to be downloaded?*

In [11]:
# Python packages that need to be imported

import pandas as pd
from scipy import stats
import matplotlib as mpl
import matplotlib.pyplot as plt
import numpy as np

In [14]:
# For comparing gene expression profiles we use the brainarea_vs_genes_exp_w_reannotations.tsv
# data file from homework a1-ConditionallyExpressed.

gene_df = pd.read_csv('brainarea_vs_genes_exp_w_reannotations.tsv', delimiter = '\t')
gene_df = gene_df.set_index('gene_symbol')
gene_df.head()

gene_symbol,CA1 field,CA2 field,CA3 field,CA4 field,"Crus I, lateral hemisphere","Crus I, paravermis","Crus II, lateral hemisphere","Crus II, paravermis",Edinger-Westphal nucleus,Heschl's gyrus,...,"temporal pole, inferior aspect","temporal pole, medial aspect","temporal pole, superior aspect",transverse gyri,trochlear nucleus,tuberomammillary nucleus,ventral tegmental area,ventromedial hypothalamic nucleus,vestibular nuclei,zona incerta
A1BG,0.856487,-1.773695,-0.678679,-0.986914,0.826986,0.948039,0.935427,1.120774,-1.018554,0.170282,...,0.27783,0.514923,0.733368,-0.104286,-0.910245,1.03961,-0.155167,-0.444398,-0.901361,-0.23679
A1BG-AS1,0.257664,-1.373085,-0.619923,-0.636275,0.362799,0.353296,0.422766,0.346853,-0.812015,0.903358,...,1.074116,0.821031,1.219272,0.901213,-1.522431,0.598719,-1.709745,-0.054156,-1.695843,-1.155961
A1CF,-0.089614,-0.546903,0.282914,-0.528926,0.507916,0.577696,0.647671,0.306824,0.089958,0.14982,...,-0.030265,-0.187367,-0.428358,-0.465863,-0.136936,1.229487,-0.11068,-0.118175,-0.139776,0.123829
A2M,0.552415,-0.635485,-0.954995,-0.259745,-1.687391,-1.756847,-1.640242,-1.73311,-0.091695,0.003428,...,-0.058505,0.207109,-0.161808,0.18363,0.948098,-0.977692,0.911896,-0.499357,1.469386,0.557998
A2ML1,0.758031,1.549857,1.262225,1.33878,-0.289888,-0.407026,-0.358798,-0.589988,0.944684,-0.466327,...,-0.472908,-0.598317,-0.247797,-0.282673,1.396365,0.945043,0.158202,0.572771,0.073088,-0.88678
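
The two gene expression questions from the Research Question section could be approached along the following lines. This is only a sketch under stated assumptions: the marker lists (`GABA_MARKERS`, `GLUTAMATE_MARKERS`), the helper names `top_regions` and `coexpressed_genes`, and the toy table are illustrative choices, not part of the dataset or of an analysis we have already run. The helpers expect a table shaped like `gene_df` (genes as rows, brain regions as columns).

```python
import pandas as pd

# Assumed marker genes (illustrative; adjust to the genes actually chosen).
GABA_MARKERS = ["GAD1", "GAD2", "SLC32A1"]
GLUTAMATE_MARKERS = ["SLC17A7", "SLC17A6", "GLS"]


def top_regions(gene_df, markers, n=5):
    """Rank brain regions by the mean expression of the markers found in gene_df."""
    present = [g for g in markers if g in gene_df.index]
    mean_by_region = gene_df.loc[present].mean(axis=0)
    return mean_by_region.sort_values(ascending=False).head(n)


def coexpressed_genes(gene_df, markers, n=10):
    """Rank non-marker genes by Pearson correlation with the mean marker profile."""
    present = [g for g in markers if g in gene_df.index]
    profile = gene_df.loc[present].mean(axis=0)   # one value per region
    others = gene_df.drop(index=present)          # every remaining gene
    # Transpose so genes are columns, then correlate each column with the profile.
    corr = others.T.corrwith(profile)
    return corr.sort_values(ascending=False).head(n)


# Toy stand-in for gene_df; all values and the GENE_X / GENE_Y names are made up.
toy_gene_df = pd.DataFrame(
    {
        "region_A": [2.0, 1.5, -1.0, -1.2, 1.8, -0.9],
        "region_B": [-1.0, -0.5, 2.0, 1.6, -0.7, 1.9],
        "region_C": [0.1, 0.2, 0.0, 0.3, 0.0, 0.1],
    },
    index=["GAD1", "GAD2", "SLC17A7", "SLC17A6", "GENE_X", "GENE_Y"],
)
toy_gene_df.index.name = "gene_symbol"

print(top_regions(toy_gene_df, GABA_MARKERS, n=1))
print(coexpressed_genes(toy_gene_df, GLUTAMATE_MARKERS, n=1))
```

On the real data, the same calls would take `gene_df` in place of `toy_gene_df`. Markers missing from the table are skipped rather than raising an error, so it is worth checking which markers were actually found before interpreting the ranking.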


In [15]:
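# For electrophysiology data we use the Allen Institute Cell Types data.

from allensdk.core.cell_types_cache import CellTypesCache
from allensdk.api.queries.cell_types_api import CellTypesApi

# Cache downloaded data locally; the manifest file tracks what has been fetched.
ctc = CellTypesCache(manifest_file='cell_types/manifest.json')

# Metadata for human cells only, indexed by cell id.
human_cells = ctc.get_cells(species=[CellTypesApi.HUMAN])
human_df = pd.DataFrame(human_cells)
human_df = human_df.set_index('id')

# Precomputed electrophysiology features, indexed by specimen id.
ephys_features = pd.DataFrame(ctc.get_ephys_features()).set_index('specimen_id')

# Left join on the index: keeps the human cells and attaches their ephys features.
human_ephys_df = human_df.join(ephys_features)
human_ephys_df.head()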

id,reporter_status,cell_soma_location,species,name,structure_layer_name,structure_area_id,structure_area_abbrev,transgenic_line,dendrite_type,apical,...,trough_t_ramp,trough_t_short_square,trough_v_long_square,trough_v_ramp,trough_v_short_square,upstroke_downstroke_ratio_long_square,upstroke_downstroke_ratio_ramp,upstroke_downstroke_ratio_short_square,vm_for_sag,vrest
525011903,,"[273.0, 354.0, 216.0]",Homo Sapiens,H16.03.003.01.14.02,3,12113,FroL,,spiny,intact,...,4.134987,1.375253,-53.968754,-59.51042,-71.197919,2.895461,2.559876,3.099787,-88.843758,-70.561035
528642047,,"[69.0, 254.0, 96.0]",Homo Sapiens,H16.06.009.01.02.06.05,5,12141,MTG,,aspiny,,...,,1.05116,-67.468758,,-70.875002,1.891881,,1.989616,-101.0,-69.20961
537256313,,"[322.0, 255.0, 92.0]",Homo Sapiens,H16.03.006.01.05.02,4,12141,MTG,,spiny,truncated,...,5.694547,1.3899,-52.125004,-51.520836,-72.900002,3.121182,3.464528,3.054681,-87.53125,-72.628105
519832676,,"[79.0, 273.0, 91.0]",Homo Sapiens,H16.03.001.01.09.01,3,12141,MTG,,spiny,truncated,...,9.96278,1.21102,-53.875004,-52.416668,-73.693753,4.574865,3.817988,4.980603,-84.218758,-72.547661
596020931,,"[66.0, 220.0, 105.0]",Homo Sapiens,H17.06.009.11.04.02,4,12141,MTG,,aspiny,,...,14.66734,1.336668,-63.593754,-63.239583,-75.518753,1.45289,1.441754,1.556087,-82.53125,-74.260269
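
One possible way to compare electrophysiological features between putative excitatory and inhibitory cells is sketched below. It rests on an assumption we would need to justify: that `dendrite_type` can serve as a proxy, with spiny cells treated as excitatory and aspiny cells as inhibitory. The helper name `compare_feature` is hypothetical, Welch's t-test is just one reasonable choice of test, and the toy table reuses only the five preview rows shown above, so its result is not meaningful.

```python
import pandas as pd
from scipy import stats


def compare_feature(ephys_df, feature, group_col="dendrite_type",
                    groups=("spiny", "aspiny")):
    """Compare one ephys feature between two groups with Welch's t-test."""
    first, second = groups
    # Select each group's values and drop cells where the feature is missing.
    a = ephys_df.loc[ephys_df[group_col] == first, feature].dropna()
    b = ephys_df.loc[ephys_df[group_col] == second, feature].dropna()
    # equal_var=False gives Welch's test, which does not assume equal variances.
    t_stat, p_value = stats.ttest_ind(a, b, equal_var=False)
    return {
        f"n_{first}": len(a),
        f"n_{second}": len(b),
        f"mean_{first}": float(a.mean()),
        f"mean_{second}": float(b.mean()),
        "t": float(t_stat),
        "p": float(p_value),
    }


# Toy stand-in built from the five preview rows; far too small for inference.
toy_ephys_df = pd.DataFrame({
    "dendrite_type": ["spiny", "aspiny", "spiny", "spiny", "aspiny"],
    "upstroke_downstroke_ratio_long_square": [
        2.895461, 1.891881, 3.121182, 4.574865, 1.45289,
    ],
})

result = compare_feature(toy_ephys_df, "upstroke_downstroke_ratio_long_square")
print(result)
```

On the full table, the call would take `human_ephys_df` instead of `toy_ephys_df`. Rows whose `dendrite_type` is neither of the two requested labels are ignored by this helper.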


## Data Wrangling

Describe your data wrangling steps here.

In [4]:
## YOUR CODE HERE
## FEEL FREE TO ADD MULTIPLE CELLS PER SECTION

## Data Analysis & Results

Include cells that describe the steps in your data analysis.

In [5]:
## YOUR CODE HERE
## FEEL FREE TO ADD MULTIPLE CELLS PER SECTION

## Conclusion & Discussion

*Fill in your discussion information here*