# Introduction to Single-Cell RNA Sequencing

Single-cell RNA sequencing (scRNA-seq) is a cutting-edge technique that allows researchers to analyze the gene expression of individual cells in a tissue sample. Unlike traditional RNA sequencing (bulk RNA-seq), which measures the average gene expression across a population of cells, scRNA-seq provides a much more granular view by examining the transcriptome (the complete set of RNA molecules) of each single cell. This is important because cells within the same tissue or organ can have very different functions, behaviors, and gene expression profiles, even if they look similar under a microscope.

## Key Concepts of Single-cell RNA-seq

1. **Single-cell Resolution:** scRNA-seq captures gene expression data at the level of individual cells, providing insights into cellular heterogeneity that would be missed with bulk RNA-seq.

2. **Cell Type Identification:** By analyzing the gene expression profiles of individual cells, researchers can classify cells into different types, even rare or novel cell types that were previously difficult to identify.

3. **Gene Expression Profiling:** scRNA-seq allows scientists to study which genes are active in each cell, and how the expression of those genes differs across the population of cells. This helps in understanding cell functions, signaling pathways, and responses to stimuli.

4. **Transcriptomic Analysis:** The process involves isolating individual cells from a sample (such as a tissue or blood sample), extracting their RNA, and then sequencing it. This data is then analyzed to understand the functional state of each cell and how cells interact within a tissue.


## Single-Cell Sequencing Workflow

- Single-cell sequencing examines the sequence information from individual cells with optimized next-generation sequencing (NGS) technologies, providing a higher resolution of cellular differences and a better understanding of the function of an individual cell in the context of its microenvironment.
- For example, in cancer, sequencing the DNA of individual cells can give information about mutations carried by small populations of cells.
- In development, sequencing the RNAs expressed by individual cells can give insight into the existence and behavior of different cell types.
- In microbial systems, a population of the same species can appear to be genetically clonal, but single-cell sequencing of RNA or epigenetic modifications can reveal cell-to-cell variability that may help populations rapidly adapt to survive in changing environments.


#### 1. Sample Collection
- The process starts with an **environmental sample** (e.g., water, soil) or **organ tissue** (e.g., liver).

#### 2. Single Cell Isolation
Cells are isolated using one of the following techniques:
- **Laser Capture Microdissection (LCM):** A laser is used to cut and capture specific cells.
- **Fluorescence-Activated Cell Sorting (FACS):** Cells are sorted based on fluorescent markers.
- **Microfluidics:** Cells are separated in microdroplets using fluid dynamics.

#### 3. DNA Extraction
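- The DNA from the isolated single cell is extracted. (The steps shown here describe single-cell genome sequencing; in scRNA-seq, RNA is extracted instead and reverse-transcribed into cDNA before amplification.)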

#### 4. Multiple Displacement Amplification (MDA)
- This method amplifies the extracted DNA to generate enough material for sequencing.

#### 5. Library Preparation for Sequencing
- The amplified DNA is fragmented and prepared as a sequencing library.

#### 6. DNA Sequencing
- The sequencing machine reads the DNA fragments, generating raw sequence data.

#### 7. Data Analysis (SNP, CNV, and Cell Type Identification)
Bioinformatics analysis identifies:
- **Single Nucleotide Polymorphisms (SNPs):** Small genetic variations.
- **Copy Number Variations (CNVs):** Large duplications or deletions in the genome.
- **Cell Type Classification:** Determines different cell types based on genetic information.



![image.png](attachment:image.png)

# Single-Cell RNA Sequencing Analysis Workflow

## **Pre-Processing**

### 1. Raw Data Processing
- Data is obtained from single-cell sequencing experiments.
- The raw sequencing reads are processed to generate **count matrices**, where rows represent genes and columns represent cells.
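
As a rough illustration of what a count matrix is, the sketch below tallies hypothetical (cell barcode, gene) assignments into a genes-by-cells table. The barcodes, gene names, and the `build_count_matrix` helper are all invented for this example; real pipelines produce these assignments through alignment and barcode/UMI handling, which are not shown here.

```python
from collections import Counter

# Hypothetical (cell barcode, gene) pairs, one per counted molecule.
assignments = [
    ("cell_A", "GeneX"), ("cell_A", "GeneX"), ("cell_A", "GeneY"),
    ("cell_B", "GeneY"), ("cell_B", "GeneZ"), ("cell_B", "GeneZ"),
    ("cell_C", "GeneX"),
]

def build_count_matrix(assignments):
    """Return (genes, cells, matrix) with genes as rows and cells as columns."""
    counts = Counter(assignments)
    cells = sorted({cell for cell, _ in assignments})
    genes = sorted({gene for _, gene in assignments})
    matrix = [[counts[(cell, gene)] for cell in cells] for gene in genes]
    return genes, cells, matrix

genes, cells, matrix = build_count_matrix(assignments)

# Print the matrix with one gene per row.
print("gene", *cells)
for gene, row in zip(genes, matrix):
    print(gene, *row)
```

Most entries in a real count matrix are zero, which is why analysis tools usually store it in a sparse format rather than as nested lists.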

### 2. Quality Control
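- The **count depth** (total counts per cell) and the number of detected genes per cell are examined to identify low-quality cells.
- Cells with too few detected genes (often empty droplets or damaged cells) or too many (often doublets, where two cells were captured together) are removed.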
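
The filtering described above can be sketched as follows. The toy matrix, the `qc_filter` helper, and the thresholds (`min_counts`, `min_genes`, `max_genes`) are assumptions chosen for illustration; in practice thresholds are set per dataset after inspecting the distributions.

```python
# Toy count matrix: rows are genes, columns are cells (values are invented).
matrix = [
    [5, 0, 9, 4],
    [3, 0, 7, 2],
    [0, 1, 8, 0],
    [2, 0, 6, 0],
    [0, 0, 9, 1],
]
cells = ["c1", "c2", "c3", "c4"]

def qc_filter(matrix, cells, min_counts=3, min_genes=2, max_genes=4):
    """Keep cells whose count depth and detected-gene number fall within bounds."""
    kept = []
    for j, cell in enumerate(cells):
        column = [row[j] for row in matrix]
        depth = sum(column)                        # count depth of this cell
        n_genes = sum(1 for c in column if c > 0)  # genes with at least one count
        if depth >= min_counts and min_genes <= n_genes <= max_genes:
            kept.append(cell)
    return kept

kept_cells = qc_filter(matrix, cells)
print(kept_cells)
```

Here `c2` is dropped for having too few counts and `c3` for having more detected genes than the upper bound allows.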

### 3. Normalization
- Adjusts for differences in sequencing depth across cells using **size factors**.
- This step ensures comparability between cells.
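
One simple way to picture size-factor normalization is sketched below, assuming the size factor of a cell is its count depth divided by the mean depth across cells. This is only one of several possible definitions, and the `normalize` helper and toy values are made up for the example.

```python
import math

def normalize(matrix):
    """Divide each cell's counts by its size factor, then apply log(1 + x).

    Assumes every cell has a non-zero count depth (empty cells removed in QC).
    """
    n_cells = len(matrix[0])
    depths = [sum(row[j] for row in matrix) for j in range(n_cells)]
    mean_depth = sum(depths) / n_cells
    size_factors = [d / mean_depth for d in depths]
    normalized = [
        [math.log1p(row[j] / size_factors[j]) for j in range(n_cells)]
        for row in matrix
    ]
    return size_factors, normalized

# Two cells with the same expression profile, the second sequenced twice as deeply.
matrix = [[4, 8], [6, 12]]
size_factors, normalized = normalize(matrix)
print(size_factors)
print(normalized)
```

After normalization the two cells have essentially identical values for each gene, which is the comparability the step aims for.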

### 4. Data Correction (e.g., Batch Effect Removal)
- Corrects for unwanted technical variations (batch effects) that could distort biological signals.
- Visualizing the data before and after correction should show cells grouping by cell type rather than by batch.
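
To make the idea of a batch effect concrete, the sketch below removes a constant per-batch offset for a single gene by centering each batch on its own mean. This is a deliberately crude stand-in, not what dedicated correction methods do, and it would also erase real biological differences if the batches contain different cell types. The `center_by_batch` helper and the values are hypothetical.

```python
def center_by_batch(values, batches):
    """Subtract each batch's mean expression from the cells in that batch."""
    batch_means = {}
    for batch in set(batches):
        members = [v for v, b in zip(values, batches) if b == batch]
        batch_means[batch] = sum(members) / len(members)
    return [v - batch_means[b] for v, b in zip(values, batches)]

# One gene measured in four cells; batch "b2" carries a constant offset of +10.
values = [1.0, 3.0, 11.0, 13.0]
batches = ["b1", "b1", "b2", "b2"]

corrected = center_by_batch(values, batches)
print(corrected)
```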

### 5. Feature Selection
- Identifies **highly variable genes**, which provide the most meaningful biological signals.
- Filters out genes with low or non-informative variability.
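
A minimal sketch of feature selection, assuming genes are ranked by dispersion (variance divided by mean). Analysis toolkits typically refine this by comparing each gene with others of similar mean expression; that refinement is omitted here. The gene names and the `highly_variable_genes` helper are invented for illustration.

```python
from statistics import mean, pvariance

def highly_variable_genes(expr, n_top=2):
    """Rank genes by dispersion (variance / mean) and return the top n_top."""
    dispersion = {}
    for gene, values in expr.items():
        mu = mean(values)
        if mu > 0:  # skip genes that are never detected
            dispersion[gene] = pvariance(values) / mu
    return sorted(dispersion, key=dispersion.get, reverse=True)[:n_top]

# Expression of three hypothetical genes across four cells.
expr = {
    "GeneA": [1, 1, 1, 1],   # constant: uninformative
    "GeneB": [0, 5, 0, 5],   # strongly variable
    "GeneC": [2, 3, 2, 3],   # mildly variable
}

hvg = highly_variable_genes(expr, n_top=2)
print(hvg)
```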

### 6. Visualization
- The processed data is visualized in a reduced-dimensional space (e.g., t-SNE or UMAP) to observe clustering patterns.

---

## **Downstream Analysis**

### 7. Clustering
- Cells are grouped into clusters based on their gene expression profiles.
- Helps identify distinct cell populations.
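
The grouping step can be illustrated with plain k-means on a low-dimensional embedding. This is a swapped-in technique chosen for brevity: single-cell toolkits more commonly use graph-based community detection. The coordinates, the `kmeans` helper, and the choice of the first `k` points as starting centroids are all assumptions for this example.

```python
import math

def kmeans(points, k, n_iter=10):
    """Plain Lloyd's k-means; centroids start at the first k points."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: each cell joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels, centroids

# Hypothetical 2-D embedding of five cells forming two groups.
points = [(0.1, 0.2), (5.0, 5.1), (0.0, 0.3), (5.2, 4.9), (0.2, 0.1)]
labels, centroids = kmeans(points, k=2)
print(labels)
```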

### 8. Marker Identification & Cluster Annotation
- **Marker genes** are identified for each cluster.
- Clusters are annotated to classify cell types (e.g., **stem cells, Paneth cells, goblet cells, enterocytes**).
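
A simplified view of marker identification: for each cluster, compare a gene's mean expression inside the cluster with its mean in all other cells, and pick the gene with the largest log2 fold change. The expression values below are invented, the `top_marker` helper is hypothetical, and real analyses add a statistical test on top of the fold change. Lgr5, Muc2, and Lyz1 are used because they are commonly cited markers for intestinal stem, goblet, and Paneth cells respectively.

```python
import math

def top_marker(expr, labels, cluster, pseudocount=1.0):
    """Return the gene with the highest log2 fold change for one cluster."""
    inside = [i for i, lab in enumerate(labels) if lab == cluster]
    outside = [i for i, lab in enumerate(labels) if lab != cluster]
    scores = {}
    for gene, values in expr.items():
        mean_in = sum(values[i] for i in inside) / len(inside)
        mean_out = sum(values[i] for i in outside) / len(outside)
        # The pseudocount avoids division by zero for undetected genes.
        scores[gene] = math.log2((mean_in + pseudocount) / (mean_out + pseudocount))
    return max(scores, key=scores.get)

# Toy expression values for six cells in three clusters.
expr = {
    "Lgr5": [9, 8, 0, 1, 0, 0],
    "Muc2": [0, 1, 7, 9, 0, 1],
    "Lyz1": [0, 0, 1, 0, 8, 9],
}
labels = [0, 0, 1, 1, 2, 2]

markers = {cluster: top_marker(expr, labels, cluster) for cluster in sorted(set(labels))}
print(markers)
```

With these toy values, cluster 0 would be annotated as stem cells, cluster 1 as goblet cells, and cluster 2 as Paneth cells.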

### 9. Trajectory Inference
- **Lineage relationships** between cells are reconstructed.
- Cells are mapped along differentiation pathways (e.g., **stem cells to progenitor cells**).

### 10. Gene Dynamics
- Gene expression changes are analyzed over **pseudotime**.
- Helps track the dynamic progression of gene expression during differentiation.
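
Assuming a pseudotime value has already been inferred for each cell, a gene's dynamics can be sketched by ordering cells along pseudotime and smoothing the expression with a sliding window. The pseudotime values, expression values, and `smooth_over_pseudotime` helper are hypothetical; fitted curves are a common alternative to a moving average.

```python
def smooth_over_pseudotime(pseudotime, expression, window=3):
    """Order cells by pseudotime and return a sliding-window mean of expression."""
    order = sorted(range(len(pseudotime)), key=lambda i: pseudotime[i])
    ordered = [expression[i] for i in order]
    smoothed = []
    for start in range(len(ordered) - window + 1):
        chunk = ordered[start:start + window]
        smoothed.append(sum(chunk) / window)
    return smoothed

# Five cells in arbitrary order; the gene rises as pseudotime advances.
pseudotime = [0.9, 0.1, 0.5, 0.3, 0.7]
expression = [9, 1, 5, 3, 7]

trend = smooth_over_pseudotime(pseudotime, expression)
print(trend)
```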

### 11. Metastable States
- Identifies transitional cellular states by analyzing gene expression over pseudotime.

### 12. Differential Expression Analysis
- Compares gene expression levels between conditions.
- Volcano plots visualize significantly differentially expressed genes.
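
The two axes of a volcano plot, log2 fold change and a significance value, can be computed as in the sketch below. An exact permutation test is used here only because it needs nothing beyond the standard library; it is a swapped-in choice, not necessarily what a given pipeline uses. Gene names, values, and helper names are invented.

```python
import math
from itertools import combinations

def log2_fold_change(cond1, cond2, pseudocount=1.0):
    """log2 ratio of mean expression in condition 2 over condition 1."""
    mean1 = sum(cond1) / len(cond1)
    mean2 = sum(cond2) / len(cond2)
    return math.log2((mean2 + pseudocount) / (mean1 + pseudocount))

def permutation_pvalue(cond1, cond2):
    """Exact two-sided permutation test on the difference in means."""
    pooled = cond1 + cond2
    total = sum(pooled)
    observed = abs(sum(cond1) / len(cond1) - sum(cond2) / len(cond2))
    hits = n_splits = 0
    for idx in combinations(range(len(pooled)), len(cond1)):
        sum1 = sum(pooled[i] for i in idx)
        diff = abs(sum1 / len(cond1) - (total - sum1) / len(cond2))
        n_splits += 1
        if diff >= observed - 1e-9:  # small tolerance for float rounding
            hits += 1
    return hits / n_splits

# Expression per gene as (condition 1 values, condition 2 values).
expr = {
    "GeneUp": ([1, 2, 1, 3], [10, 12, 11, 13]),
    "GeneFlat": ([5, 6, 5, 6], [6, 5, 6, 5]),
}

results = {}
for gene, (cond1, cond2) in expr.items():
    results[gene] = (log2_fold_change(cond1, cond2), permutation_pvalue(cond1, cond2))

# Volcano coordinates: x = log2 fold change, y = -log10(p-value).
for gene, (lfc, p) in results.items():
    print(gene, round(lfc, 2), round(-math.log10(p), 2))
```

When many genes are tested at once, the p-values would normally be adjusted for multiple testing before calling genes significant.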

### 13. Compositional Analysis
- Compares cell composition under different experimental conditions (**Condition 1 vs. Condition 2**).
- Identifies shifts in cell populations.
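
A minimal sketch of the comparison, assuming each cell already carries a cell-type label: compute the proportion of each type per condition and take the difference. The labels, counts, and helper names are made up. Because proportions sum to one, an increase in one cell type necessarily lowers the share of others, which is why dedicated compositional methods exist.

```python
from collections import Counter

def proportions(labels):
    """Fraction of cells belonging to each cell type."""
    counts = Counter(labels)
    return {cell_type: n / len(labels) for cell_type, n in counts.items()}

def composition_shift(labels_1, labels_2):
    """Change in cell-type proportion from condition 1 to condition 2."""
    p1, p2 = proportions(labels_1), proportions(labels_2)
    cell_types = sorted(set(p1) | set(p2))
    return {t: p2.get(t, 0.0) - p1.get(t, 0.0) for t in cell_types}

# Hypothetical cell-type labels for ten cells per condition.
condition_1 = ["stem"] * 2 + ["goblet"] * 6 + ["enterocyte"] * 2
condition_2 = ["stem"] * 5 + ["goblet"] * 3 + ["enterocyte"] * 2

shift = composition_shift(condition_1, condition_2)
for cell_type, delta in shift.items():
    print(f"{cell_type}: {delta:+.2f}")
```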




![image.png](attachment:image.png)