# 📘 Heatmap of Entire Gene Expression Dataset

This notebook computes the gene-gene correlation matrix for an entire gene expression dataset and visualizes it as a heatmap.
Blocks of strongly correlated genes in the heatmap point to clusters of co-expressed genes, which might suggest
shared regulation or functional relationships.

## 🔧 Step 1: Import Required Libraries
We use `pandas` for data handling, `seaborn` for plotting heatmaps, and `matplotlib` for display.

In [None]:
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

## 📂 Step 2: Load the Expression Data
Ensure your dataset is in CSV format with genes as columns and samples as rows. The first column should contain sample IDs; `index_col=0` uses it as the row index so it is not treated as a gene.

In [None]:
# Replace 'your_expression_data.csv' with your actual data file
df = pd.read_csv('your_expression_data.csv', index_col=0)
df.head()
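
Before computing correlations, it can help to check the loaded table for non-numeric columns, missing values, and genes that never vary, since a constant gene has an undefined (NaN) Pearson correlation. The following is a minimal sketch on a hypothetical toy table (`toy`, with made-up gene and sample names) standing in for the real CSV; the checks are one possible approach, not a required step of this notebook.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data standing in for the real CSV: 4 samples x 3 genes.
toy = pd.DataFrame(
    {
        "GENE_A": [1.0, 2.0, 3.0, 4.0],
        "GENE_B": [5.0, 5.0, 5.0, 5.0],     # constant across samples
        "GENE_C": [2.0, np.nan, 1.0, 0.5],  # one missing value
    },
    index=["S1", "S2", "S3", "S4"],
)

# Keep numeric columns only; non-numeric columns cannot be correlated.
numeric = toy.select_dtypes(include="number")

# Count missing values per gene.
missing_per_gene = numeric.isna().sum()

# Flag genes with at most one distinct value (zero variance).
constant_genes = numeric.columns[numeric.nunique(dropna=True) <= 1].tolist()

# Drop the constant genes so they do not produce NaN rows in the correlation matrix.
filtered = numeric.drop(columns=constant_genes)

print(missing_per_gene)
print("Constant genes:", constant_genes)
```

The same checks can be applied to `df` by replacing `toy` with `df`.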

## 🧮 Step 3: Compute the Correlation Matrix
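We calculate Pearson correlations between all pairs of gene expression profiles. `df.corr()` correlates columns and uses the Pearson method by default, so the result is a square genes × genes matrix.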

In [None]:
corr_matrix = df.corr()
corr_matrix.head()
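
To make the computation concrete, the sketch below rebuilds one entry of the correlation matrix by hand and compares it with the value from `DataFrame.corr()`. The table `expr` and its gene and sample names are hypothetical, chosen only for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: 4 samples x 3 genes.
expr = pd.DataFrame(
    {
        "GENE_A": [1.0, 2.0, 3.0, 4.0],
        "GENE_B": [2.0, 4.0, 6.0, 8.0],  # exactly 2 * GENE_A
        "GENE_C": [4.0, 1.0, 3.0, 2.0],
    },
    index=["S1", "S2", "S3", "S4"],
)

# Pairwise Pearson correlation between columns (genes).
corr = expr.corr()

# Pearson correlation of GENE_A and GENE_C from the definition:
# covariance divided by the product of the standard deviations.
a = expr["GENE_A"] - expr["GENE_A"].mean()
c = expr["GENE_C"] - expr["GENE_C"].mean()
manual = (a * c).sum() / np.sqrt((a ** 2).sum() * (c ** 2).sum())

print(corr)
print("Manual GENE_A vs GENE_C:", manual)
```

The matrix is symmetric with ones on the diagonal. If a rank-based measure is preferred, `expr.corr(method="spearman")` is available as an alternative.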

## 📊 Step 4: Plot the Heatmap
Visualize the full correlation matrix with a diverging colormap (`coolwarm`) centered at 0 (`center=0`), so positive and negative correlations take opposite colors.

In [None]:
plt.figure(figsize=(15, 12))
sns.heatmap(corr_matrix, cmap='coolwarm', center=0)
plt.title('Correlation Heatmap of All Genes')
plt.show()
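
With thousands of genes, the full heatmap can become slow to render and hard to read. One option is to restrict the plot to the most variable genes before computing correlations. The sketch below uses randomly generated data and a hypothetical cutoff `n_top`; both are assumptions for illustration, and the right cutoff depends on the dataset.

```python
import numpy as np
import pandas as pd

# Hypothetical random data: 20 samples x 50 genes.
rng = np.random.default_rng(0)
expr = pd.DataFrame(
    rng.normal(size=(20, 50)),
    index=[f"S{i}" for i in range(20)],
    columns=[f"GENE_{i}" for i in range(50)],
)

# Inflate the spread of the first five genes so they stand out as highly variable.
expr.iloc[:, :5] = expr.iloc[:, :5] * 10

# Hypothetical cutoff: keep the 10 genes with the largest variance across samples.
n_top = 10
top_genes = expr.var().nlargest(n_top).index

# Correlation matrix restricted to the selected genes.
corr_top = expr[top_genes].corr()

print(corr_top.shape)
```

`corr_top` can then be passed to `sns.heatmap(corr_top, cmap='coolwarm', center=0)` in place of `corr_matrix`.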