# [Analysis Title]

**Author**: [Your Name]  
**Date**: [YYYY-MM-DD]  
**Analysis Plan**: [Reference to analysis_plan.md or PI directive]

## Scientific Context

**Research Question**: [Biological question in 1-2 sentences]

**Biological Hypothesis**: [Expected outcome and biological reasoning]

**Significance**: [Why this matters biologically]

**Relevant Background**: [1-2 sentence summary of key literature/context]

## Setup

Load required packages and set configuration parameters.
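
Later cells write to the results and figures directories, so it can help to create them before anything is saved. The following is a minimal sketch, not a required part of the template: `ensure_dirs` is a hypothetical helper, and the default directory names are assumptions that mirror `OUTPUT_DIR` and `FIG_DIR` in the configuration cell.

```python
from pathlib import Path


def ensure_dirs(base, names=("results", "figures")):
    """Create each named subdirectory under `base` if missing; return their paths.

    `ensure_dirs` and the default names are illustrative assumptions,
    chosen to match OUTPUT_DIR and FIG_DIR in this template.
    """
    base = Path(base)
    paths = {}
    for name in names:
        path = base / name
        path.mkdir(parents=True, exist_ok=True)  # no error if it already exists
        paths[name] = path
    return paths
```

With the layout assumed here, `ensure_dirs("..")` would create `../results` and `../figures` relative to the notebook's working directory.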

In [None]:
# Standard imports
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Set random seed for reproducibility
np.random.seed(42)

# Configuration
DATA_DIR = "../data/"
OUTPUT_DIR = "../results/"
FIG_DIR = "../figures/"

# Plotting style
sns.set_style("whitegrid")
plt.rcParams['figure.figsize'] = (10, 6)
plt.rcParams['figure.dpi'] = 100

## Data Loading

### Biological Context

**Dataset**: [Organism/tissue/cell type]

**Features Measured**: [What biological features: genes, proteins, metabolites, etc.]

**Experimental Design**: [Sample groups, conditions, replicates]

**Biological Question**: [What this dataset addresses]
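
Once the experimental design is written down, it can be checked against the sample sheet. The sketch below counts replicates per condition and flags groups that fall below a minimum. The column names (`sample_id`, `condition`), the toy values, and the threshold `MIN_REPLICATES` are all placeholders to adapt to the real metadata.

```python
import pandas as pd

# Hypothetical sample sheet; column names and values are placeholders.
metadata = pd.DataFrame(
    {
        "sample_id": ["s1", "s2", "s3", "s4", "s5"],
        "condition": ["control", "control", "control", "treated", "treated"],
    }
)

MIN_REPLICATES = 3  # assumed threshold, adjust to the actual design

# Count unique samples per condition and flag groups below the threshold
replicate_counts = metadata.groupby("condition")["sample_id"].nunique()
underpowered = replicate_counts[replicate_counts < MIN_REPLICATES]

print(replicate_counts)
print(f"Groups below {MIN_REPLICATES} replicates: {underpowered.index.tolist()}")
```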

In [None]:
# Load data
# data = pd.read_csv(DATA_DIR + "data.csv")

# Initial inspection
# print(f"Data shape: {data.shape}")
# print(f"Columns: {data.columns.tolist()}")
# data.head()

## Quality Control

### Biological Context

**Expected QC Metrics**: [What pass rates/distributions are normal for this data type?]

**Positive Controls**: [What samples/features should pass?]

**Negative Controls**: [What should show no signal?]

In [None]:
# Calculate QC metrics
# Example: library sizes, detection rates, etc.

# Filtering criteria
# Example: min reads, min features, etc.

### QC Interpretation

**Pass Rates**: [Are these expected for this data type?]

**Failed Samples**: [Do failures have biological meaning, or are they technical?]

**Red Flags**: [Any biological concerns from QC?]

### QC Sanity Checks
- [ ] Pass rates within expected range for data type
- [ ] Positive controls pass QC
- [ ] Negative controls behave as expected
- [ ] No unexpected batch effects
- [ ] Sample metadata consistent with biology
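
As one possible starting point for the QC cell, the sketch below computes two common per-sample metrics (library size and number of detected features) and applies simple filters. It assumes a features-by-samples count matrix. The toy counts and both thresholds are invented for illustration and are not recommendations for any particular data type.

```python
import pandas as pd

# Toy count matrix (features x samples); values are made up for illustration.
counts = pd.DataFrame(
    {
        "sample_a": [120, 0, 35, 8],
        "sample_b": [98, 2, 40, 0],
        "sample_c": [1, 0, 0, 0],
    },
    index=["gene_1", "gene_2", "gene_3", "gene_4"],
)

MIN_LIBRARY_SIZE = 50  # assumed threshold
MIN_DETECTED = 2  # assumed threshold

# Per-sample QC metrics
qc = pd.DataFrame(
    {
        "library_size": counts.sum(axis=0),
        "n_detected": (counts > 0).sum(axis=0),
    }
)
qc["pass_qc"] = (qc["library_size"] >= MIN_LIBRARY_SIZE) & (
    qc["n_detected"] >= MIN_DETECTED
)

# Keep only the samples that pass both filters
filtered_counts = counts.loc[:, qc["pass_qc"]]

print(qc)
print(f"Pass rate: {qc['pass_qc'].mean():.0%}")
```

The resulting pass rate is the number to compare against the expected range recorded under **Expected QC Metrics**.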

## Analysis

### Biological Reasoning

**Method Choice**: [Why this statistical method for this biological question?]

**Biological Assumptions**: [What biological properties justify this approach?]

**Controls**: 
- Positive: [What should show signal?]
- Negative: [What should show no signal?]

**Expected Outcomes**: [What results would support/refute hypothesis?]

In [None]:
# Perform statistical analysis
# Example: differential expression, clustering, etc.

# Store results
# results = ...

## Results

### Biological Sanity Checks

#### Expression/Abundance Checks
- [ ] Magnitudes reasonable? (e.g., |log2FC| < 10 for transcription)
- [ ] Direction matches known biology? (check known genes)
- [ ] Positive controls behave as expected?
- [ ] Negative controls show no signal?
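
The magnitude check can be automated with a simple filter on absolute effect size. This sketch assumes a results table with a `log2FC` column. The table, the gene names, and the cutoff `MAX_ABS_LOG2FC` are illustrative assumptions.

```python
import pandas as pd

# Hypothetical results table; values are made up for illustration.
results_example = pd.DataFrame(
    {"log2FC": [1.2, -0.8, 12.5, -11.0, 0.1]},
    index=["gene_1", "gene_2", "gene_3", "gene_4", "gene_5"],
)

MAX_ABS_LOG2FC = 10  # assumed plausibility cutoff

# Flag features whose absolute fold change exceeds the cutoff
implausible = results_example[results_example["log2FC"].abs() > MAX_ABS_LOG2FC]

print(f"{len(implausible)} feature(s) exceed |log2FC| > {MAX_ABS_LOG2FC}:")
print(implausible)
```

Flagged features are not necessarily wrong, but they are worth inspecting for low counts or division by near-zero values before interpretation.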

#### Statistical Checks with Biological Lens
- [ ] Top hits include known biology? (literature validation)
- [ ] Results robust to threshold changes?
- [ ] Batch effects separated from real biology?
- [ ] Multiple testing correction appropriate?
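
To make the threshold-robustness and multiple-testing checks concrete, the sketch below implements the Benjamini-Hochberg adjustment with NumPy and counts significant hits at several cutoffs. Benjamini-Hochberg is used here only as a common example; whether it is the appropriate correction depends on the analysis. The p-values are invented, and `benjamini_hochberg` is a helper written for this illustration.

```python
import numpy as np


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    p = np.asarray(pvalues, dtype=float)
    n = p.size
    order = np.argsort(p)
    # Scale each sorted p-value by n / rank
    ranked = p[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity, working down from the largest p-value
    ranked = np.minimum.accumulate(ranked[::-1])[::-1]
    adjusted = np.empty(n)
    adjusted[order] = np.clip(ranked, 0, 1)
    return adjusted


# Made-up p-values for illustration
pvals = [0.001, 0.008, 0.039, 0.041, 0.6]
padj = benjamini_hochberg(pvals)

# Count hits at several cutoffs to see how sensitive the result is
for alpha in (0.01, 0.05, 0.1):
    print(f"alpha={alpha}: {(padj <= alpha).sum()} significant")
```

If the number of hits changes sharply between nearby cutoffs, conclusions that depend on a single threshold deserve extra caution.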

#### Known Biology Validation
- Gene/feature 1: [Expected behavior] - [✓/✗]
- Gene/feature 2: [Expected behavior] - [✓/✗]
- Housekeeping genes: [Expected behavior] - [✓/✗]
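
The checklist above can be filled in programmatically by comparing observed directions with expected ones. In this sketch the feature names, the expected directions, and the tolerance for calling a feature "flat" are all hypothetical; `observed_direction` is a helper defined only for this example.

```python
import pandas as pd

# Hypothetical results; values are made up for illustration.
results_example = pd.DataFrame(
    {"log2FC": [2.1, -1.5, 0.05, 0.9]},
    index=["gene_up", "gene_down", "housekeeping_1", "gene_other"],
)

# Expected behavior per feature: "up", "down", or "flat" (assumed expectations)
expected = {
    "gene_up": "up",
    "gene_down": "down",
    "housekeeping_1": "flat",
    "gene_other": "down",
}

FLAT_TOLERANCE = 0.5  # assumed: |log2FC| at or below this counts as unchanged


def observed_direction(log2fc, tol=FLAT_TOLERANCE):
    """Classify a log2 fold change as 'up', 'down', or 'flat'."""
    if abs(log2fc) <= tol:
        return "flat"
    return "up" if log2fc > 0 else "down"


validation = pd.DataFrame({"expected": pd.Series(expected)})
validation["observed"] = results_example.loc[validation.index, "log2FC"].map(
    observed_direction
)
validation["match"] = validation["expected"] == validation["observed"]

print(validation)
```

Rows where `match` is false are candidates for the "Surprising" entry in the interpretation section.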

In [None]:
# Validate key results against known biology
# Example: check if known genes show expected patterns

# Statistical summary
# Example: number of significant hits, effect sizes, p-values

## Visualization

In [None]:
# Create publication-quality figures
# Example: volcano plot, heatmap, UMAP, etc.

# Ensure proper labels, legends, and styling

### Biological Interpretation of Visualizations

**Pattern Observed**: [What biological pattern does the plot show?]

**Expectation Match**: [Is this expected or surprising based on hypothesis?]

**Follow-Up Questions**: [What biological questions does this raise?]

## Preliminary Biological Interpretation

### Main Findings (in biological terms)

**Finding 1**: [Biological process/mechanism identified]

**Finding 2**: [Biological implication of result]

**Finding 3**: [Biological pattern observed]

### Consistency with Known Biology

**✓ Consistent**: [Results matching expected biology/literature]

**⚠️ Surprising**: [Results contradicting expectations - need expert review]

**❓ Novel**: [Findings with no clear precedent - need literature search]

### Biological Plausibility Assessment

**Effect Sizes**: [Do fold-changes/magnitudes make biological sense?]

**Directionality**: [Do up/down regulations align with mechanism?]

**Timing/Context**: [Do results match expected kinetics/conditions?]

### Caveats and Limitations

- [Biological limitation 1]
- [Technical limitation 1]
- [Assumption that needs validation]

## Expert Interpretation Needed (if applicable)

*Use this section only if there are unexpected findings, novel results, or biological mechanisms that require expert interpretation*

### Structured Handoff to Biologist-Commentator

**Finding**: [Specific result with statistics]

**Context**: [1-2 sentence biological background]

**Issue**: [What's unexpected/unclear and why]

**Question**: [Specific question for expert]

**Validation Done**: 
- Positive controls: [✓/✗]
- Replicates consistent: [✓/✗]
- Literature: [consistent/contradicts/unclear]

**Recommended Follow-Up**:
- Validation experiments: [What would confirm?]
- Literature review: [What papers to check?]
- Additional analyses: [What would strengthen?]

## Export Results

In [None]:
# Save processed data
# results.to_csv(OUTPUT_DIR + "results.csv", index=False)

# Export figures
# plt.savefig(FIG_DIR + "figure1.pdf", dpi=300, bbox_inches='tight')
# plt.savefig(FIG_DIR + "figure1.png", dpi=300, bbox_inches='tight')

# Save summary statistics
# summary_stats = {...}
# pd.DataFrame(summary_stats).to_csv(OUTPUT_DIR + "summary_stats.csv")

## Session Information

Record package versions and execution environment for reproducibility.
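
When the list of packages grows, reading versions from installed distribution metadata avoids importing each package just to print its version. This is a sketch using the standard-library `importlib.metadata` module (Python 3.8+); `package_versions` is a hypothetical helper, and the package list should be adapted to the analysis.

```python
from importlib import metadata


def package_versions(names):
    """Map each distribution name to its installed version, if any."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Record the gap instead of failing the whole cell
            versions[name] = "not installed"
    return versions


for name, version in package_versions(
    ["numpy", "pandas", "matplotlib", "seaborn"]
).items():
    print(f"{name}: {version}")
```

Note that `metadata.version` expects the distribution name as installed (for example `scikit-learn`), which can differ from the import name (`sklearn`).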

In [None]:
# Package versions
import sys
import platform

import matplotlib  # imported directly so the version lookup does not rely on plt

print(f"Python version: {sys.version}")
print(f"Platform: {platform.platform()}")
print("\nPackage versions:")
print(f"numpy: {np.__version__}")
print(f"pandas: {pd.__version__}")
print(f"matplotlib: {matplotlib.__version__}")
print(f"seaborn: {sns.__version__}")

# Add other relevant package versions

# Execution time
from datetime import datetime
print(f"\nAnalysis completed: {datetime.now().strftime('%Y-%m-%d %H:%M:%S')}")