# Project Structure Summary

This note condenses the overall repository layout and the role of each research notebook so new collaborators can orient themselves quickly.

## Notebook Purposes

- **main_analysis.ipynb** – Final presentation notebook; loads processed outputs, recomputes key stats (SME, ERP contrasts) under the updated ROI definition, and produces figures/tables for GitHub.
- **notebooks/00_quality_assessment_and_subjects_selection.ipynb** – Raw-data QC dashboard; ranks participants, visualizes noise metrics, and writes `preprocessing_summary.csv` and the selection lists that downstream steps use.
- **notebooks/01_preprocessing_pipeline.ipynb** – Entry point for running the preprocessing stack; orchestrates subject selection, filtering, re-referencing, and hands off to the automation helpers in `src/preprocessing/` for ICA + artifact rejection.
- **notebooks/02a_manual_ica_single_session.ipynb** – Walkthrough for the single manually curated subject (currently `sub-003`); includes ICA inspection, component tagging, bad-segment annotation, and exports cleaned FIF files.
- **notebooks/02b_automated_ica_batch.ipynb** – Batch pipeline for the remaining subjects; merges runs, applies ICLabel-based ICA rejection plus AutoReject epoch cleaning, and logs per-session QC metrics.
- **notebooks/03_erp_analysis.ipynb** – Performs epoching, runs the statistical tests (ERP familiarity, repetition, category effects), and writes the CSV outputs consumed by `main_analysis.ipynb`.
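
As a quick orientation aid, the notebooks listed above can be checked for presence in a fresh clone. The sketch below is illustrative only: it assumes the numeric prefixes reflect the intended execution order, and both `PIPELINE_NOTEBOOKS` and `missing_notebooks` are hypothetical names that do not exist in the repository.

```python
from pathlib import Path

# Notebook paths as named in the list above, in assumed execution order.
# 02a and 02b cover different subjects (manual vs. batch ICA), so they are
# siblings rather than strictly sequential steps.
PIPELINE_NOTEBOOKS = [
    "notebooks/00_quality_assessment_and_subjects_selection.ipynb",
    "notebooks/01_preprocessing_pipeline.ipynb",
    "notebooks/02a_manual_ica_single_session.ipynb",
    "notebooks/02b_automated_ica_batch.ipynb",
    "notebooks/03_erp_analysis.ipynb",
    "main_analysis.ipynb",
]


def missing_notebooks(repo_root):
    """Return the expected notebooks that are absent under repo_root."""
    root = Path(repo_root)
    return [nb for nb in PIPELINE_NOTEBOOKS if not (root / nb).is_file()]


if __name__ == "__main__":
    # Run from the repository root; prints nothing if every notebook exists.
    for nb in missing_notebooks("."):
        print(f"missing: {nb}")
```

Keeping the list in one place makes it easy to spot a renamed or deleted notebook before starting a full rerun.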

## Repository Layout (key paths)

```
EEG/
├── main_analysis.ipynb          # Final report notebook
├── notebooks/                   # Pipeline notebooks (00–03, plan, prototypes)
├── src/
│   ├── preprocessing/           # ICA, artifact rejection, quality modules
│   ├── analysis/                # ERP/statistics helpers and visualization utilities
│   └── utils/                   # Shared helpers (pathing, data loaders, plotting)
├── data/
│   ├── preprocessed/            # Stage-wise FIF outputs (after filtering, ICA, epochs)
│   └── derivatives/             # Additional QC derivatives when available
├── ds002680/                    # Raw BIDS-formatted source dataset
├── results/
│   ├── figures/                 # Publication-ready PNGs
│   ├── statistical_outputs/     # CSV tables (ANOVA, t-tests, ROI summaries)
│   └── quality_metrics/         # JSON/CSV logs generated during preprocessing
├── config/analysis_config.yaml  # Central config (subjects, ROI, thresholds)
├── README / docs/*.md           # Study overview and documentation
└── requirements.txt             # Python dependencies
```

This structure separates raw data, intermediate products, reusable code, and presentation notebooks so each stage can be rerun or audited independently.
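
For collaborators writing new scripts, the layout above can be mirrored in a small path lookup so that directory names are not hard-coded in several places. This is a minimal sketch, not the project's actual helper in `src/utils/`: the function name `project_paths` and the dictionary keys are assumptions made for illustration, while the relative paths are taken from the tree above.

```python
from pathlib import Path


def project_paths(repo_root):
    """Map short labels (hypothetical) to the key paths shown in the tree."""
    root = Path(repo_root)
    return {
        "raw_bids": root / "ds002680",
        "preprocessed": root / "data" / "preprocessed",
        "derivatives": root / "data" / "derivatives",
        "figures": root / "results" / "figures",
        "statistical_outputs": root / "results" / "statistical_outputs",
        "quality_metrics": root / "results" / "quality_metrics",
        "config": root / "config" / "analysis_config.yaml",
    }


if __name__ == "__main__":
    # Report which of the expected locations exist in the current checkout.
    for label, path in project_paths(".").items():
        status = "ok" if path.exists() else "absent"
        print(f"{label:20s} {status:7s} {path}")
```

Returning `Path` objects rather than strings keeps the lookup portable across operating systems and lets callers join further components with `/`.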