# Initial Sleep-EDF data exploration

This notebook explores the [Sleep-EDF Database Expanded](https://physionet.org/content/sleep-edfx/1.0.0/) from PhysioNet, which contains 197 whole-night polysomnographic sleep recordings. The dataset includes data from two studies:

1. **Sleep Cassette (SC)** - 153 recordings from 78 healthy subjects (aged 25-101) who were not taking sleep-related medication
2. **Sleep Telemetry (ST)** - 44 recordings from 22 subjects with mild difficulty falling asleep, collected in a study of temazepam effects on sleep

Each recording contains physiological signals (EEG, EOG, chin EMG, and an event marker), along with expert annotations of sleep stages.

## Data Structure

The dataset is organized in two main folders:

- **sleep-cassette/**: contains recordings from the Sleep Cassette study
- **sleep-telemetry/**: contains recordings from the Sleep Telemetry study

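Each recording consists of two files:
- **[ID]-PSG.edf**: the polysomnography recording containing all physiological signals
- **[ID]-Hypnogram.edf**: the sleep-stage annotations scored by human experts

The two IDs of a recording share a common prefix but are not identical (for example `SC4001E0-PSG.edf` and `SC4001EC-Hypnogram.edf`).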

Additional metadata is available in:
- **SC-subjects.xls**: information about subjects in the Sleep Cassette study
- **ST-subjects.xls**: information about subjects in the Sleep Telemetry study
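The file layout above suggests a simple way to pair each PSG file with its hypnogram. The sketch below is only an illustration: `pair_recordings` is a hypothetical helper, and it assumes that both file names of a recording share their first six characters (for example `SC4001`), which should be checked against the downloaded files.

```python
from pathlib import Path


def pair_recordings(study_dir):
    """Pair each *-PSG.edf file with the *-Hypnogram.edf file of the same night.

    Assumption: both file names share their first six characters
    (for example "SC4001"), which identify the subject and the night.
    """
    study_dir = Path(study_dir)
    # Index hypnograms by their six-character prefix.
    hypnograms = {p.name[:6]: p for p in study_dir.glob("*-Hypnogram.edf")}

    pairs = []
    for psg in sorted(study_dir.glob("*-PSG.edf")):
        hypnogram = hypnograms.get(psg.name[:6])
        if hypnogram is not None:  # skip PSG files without a matching hypnogram
            pairs.append((psg, hypnogram))
    return pairs
```

Calling `pair_recordings("sleep-cassette")` would then return a list of `(psg_path, hypnogram_path)` tuples, one per recording that has both files.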

## Signal Dictionary

### PSG Files
Each PSG file contains multiple channels:

| Channel | Description | Sampling Rate |
|---------|-------------|---------------|
| EEG Fpz-Cz | Electroencephalogram (frontal-central) | 100 Hz |
| EEG Pz-Oz | Electroencephalogram (parietal-occipital) | 100 Hz |
| EOG horizontal | Electrooculogram (eye movements) | 100 Hz |
| EMG submental | Electromyogram (chin muscle activity) | 1 Hz (SC), 100 Hz (ST) |
| Event marker | Annotations of events | 1 Hz |
| Body temperature | Rectal body temperature (not in all recordings) | 1 Hz |
| Respiration | Oro-nasal airflow (not in all recordings) | 1 Hz |
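The channel list and sampling rates can be verified per file, because every EDF header stores each signal's label and its number of samples per data record. The following is a minimal sketch using only the standard library; `read_edf_sampling_rates` is a hypothetical helper, the byte offsets follow the published EDF header layout, and it is meant for the PSG files (a header with a data-record duration of 0 would raise a `ZeroDivisionError`).

```python
def read_edf_sampling_rates(path):
    """Return {channel label: sampling rate in Hz} read from an EDF header."""
    with open(path, "rb") as f:
        fixed = f.read(256)  # fixed-size part of the header
        # Bytes 244-251: duration of one data record, in seconds.
        record_duration = float(fixed[244:252].decode("ascii").strip())
        # Bytes 252-255: number of signals (channels) in the file.
        n_signals = int(fixed[252:256].decode("ascii").strip())
        per_signal = f.read(256 * n_signals)  # 256 header bytes per signal

    def field(offset_per_signal, width, index):
        # Per-signal fields are stored field by field, not signal by signal.
        start = offset_per_signal * n_signals + width * index
        return per_signal[start:start + width].decode("ascii").strip()

    rates = {}
    for i in range(n_signals):
        label = field(0, 16, i)                     # 16-byte channel label
        samples_per_record = int(field(216, 8, i))  # after 216 bytes of other fields
        rates[label] = samples_per_record / record_duration
    return rates
```

Reading the rates from the header avoids loading any signal data, so it is cheap to run over every file in a study folder.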

### Hypnogram Files
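Sleep stages were scored according to the Rechtschaffen and Kales (R&K) manual. In the EDF+ hypnogram files each stage is stored as a text annotation such as `Sleep stage W` or `Sleep stage 2`, so the numeric codes below act as a shorthand for those labels: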

| Code | Sleep Stage | Description |
|------|-------------|-------------|
| 0 | W | Wake |
| 1 | 1 | Non-REM stage 1 |
| 2 | 2 | Non-REM stage 2 |
| 3 | 3 | Non-REM stage 3 |
| 4 | 4 | Non-REM stage 4 |
| 5 | R | REM sleep |
| 6 | M | Movement time |
| 9 | ? | Unknown |

Modern scoring under the AASM rules merges R&K stages 3 and 4 into a single stage, N3 (slow-wave sleep), and renames stages 1 and 2 to N1 and N2.
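Merging stages 3 and 4 can be expressed as a lookup table. The sketch below assumes annotation labels of the form `Sleep stage W`, `Sleep stage 1`, and so on; `RK_TO_AASM` and `to_aasm` are hypothetical names, and treating movement time and unscored epochs as `None` is one possible choice, not the only one.

```python
# Assumed R&K annotation labels mapped to AASM stage names.
RK_TO_AASM = {
    "Sleep stage W": "W",
    "Sleep stage 1": "N1",
    "Sleep stage 2": "N2",
    "Sleep stage 3": "N3",  # stages 3 and 4 are merged into N3
    "Sleep stage 4": "N3",
    "Sleep stage R": "REM",
}


def to_aasm(labels):
    """Map R&K labels to AASM stages.

    Labels without an AASM equivalent (movement time, unscored epochs)
    become None so that the output stays aligned with the input epochs.
    """
    return [RK_TO_AASM.get(label) for label in labels]
```

Keeping a `None` placeholder instead of dropping unmapped epochs preserves the one-to-one correspondence between labels and 30-second epochs, which matters if the labels are later matched against signal segments.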

## Technical Information

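- **File Format**: European Data Format (EDF) for the PSG files and EDF+ for the hypnograms
- **Recording Duration**: about 20 hours for Sleep Cassette recordings, including pre-sleep and post-sleep periods; about 9 hours for Sleep Telemetry recordings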
- **Annotation Intervals**: sleep stages are annotated in 30-second epochs
- **Main Paper**: [The Sleep-EDF Database Expanded](https://doi.org/10.1109/TBME.2018.2872652)
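The 30-second epoch length fixes how a signal maps onto stage labels: a 100 Hz channel contributes 100 × 30 = 3000 samples per epoch, and a 20-hour recording spans 20 × 3600 / 30 = 2400 epochs. As a rough illustration, the hypothetical helper below cuts a one-dimensional sequence of samples into consecutive epochs; discarding a trailing partial epoch is an assumption made here for simplicity.

```python
def split_into_epochs(samples, sfreq, epoch_seconds=30):
    """Split a 1-D sequence of samples into consecutive fixed-length epochs.

    A trailing partial epoch, if any, is discarded.
    """
    samples_per_epoch = int(round(sfreq * epoch_seconds))
    n_epochs = len(samples) // samples_per_epoch
    return [
        samples[i * samples_per_epoch:(i + 1) * samples_per_epoch]
        for i in range(n_epochs)
    ]
```

For real recordings an array library would be more efficient, but the slicing logic stays the same.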

In this notebook, we'll use the MNE-Python library to load and analyze these EDF files.
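A minimal loading sketch with MNE-Python could look like the following. `load_recording` is a hypothetical helper and the file names in the usage comment are placeholders for a local download; the calls used are `mne.io.read_raw_edf`, `mne.read_annotations`, and `Raw.set_annotations`.

```python
def load_recording(psg_path, hypnogram_path):
    """Load one PSG recording and attach its hypnogram annotations."""
    import mne  # imported here so that defining the helper does not require MNE

    # preload=False keeps the signal data on disk until it is needed.
    raw = mne.io.read_raw_edf(psg_path, preload=False)
    # The hypnogram is an annotation-only file, read separately from the signals.
    annotations = mne.read_annotations(hypnogram_path)
    raw.set_annotations(annotations, emit_warning=False)
    return raw


# Example usage (placeholder paths):
# raw = load_recording("SC4001E0-PSG.edf", "SC4001EC-Hypnogram.edf")
# print(raw.ch_names)
# print(set(raw.annotations.description))
```

After loading, `raw.annotations` holds the onset, duration, and label of each scored segment, which is the starting point for cutting the signals into labelled epochs.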