Brainwave

Reimplementation of the BrainWave model from BrainWave: A Brain Signal Foundation Model for Clinical Applications, trained on the Braintree Bank dataset.

Quick Start

Install the required dependencies (we recommend using uv for managing virtual environments):
```
uv sync  # if not using uv, use: pip install -e .[dev] with a virtual environment
```
Download and preprocess the Braintree Bank dataset (~230 GB) using the provided SLURM script (scripts/data.sh) or manually via brainsets, and update your .env file with the path to the dataset. Note you may need to adjust the script and scripts/env.sh to fit your cluster setup.
```
ROOT_DIR=/path/to/braintree_bank_dataset
```
Modify the model, training logic, and configuration files as needed (see below). Especially, you will want to set the wandb.project and wandb.entity in the configuration to log your training runs to Weights & Biases.

Run the training script:

uv run -m brainwave [CLI overrides]  # if not using uv, use: python -m brainwave

or if using SLURM:

sbatch scripts/train.sh [CLI overrides]

configs/: Contains configuration files for different training setups. We use Hydra for configuration management, allowing easy CLI overrides and organization.
train/: Contains the source code for data modules, models, and training scripts.
- model.py: Model architecture.
- mask.py: Code for creating masks for the input data, used for reconstruction learning objective.
- datamodule.py: PyTorch Lightning DataModule for loading and preprocessing the dataset.
- featurizer.py: Code for feature extraction from raw neural data. Will be applied to each sample in the dataset to create the input features for the model.
- pl_module.py: PyTorch Lightning module that wraps the model and defines training/validation steps.
- dataset.py: Data loading and preprocessing. You can probably ignore this for now.
- train.py: The main training script that orchestrates the training process.
scripts/: Scripts to run on a SLURM cluster. May need to be adjusted to your cluster configuration.
- data.sh: SLURM script to download and prepare the Braintree Bank dataset using brainsets.
- env.sh: SLURM script to set up the environment for training jobs, including loading modules and activating virtual environments. Update this to fit your cluster's environment management.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
brainwave		brainwave
configs		configs
scripts		scripts
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock