This Github repository contains data and code that was used to generate all results and figures in the pre-print "Simons Sleep Project (SSP): An open science resource for accelerating scalable digital health research in autism and other psychiatric conditions", which is availabe on BioArxiv at (link). The SSP is a comprehensive open-science resource with raw and processed data from Dreem3 EEG headbands, multi-sensor EmbracePlus smartwatches, and Withings Sleep mats, as well as parent questionnaires and daily sleep diaries. Data includes recordings from >3600 days/nights of 102 adolescents (10-17 years old) with idiopathic autism and 98 of their non-autistic siblings. All raw and processed data is freely available through the Simons Foundation Autism Research Initiative (SFARI). For further details about the project and the data please read the paper.
-
Dreem3 EEG Headband (Beacon Inc.) recordings
- Electroencephalogram (EEG) from 5 channels
- Accelerometer data
- Positiongram data
-
EmbracePlus Smartwatch (Empatica Inc.) recordings
- Accelerometer data
- Electrodermal activity
- Skin temperature
- Blood volume pulse
-
Withings Sleep Mat (under the mattress) recordings
- Respiration
- Heart rate and variability
- Pressure changes (movements)
-
Daily sleep diary reports
- Parent ratings of child mood and behavior
- Parent reported sleep measures
- Daily medication intake
-
Parent questionnaires at baseline
- Vineland Adaptive Behaviors Scale
- Child Behavior Check List
- Social Responsiveness Scale 2
- Repetitive Behaviors Scale - Revised
- Child Sleep Habits Questionnaire
- Family Inventory Sleep Habits
- Aberrant Behaviors Checklist
- Sensory Profile 2
- Medical Update Questionnaire
All data analyzed in the manuscript is included, enabling full reproducibility. Files are organized by data type:
- df_cshq: sleep measures as reported by parents on the cshq questionnaire
- df_diary: sleep measures as reported by parents on daily sleep diary questions
- df_dreem: sleep measures extracted from Dreem3 recordings
- df_embrace: sleep measures extracted from EmbracePlus recordings
- df_withings: sleep measures extracted from Withings recordings
- df_yasa: sleep measures extracted from raw EEG recordings analyzed with the Yasa sleep staging algorithm
- df_figure3: raw data from multiple devices in 24 hour recording of a single participant.
- df_questionnaires: parent reported scores on multiple questionnaires.
- SSP_participants: demographics data of participants
- merged_3nights: subset of data from participants who had simultaneous data from all devices in at least 3 nights.
Files ending in git contain all available recordings. Files ending in clean include data after pre-processing to exclude nights with extreme sleep measures. Pre-processing python code is included in the notebooks directory to clarify criteria for data inclusion and exclusion.
All analysis were performed with Python code available in the included notebooks. Analysis steps include:
- Pre-processing the data - selecting nights with minimun 3 hour of TST, maximum 16 hours of TST, and maximum 3 hours of WASO.
- Basic data exploration and visualization of available sleep metrics from the 3 wearable devices.
- Comparison of questionnaire and sleep diary data across autism and sibling groups.
- Comparison of device sleep measures across autism and sibling groups.
- Examining relationships between objective sleep measures from the devices and subjective sleep measures from questionnaires.
Python notebooks are organized according to the analyses presented in the paper with each notebook corresponding to analyses presented in a specific figure.
- Presents the availability of different data types and their overlap
- Perform mixed linear model analysis per questionnaire to identify differences across autism and sibling groups
- Present findings using box plot graphs per questionnaire.
- Presents multitude of harmonized data streams available for a single subject during a 24 hour period including:
- EEG recording from Dreem3
- Multiple sensor data from EmbracePlus
- Sleep mat measurements from Withings
- Calculates Concordance Correlation Coefficients (CCCs) for multiple sleep measures across devices, sleep diary, and CSHQ.
- Sleep Onset
- Final Awakening
- Wake After Sleep Onset
- Total Sleep Time
- Compares objective sleep metrics between autistic children and their siblings including:
- Total Sleep Time (TST)
- Wake After Sleep Onset (WASO)
- Includes statistical analyses and visualizations
- Analyzes relationships between objectively defined sleep disturbances (WASO and SOL) and behavioral symptoms as reported by parents
- Includes sleep measures extracted from raw EEG recordings using Yasa sleep staging algorithm
- Provides detailed statistical analyses and visualizations
Each notebook is self-contained and includes:
- Data loading and preprocessing steps
- Detailed analysis procedures
- Code for generating visualizations
- Statistical analyses where applicable
- Explanatory markdown cells documenting each step
Users can follow these notebooks to:
- Understand the analysis methodology
- Reproduce the results
- Extend the analyses for their own research questions
- Learn about sleep research data analysis techniques
Important Note: The visualizations and analyses in this repository are conducted on summarized sleep metrics (e.g., total sleep time, wake after sleep onset, sleep onset latency) and demographic data. The raw sensor data (EEG signals, accelerometer data, etc.) are not included in this repository but are available in the complete dataset on SFARI Base.

