Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add EEG seizure dataset and conversion script #6

Merged
merged 11 commits into from
Jan 13, 2023

Conversation

ikottlarz
Copy link
Member

Hi there!

I have added 18.csv containing an EEG recording of a patient with a seizure [1, 2], available from physionet [3].
For completeness, I have also added the python script I used to convert it from the .edf format to .csv.
Why did I use a python script? Because I found no Julia package to read .edf files
and using pyedflib via PythonCall doesn't work very nicely atm. The python file includes some further info about the dataset.

References

[1] Detti, P. (2020). Siena Scalp EEG Database (version 1.0.0). PhysioNet. https://doi.org/10.13026/5d4a-j060.

[2] Detti, Paolo, Giampaolo Vatti, and Garazi Zabalo Manrique de Lara. “EEG Synchronization Analysis for Seizure Prediction: A Study on Data of Noninvasive Recordings.” Processes 8, no. 7 (July 16, 2020): 846. https://doi.org/10.3390/pr8070846.

[3] Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

@ikottlarz
Copy link
Member Author

@Datseris I have now included 4 EEG time series, downsampled to 4 Hz. The file is now 8.5MB, this should be okay?

exercise_data/18.csv Outdated Show resolved Hide resolved
@@ -14,4 +14,5 @@
14 = Dataset 93 of https://webdav.tuebingen.mpg.de/cause-effect/ (citation = J. M. Mooij, J. Peters, D. Janzing, J. Zscheischler, B. Schoelkopf: "Distinguishing cause from effect using observational data: methods and benchmarks"). First column of data is average precipitation over 1948 to 2004 in mm/day, second is average runoff
15 = Same as 14, but dataset 73: first column is CO2 emissions, second column is energy use (kg of oil equivalent)
16 = Long-time simulation data for orbital parameters of Earth, from http://vo.imcce.fr/insola/earth/online/earth/online/index.php with DOI: 10.1051/0004-6361:20041335. The data (each column) are: time (1000yr), eccentricity, climatic precession, obliquity (radians), insolation (W/m^2). Latitude is 65, record at day of summer solstice (21 June, true longitude = 90).
17 = Vostok Ice Core Data for 420,000 Years. First column is time in years (increasing towards the past), second column is estimated temperature difference w.r.t. current day temperature. Notice that the data are not equi-temporally sampled. The data were downloaded from https://www.ncdc.noaa.gov/paleo-search/ with suggested data citation: "Petit, J.R., et al., 2001, Vostok Ice Core Data for 420,000 Years, IGBP PAGES/World Data Center for Paleoclimatology Data Contribution Series #2001-076. NOAA/NGDC Paleoclimatology Program, Boulder CO, USA." and see https://doi.org/10.1038/20859
17 = Vostok Ice Core Data for 420,000 Years. First column is time in years (increasing towards the past), second column is estimated temperature difference w.r.t. current day temperature. Notice that the data are not equi-temporally sampled. The data were downloaded from https://www.ncdc.noaa.gov/paleo-search/ with suggested data citation: "Petit, J.R., et al., 2001, Vostok Ice Core Data for 420,000 Years, IGBP PAGES/World Data Center for Paleoclimatology Data Contribution Series #2001-076. NOAA/NGDC Paleoclimatology Program, Boulder CO, USA." and see https://doi.org/10.1038/20859
18 = EEG recordings of a patient from the Siena Scalp EEG Database. Each Column is one EEG signal at electrode F8 (right frontal region, patient has right-hemisphere seizures) from patient 0, downsampled to 64Hz. The seizures in the recordings are documented to have started at t=[1143.0s, 1220.0s, 1006.0s, 904.0s] and ended at t=[1213.0s, 1274.0s, 1080.0s, 971.0s], respectively. These time series were downloaded from https://physionet.org/files/siena-scalp-eeg/1.0.0/PN00/PN00-1.edf and converted into the csv format using convert_18.py
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are these times that you document here the times of the real data, of the subsampled data? Isn't it simpler to give row indices as the start and end?

@Datseris Datseris merged commit ee435e6 into JuliaDynamics:master Jan 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants