Skip to content

yilaili/cryoRL-pytorch-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 

Repository files navigation

CryoRL-pytorch-data

Dataset ("Aldolase") used in https://github.com/IBM/CryoRL-pytorch and Optimized path planning surpasses human efficiency in cryo-EM imaging.

The dataset consists of image files cropped and converted from original cryo-EM images in mrc format.

Note that our previous paper CryoRL: Reinforcement Learning Enables Efficient Cryo-EM Data Collection used a classifier instead of a regressor. The train-test split was slightly different as well.

Note: You will need git lfs (https://git-lfs.github.com/) to download the dataset.
You can also download it from our shared Google drive: https://drive.google.com/drive/folders/1znPXk5fJ9aujWDfeaU3LJlLyVjnuod_Y?usp=sharing

Data preparation description

Patch level images were converted from mrc to png format (8-bit) using e2proc2d.py.

With hole coordinates from Leginon database (obtained when collecting the systematic dataset as ground truth), individual hole images were cropped with boxes of 150x150 px.

The CTFMaxRes values were also obtained from Leginon database and matched with the corresponding hole images.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published