Skip to content

Latest commit

 

History

History
35 lines (25 loc) · 2.68 KB

Datasets.rst

File metadata and controls

35 lines (25 loc) · 2.68 KB

Datasets

Datasets for training, testing and casestudy

Our work used datasets S1676, S236, S543 to investigate the prediction of stability changes on protein (Table S1).

Table S1 Datasets used to build, evaluate and independently test in MU3DSP

Dataset Total Variants (Proteins) Destabilizing Variants (Proteins) Stabilizing Variants (Proteins) Stabilizing Variants (Proteins) Additional Details
S1676 1676 (67) 1,223 (64) 424 (53) 29(4) Unique Variants/Averaged DDG
S236 236 (22) 192 (18) 42 (14) 2(2) Unique Variants/Averaged DDG
S543 543(55) 426(48) 107(37) 10(6) Unique Variants/Averaged DDG
p53 42 (1) 31(1) 11 (1) 0 One Protein

Databases and references

Folkman, L., et al., EASE-MM: Sequence-Based Prediction of Mutation-Induced Stability Changes with Feature-Based Multiple Models. J Mol Biol, 2016. 428(6): p. 1394-1405.

Pires, D.E., D.B. Ascher, and T.L. Blundell, mCSM: predicting the effects of mutations in proteins using graph-based signatures. Bioinformatics, 2014. 30(3): p. 335-42.

Datasets with features

Downloading