Skip to content

In the proposed approach, seven different features are extracted. Then, the extracted features are combined, ranked based on the Fisher’s score (F-score), and the most efficient ones are selected.

jimy2020/Malonylation-site-prediction

Repository files navigation

Malonylation site prediction

Lysine malonylation is one of the most important post-translational modifications (PTMs). It affects the functionality of cells. Malonylation site prediction in proteins can unfold the mechanisms of cellular functionalities. Experimental methods are one of the due prediction approaches. But they are typically costly and time-consuming to implement. Recently, methods based on machine-learning solutions have been proposed to tackle this problem. Such practices have been shown to reduce costs and time complexities and increase accuracy. However, these approaches also have specific shortcomings, including inappropriate feature extraction out of protein sequences, high-dimensional features, and inefficient underlying classifiers. A machine learning-based method is proposed in this paper to cope with these problems. In the proposed approach, seven different features are extracted. Then, the extracted features are combined, ranked based on the Fisher’s score (F-score), and the most efficient ones are selected. Afterward, malonylation sites are predicted using various classifiers. Simulation results show that the proposed method has acceptable performance compared with some state-of-the-art approaches. In addition, the XGBOOST classifier, founded on extracted features such as TFCRF, has a higher prediction rate than the other methods.

Requirement:

This program is suitable for Python3

This program uses the FScore method in the feature selection phase.

In the classifier folder, we have implemented various classifier methods.
In the data folder, There are original datas.
In the classifier folder, we have implemented various classifier methods.
In the Feature_extraction folder, we have implemented various feature extraction methods. Note that the weight folder is the same as the TFCRF method.

Malonylation-site-prediction:

For this purpose it is only need to run main.py file.

Feedback:

Its pleasure for me to have your comment. j.pirgazi@mazust.ac.ir

About

In the proposed approach, seven different features are extracted. Then, the extracted features are combined, ranked based on the Fisher’s score (F-score), and the most efficient ones are selected.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages