Using Genetic Algorithm to Predict the Severity of Hemophilia A
and Identify Potential Biomarkers from FVIII Mutations Data
Explore the docs »
v1 02/22 - 09/22
v2 03/23 - 04/23
Report Bug
Table of Contents
Hemophilia A is a genetic disorder caused by mutations in the FVIII protein, which is classified into three severities: mild, moderate, and severe.
This project utilizes several data files, including a distance matrix for amino acid mutations, a file with the relative surface area values for each position and corresponding amino acid, and a point mutation database with information on the location, domain, mutation type, and severity of the disease.
These data files are merged to create a comprehensive dataset. A genetic algorithm is then applied to the dataset to determine if it is possible to predict the severity of the disease based on the provided information. The algorithm searches for the best combination of features that will accurately predict the disease severity. By using this approach, the project aims to identify potential biomarkers that can assist in the diagnosis and treatment of Hemophilia A.
Install the following python libraries: numpy, pandas, scikit-learn, matplotlib, pygad and BioPython
pip install numpy
pip install pandas
pip install scikit-learn
pip install matplotlib
pip install pygad
pip install Bio