GitHub - suneethi80/drug_side_effects_predictionmodel_nss: nss capstone project

Predicting side effects caused due to drug-drug interactions

Introduction: Pharmaceutical drugs are important part of our daily lives. While some relieve symptoms, others help to cure disease or condition. However, these drugs also cause side effects ranging from mild to life threatening. These side effects could be caused by a single drug or due to interaction between 2 drugs. That is, some drugs produce undesirable side effects with intake of other drugs. This results in life threatening conditions and hospitalizations.

Why the need to predict? Predicting undesirable side effects due to intake of more than one drug helps pharmaceutical companies, the patients and the doctors. Pharmaceutical companies spend millions of dollars and 15-20 years to bring a drug to the market. Only about 2-5 percentage of drugs discovered in the lab get approved by FDA and successfully reach the market. Prediction will also benefit physicians when patients encounter side effects that were not reported before.

Sources:

Side effects due to drug-drug pair: Two_sides dataset http://tatonettilab.org/offsides/ ∙ The dataset contains - drug pairs labelled as drug_1 and drug_2. - All the side effects caused due to the drug pairs - 1716 unique drug pairs - about 200,000 drug-drug pairs containing differnt combinations of the drug drug pairs
Chemical structure profile for drugs: Pawel's Dataset :http://members.cbio.mines-paristech.fr/~yyamanishi/side-effect/ ∙ The dataset contains 888 chemical structure profile for each of the drugs

Predictive Models: Since the dataset is wide and 200,000 drug-drug pairs, the models were built only on a subset of the data. This subset of the data are those drug-drug pairs that have most common side effects. For example, fever, diarrhoea were the most common side effects due to drug-drug interactions.

Each of the side effects were the target or response variable. The chemical structures were used as features or predictors. There were 888 chemical structures. Hence, this is like a multi class single label classification problem.

Tools Used:Pandas and its library scikit-learn, Seaborn

Models Used: Classification models: Logistic regression, Random Forest, LASSO and XGBoost

Notes on notebooks: Each of the notebooks is organized in a stepwise manner and named accordingly along with numbering. For example 1 represents the first notebook and the description (in the case notebook 1 is importing dataset) is mentioned along with the number.

References: Pauwels, E., Stoven, V., Yamanishi, Y., 2011. Predicting drug side-effect profiles: a chemical fragment-based approach. BMC Bioinform. 12 (1), 169. – Source of chemical substructures data DrugBankTM : Original source for Pauwel’s dataset Two_sides Dataset: http://tatonettilab.org/offsides/

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.gitignore		.gitignore
1a_drug_side_effects_1_importing_datasets.ipynb		1a_drug_side_effects_1_importing_datasets.ipynb
2_drug_side_effects_two_sides_chemstr_merge.ipynb		2_drug_side_effects_two_sides_chemstr_merge.ipynb
3_Liu_dataset_cleanup.ipynb		3_Liu_dataset_cleanup.ipynb
4_data_exploration.ipynb		4_data_exploration.ipynb
5_drug_side_effects_model_LR.ipynb		5_drug_side_effects_model_LR.ipynb
5b_drug_side_effects_model_LR-with_oversampling.ipynb		5b_drug_side_effects_model_LR-with_oversampling.ipynb
6_drugs_side_effects_random_forrest.ipynb		6_drugs_side_effects_random_forrest.ipynb
7_drug_side_effects_LASSO.ipynb		7_drug_side_effects_LASSO.ipynb
8_drug_side_effects_XGB.ipynb		8_drug_side_effects_XGB.ipynb
README.md		README.md
drug_side_effects_1_data_exploration.ipynb		drug_side_effects_1_data_exploration.ipynb
drug_side_effects_1_importing_datasets.ipynb		drug_side_effects_1_importing_datasets.ipynb
drug_side_effects_3_df.ipynb		drug_side_effects_3_df.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

1a_drug_side_effects_1_importing_datasets.ipynb

1a_drug_side_effects_1_importing_datasets.ipynb

2_drug_side_effects_two_sides_chemstr_merge.ipynb

2_drug_side_effects_two_sides_chemstr_merge.ipynb

3_Liu_dataset_cleanup.ipynb

3_Liu_dataset_cleanup.ipynb

4_data_exploration.ipynb

4_data_exploration.ipynb

5_drug_side_effects_model_LR.ipynb

5_drug_side_effects_model_LR.ipynb

5b_drug_side_effects_model_LR-with_oversampling.ipynb

5b_drug_side_effects_model_LR-with_oversampling.ipynb

6_drugs_side_effects_random_forrest.ipynb

6_drugs_side_effects_random_forrest.ipynb

7_drug_side_effects_LASSO.ipynb

7_drug_side_effects_LASSO.ipynb

8_drug_side_effects_XGB.ipynb

8_drug_side_effects_XGB.ipynb

README.md

README.md

drug_side_effects_1_data_exploration.ipynb

drug_side_effects_1_data_exploration.ipynb

drug_side_effects_1_importing_datasets.ipynb

drug_side_effects_1_importing_datasets.ipynb

drug_side_effects_3_df.ipynb

drug_side_effects_3_df.ipynb

Repository files navigation

About

Releases

Packages

Languages

suneethi80/drug_side_effects_predictionmodel_nss

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages