Skip to content

NUS CS4220: Building a Weighted Gaussian Naive Bayes Model for prediction of gene essentiality in cancer cell-lines.

Notifications You must be signed in to change notification settings

Amandahsr/PredictingGeneEssentiality_CS4220

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

CS4220 Group Project: Predicting Gene Essentiality in Cancer Cell-Lines using a Weighted Naive Bayes Model

This repository details the code used for the group project in the module CS4220 Knowledge Discovery Methods in Bioinformatics. All code in this repository is formulated and implemented jointly by Amanda Ho Shan Rui, Benjamin Tan Jee Min, Chan Sheng You and Chen Tianying, Tiana. The gene essentiality problem is adapted from the BROAD-DREAM Gene Essentiality Prediction Challenge.

Scripts:

  1. Pre-processing.R details the pre-processing steps taken in R.
  2. InformationGain.py details the steps taken to derive weight vectors from the weight matrix for our testing and training data. The weight vectors are then used as input for our Weighted Naive Bayes Model.
  3. WeightedNaiveBayes.py details the training and testing of our Weighted Naive Bayes model and benchmarking with the Gaussian Naive Bayes model.

Project Overview:

About

NUS CS4220: Building a Weighted Gaussian Naive Bayes Model for prediction of gene essentiality in cancer cell-lines.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published