
VinayDeshpande/Project3_Protein_Interaction-


Project Report

Project 3 of CS5331-004: Visualization and Visual Analytics - Fall 2016

Study of Genes and Their Impact on Cancer Types

-Link to Demo: http://myweb.ttu.edu/videshpa/index.html

-Youtube video: https://youtu.be/AViFyIenAz0

Click on the image to watch the video.

This project is based on cancer research. Its main goal is to understand the research work in progress and to overlay experimental data (from cancer studies) on top of a protein network. The main data source is http://www.cbioportal.org/. Beyond that website, our team also gathered data from publications on cancer research and its intensity.

Data Pre-Processing

The JavaScript code sends requests to the cBioPortal API (Application Programming Interface) and obtains the relevant data. The data mainly consists of different case studies for a particular gene. The details are shown below.

1

The data requested from cBioPortal needs to be processed and filtered according to the requirement. In this project, our main aim is to show the male-to-female ratio for a given gene.

2

The data is an array of objects, each containing the male count and the female count for one cancer type, for the gene clicked by the user.

3
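The aggregation described above can be sketched as follows. This is a minimal illustration, not the project's actual code: the function name `countByGender` and the record fields `cancerType` and `gender` are hypothetical, since the real cBioPortal response format is not shown here.

```javascript
// Hypothetical record shape: { cancerType: "Lung", gender: "Male" }.
// These field names are assumptions, not the actual cBioPortal response format.
function countByGender(records) {
  const counts = new Map();
  for (const { cancerType, gender } of records) {
    if (!counts.has(cancerType)) {
      counts.set(cancerType, { cancerType, male: 0, female: 0 });
    }
    const entry = counts.get(cancerType);
    const normalized = String(gender).trim().toLowerCase();
    if (normalized === "male") entry.male += 1;
    else if (normalized === "female") entry.female += 1;
    // Records with a missing or unrecognized gender are not counted.
  }
  // One object per cancer type, in first-seen order.
  return Array.from(counts.values());
}

// Usage with made-up sample records:
const sampleRecords = [
  { cancerType: "Lung", gender: "Male" },
  { cancerType: "Lung", gender: "Female" },
  { cancerType: "Breast", gender: "Female" },
];
console.log(countByGender(sampleRecords));
```

Keeping one object per cancer type means the same array can be handed directly to the bar chart and to the sorting step.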

The requirement is extended by sorting on the male and female counts. The legend entries are Male and Female. When the user clicks the Male legend, the data is sorted by male count; when the user clicks the Female legend, the data is sorted by female count. The purpose of this requirement is to understand the effect of each gene on a cancer type by gender.

Rendering the data

4

Sorting the Cancers based on its impact on Males

male sort

Sorting the Cancers based on its impact on Females

female sort
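The legend-driven sorting shown above could be implemented along these lines. This is a sketch under assumptions: the function name `sortByCount` and the `male`/`female` field names are hypothetical, and descending order is assumed because the report does not state the sort direction.

```javascript
// Sort cancer-type entries by the count that matches the clicked legend item.
// `key` is "male" or "female"; descending order is an assumption.
function sortByCount(data, key) {
  if (key !== "male" && key !== "female") {
    throw new Error(`Unknown legend key: ${key}`);
  }
  // Copy first so the original array is not mutated by sort().
  return [...data].sort((a, b) => b[key] - a[key]);
}

// Usage with made-up counts:
const sampleCounts = [
  { cancerType: "Lung", male: 12, female: 7 },
  { cancerType: "Breast", male: 1, female: 30 },
  { cancerType: "Colorectal", male: 9, female: 8 },
];
console.log(sortByCount(sampleCounts, "male").map((d) => d.cancerType));
console.log(sortByCount(sampleCounts, "female").map((d) => d.cancerType));
```

Returning a sorted copy keeps the unsorted data available, so switching between the two legend entries never depends on the previous click.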

The requirement is further extended by showing the intensity of each gene for a particular cancer type, which helps the user understand the effect of each gene on that cancer type. The data for this requirement was obtained from publications on http://www.proteinatlas.org/. We consider the CSV data reliable because it comes from published sources. The CSV data shows how strongly the gene affects a particular cancer type. It is matched dynamically with the data obtained from cBioPortal so that both can be visualized together.

High Intensity

high intensity

Medium Intensity

medium intensity

Low Intensity

low intensity
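Matching the CSV intensity data against the cBioPortal counts, as described above, might look like the sketch below. Everything specific here is an assumption: the CSV columns `gene`, `cancerType`, and `intensity`, the function names, and the simple comma splitting (which does not handle quoted fields) are illustrative and not taken from the Protein Atlas download format.

```javascript
// Parse a simple CSV (no quoted fields) into an array of row objects.
// Column names come from the header line; the ones used below are hypothetical.
function parseIntensityCsv(text) {
  const [headerLine, ...rows] = text.trim().split(/\r?\n/);
  const headers = headerLine.split(",").map((h) => h.trim());
  return rows
    .filter((row) => row.trim() !== "")
    .map((row) => {
      const cells = row.split(",").map((c) => c.trim());
      return Object.fromEntries(headers.map((h, i) => [h, cells[i]]));
    });
}

// Attach an intensity label to each per-cancer count entry for one gene.
// Cancer types are matched case-insensitively; unmatched entries get "unknown".
function attachIntensity(gene, counts, intensityRows) {
  const byCancer = new Map(
    intensityRows
      .filter((r) => r.gene === gene && r.cancerType)
      .map((r) => [r.cancerType.toLowerCase(), r.intensity])
  );
  return counts.map((c) => {
    const lookupKey = c.cancerType.toLowerCase();
    return {
      ...c,
      intensity: byCancer.has(lookupKey) ? byCancer.get(lookupKey) : "unknown",
    };
  });
}

// Usage with made-up data:
const sampleCsv = "gene,cancerType,intensity\nTP53,Lung,High\nTP53,Breast,Low\n";
const sampleGeneCounts = [{ cancerType: "Lung", male: 12, female: 7 }];
console.log(attachIntensity("TP53", sampleGeneCounts, parseIntensityCsv(sampleCsv)));
```

Joining on the cancer type keeps the two sources independent: the counts can be refreshed from the API without re-reading the CSV.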

The next requirement is to show what the cancer gene looks like. The best way to do that is to show the user the gene's structure. The images were obtained from reliable sources, and the image is rendered at the user's request. A user with no knowledge of cancer research may find the structure hard to understand on its own, but alongside the visualized data it becomes easier to correlate the data with the gene and to gain some basic knowledge of it.

Here is a screenshot that shows a particular gene along with its effect and its impact by gender.

The project shows the data as a bar graph, which makes it easy for any type of user to read and understand. In our experience, a bar graph is an effective way to visualize this data, even for a layperson.

Future Work

1) The retrieved image can be animated further using the cola.js library.
2) The visualization, although effective, can be further enhanced.
3) Use the Three.js library to improve the visual appearance and add animation.

Weekly Report

Week 1 (11/18/2016 - 11/21/2016)

Meeting 18/11/2016
--Discussed the basic idea of the project. Distributed tasks among the group members.

Arun Kumar Jagerkal
1) Study of Web API. (done)
2) Connecting and retrieving information related to different proteins from the cBioPortal database. (done and showed sample data)

Vinay Arvind Deshpande
1) Setting up GitHub repository. (done)
2) Research on different proteins, including all the information related to cancer studies. (found the proteinatlas website)

Bhavya Batra
1) Research on the cBioPortal website for all the proteins. (done)
2) Analyze what information can be rendered efficiently. (done and discussed ideas with the professor)

Week 2 (11/21/2016 - 11/28/2016)

Meeting 21/11/2016

Arun Kumar Jagerkal
1) Need to get the statistical information when passed as a query. (done)
2) Mutation rates based on gender. (done)
3) Connecting and retrieving information related to different proteins from the cBioPortal database. (done)

Vinay Arvind Deshpande
1) Research on the proteinatlas website for a Web API to get protein data. (done)
2) Work on the data to get different cancer types based on proteins. (done)

Bhavya Batra
1) Research on how to visualize the data. (done)
2) Fetch protein-based images for different types of cancers. (Found the REST API; working with Arun and Vinay)

Week 3 (11/28/2016 - 12/05/2016)

Meeting 21/11/2016

Arun Kumar Jagerkal
1) Need to get the statistical information when passed as a query.
2) Mutation rates based on gender.
3) Connecting and retrieving information related to different proteins from the cBioPortal database.
4) Implementation of the above requirement.

Vinay Arvind Deshpande
1) Research on the proteinatlas website for a Web API to get protein data.
2) Implementation of the above requirement.

Bhavya Batra
1) Research on how to visualize the data.
2) Fetch protein-based images for different types of cancers.
3) Implementation of the above requirement.

About

In this project, we will work on the protein interaction network. We need to overlay experimental data (from cancer studies) on top of a protein network.
