CPTAC-proteomics-pipeline

This repository contains the data-processing scripts for transferring CPTAC data into cBioPortal, developed as part of Google Summer of Code 2016. Beyond producing flat text files for import into the cBioPortal database, the scripts also support data exploration and cross-dataset normalization. This work has been incorporated into the cBioPortal visualization interface.

Usage

This is a fairly specialized package, so we designed it to be easy to use on the fly. First, clone the repo and cd into it:

git clone https://github.com/cBioPortal/CPTAC-proteomics-pipeline.git
cd CPTAC-proteomics-pipeline

If you would like all of the CPTAC files we used, run the wget script:

./wget.sh

Please see the tutorial, which walks through all the elements of the API.

NOTE: As shown in the tutorial, to import the classes, add the location of the ms2cbioportal.py script, relative to your current working directory, to sys.path. For example, since the tutorial is nested inside the repo:

import sys
sys.path.append('../')
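
If you run code from somewhere other than the tutorial directory, a slightly more defensive version of the note above may help. The following is only a sketch: the helper name add_repo_to_path is hypothetical (it is not part of this repo), and the ".." argument assumes you are one level below the repo root. It resolves the path to an absolute one, avoids adding it twice, and then checks whether ms2cbioportal.py can be found before you try to import from it.

```python
import importlib.util
import sys
from pathlib import Path


def add_repo_to_path(repo_root):
    """Put repo_root on sys.path (once) and return it as an absolute path string.

    Hypothetical helper for illustration; not part of ms2cbioportal.py.
    """
    root = str(Path(repo_root).resolve())
    if root not in sys.path:
        sys.path.insert(0, root)
    return root


# Assumption: the current working directory is one level below the repo root,
# as it is for the tutorial.
root = add_repo_to_path("..")

# find_spec returns None when ms2cbioportal.py is not discoverable on sys.path.
module_found = importlib.util.find_spec("ms2cbioportal") is not None
print(f"repo root: {root}; ms2cbioportal importable: {module_found}")
```

If the last line reports that the module is not importable, adjust the path passed to add_repo_to_path so that it points at the directory containing ms2cbioportal.py.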

Acknowledgements

Thanks to my PI David Fenyo and the GSoC mentors at MSKCC, JJ Gao and Zack Heins, for guidance.
