Formation of spectral libraries by representative spectra #9

percolator · 2019-10-13T20:07:29Z

Abstract

Methods to represent multiple spectra
Spectral library searching offers a sensitive yet fast method to match spectra from mass spectrometry-based proteomics experiments. The technique was first introduced for searching spectra from data-dependent acquisition (DDA) but has proven essential for the analysis of data-independent acquisition spectra.
As an input, the technique requires spectral libraries. Such entities could be assembled from previously acquired DDA MS2 spectra. One critical step of this assembly process is the integration of the potentially large number of spectra that stem from an individual peptide-species into a single representative spectrum. Here, we will implement and benchmark a couple of such strategies to form representative spectra for the use in spectral libraries.

Work plan

Different strategies have been suggested for forming representative spectra. Frank et al. (JPR 2008) list five strategies, where one selects the representative spectrum to be:

The "best spectrum”: the spectrum that maximizes a certain score, e.g., percent of explained intensity or percent of explained b/y ions.
The “consensus spectrum”: a virtual spectrum constructed by averaging all spectra in the cluster. (Tabb et al. JASMS 2005)
The “most similar spectrum”: the spectrum that has the highest average similarity to the other cluster members (Tabb et al. Anal Chem 2003).
The “de novo spectrum”: the spectrum that has the highest score when submitted to de novo sequencing.
The random spectrum: a spectrum chosen from the cluster at random.

In this workshop, we will first establish datasets and code to benchmark different methods to form representative spectra. We will implement a couple of the methods mentioned above as well as further improvements from such methods, benchmark the methods and examine their properties. Ideally, we form separate teams implementing different methods.

Technical details

We will mainly use Python 3.7.

Contact information

Lukas Käll
KTH - Royal Institute of Technology
Stockholm, Sweden
lukas.kall@scilifelab.se

ypriverol · 2019-11-01T14:34:23Z

I'm in!!!

percolator · 2020-01-09T17:43:08Z

A repository for the hackathon is available through this link
https://github.com/statisticalbiotechnology/specpride

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Formation of spectral libraries by representative spectra #9

Formation of spectral libraries by representative spectra #9

percolator commented Oct 13, 2019

ypriverol commented Nov 1, 2019

percolator commented Jan 9, 2020

Formation of spectral libraries by representative spectra #9

Formation of spectral libraries by representative spectra #9

Comments

percolator commented Oct 13, 2019

Abstract

Work plan

Technical details

Contact information

ypriverol commented Nov 1, 2019

percolator commented Jan 9, 2020