Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

openSearch issue: MSFragger mass calibration + peptideProphet #50

Closed
adithirv opened this issue Nov 28, 2019 · 17 comments
Closed

openSearch issue: MSFragger mass calibration + peptideProphet #50

adithirv opened this issue Nov 28, 2019 · 17 comments

Comments

@adithirv
Copy link

Hi MSFragger team,

I am running MSFragger open search using MSFragger-2.2 and philosopher build 20190319. I am getting this message in the fragger run: "Not enough data to perform mass calibration, using the uncalibrated data". Subsequently, the mixture model quality test is failing for all charges in the peptide prophet step. Utimately, the protein prophet does not find any peptide prophet results and fails completely. Would you know what is causing this problem?
My proteomics data were generated using LTQ Orbitrap XL or LTQ FT Ultra mass spectrometer.
I alternatively tried open search using data generated from Q Exactive and this problem was not there. I am attaching my fragger.params and log file.

fragger_params.txt

log-fragpipe-run-at_2019-11-28_13-40-32.log

Thanks in advance
Adithi

@fcyu
Copy link
Member

fcyu commented Nov 28, 2019 via email

@adithirv
Copy link
Author

Hi Fengchao,

Thanks for your reply.
The data were generated from either LTQ Orbitrap XL or LTQ FT. So, I would say they are high resolution MS/MS.

Best
Adithi

@fcyu
Copy link
Member

fcyu commented Nov 28, 2019

Hi Adithi,

Could you please try the following for me:

  1. Set the fragment_mass_tolerance to 300 PPM and re-run MSFragger.
  2. Open your mzML with an editor and look for the tag with accession MS:1000512 from a MS/MS scan. Your file may not contain this tag, so it would be better try the 1st one first.

Thanks,

Fengchao

@anesvi
Copy link
Collaborator

anesvi commented Nov 28, 2019 via email

@adithirv
Copy link
Author

Hi Fengchao,

I tried the first solution that you provided me. It is not not showing any mass calibration errors. But, I run into memory issues now. I will try to fix it and keep you posted if it runs through completely.

@alexey: Ok. Apologies then. I looked up online and asked mass spec experts, both gave me the conclusion that it is high resolution. Does that influence the mass calibration error I am getting?

Best
Adithi

@anesvi
Copy link
Collaborator

anesvi commented Nov 29, 2019 via email

@fcyu
Copy link
Member

fcyu commented Nov 29, 2019

Hi Adithi,

I agree with Alexey that your data has low resolution MS/MS spectra. You may confirm it by looking for the tag with MS:1000512 in your mzML file.

And mass calibration works well with both high and low resolution data as long as you put an appropriate fragment tolerance.

Best,

Fengchao

@fcyu fcyu closed this as completed Nov 29, 2019
@adithirv
Copy link
Author

adithirv commented Dec 3, 2019

Hi Fengchao and Alexey,

Thanks again for your detailed reply and suggestions. You guys provide great and fastest support which makes MSFragger more enjoyable to use.

My data now ran through successfully in both open and closed search with 300 ppm fragment tolerance. I had the mass calibration and downstream error in both the search mode but now they both are solved. Sure, fragment tolerance had to be adjusted. I have previously used MS-GF+ with the same data and a precursor tolerance of 20ppm. With MS-GF+, the fragment tolerance cannot be set. So, I was not aware of a frag. tol. setting that would work. Next, I am going to try closed search with proteogenomics database and looking forward to see how MSFragger performs there.

I checked for the MS:1000512 tag in my mzML files. Indeed, my file has this tag for MS/MS scan.

Thanks once again,
Best
Adithi

@jossmi
Copy link

jossmi commented Jan 13, 2021

Hi MSFragger team,

I am also running into this same issue when doing an open search: I am getting an error of "Not enough data to perform mass calibration. Using the uncalibrated data.". I had tried it at 15 and 20 ppm for the fragment tolerance, but running it at 300 ppm fragment tolerance did not change the error message.

I am running this on a Fusion Lumos, with MS1 Orbitrap resolution of 30,000 and MS2 Orbitrap resolution of 30,000. This seems well within the typical definition of high resolution, but is it not high enough for the mass calibration in MSFragger?

I am running MSFragger 3.1.1 and Philospher 3.3.12 on FragPipe 14.0. I've attached my log and parameters from the 15 ppm run.
fragger_params.txt
log_2020-12-08_00-25-12.txt

Thanks,
Josh

@fcyu
Copy link
Member

fcyu commented Jan 13, 2021

Hi Jsoh,

Does your data have some labelling or PTM enrichment?

Best,

Fengchao

@jossmi
Copy link

jossmi commented Jan 13, 2021

Fengchao,

Thanks for the speedy reply! No, no labeling or enrichment. No reduction or alkylation either (unusual, I know, but that on purpose for our assay). Its an overlapping-window DIA run on trypsin-LysC digest of human serum. The goal is to identify unknown modifications. I've specified some Cys mods as variable modifications that we know could be present.

Josh

@fcyu
Copy link
Member

fcyu commented Jan 13, 2021

Open search is for DDA only.

@anesvi
Copy link
Collaborator

anesvi commented Jan 13, 2021 via email

@jossmi
Copy link

jossmi commented Jan 13, 2021

Fengchao and Alexey,

...I did not know that. Now I feel pretty foolish for having spent so long trying to get this to work with my DIA data. Is that specified anywhere? I might have missed that detail in the papers or documentation.

We are actually only interested in albumin modifications. I've tried it both with the whole human database and with a custom database I generated with Philosopher, using only human albumin, decoys, and contaminants.

So, to clarify, DIA-Umpire can do open searching of my DIA data and identify unknown modifications? What would the pipeline look like in that case - would I still run Crystal-C, PeptideProphet, and ProteinProphet downstream from DIA-Umpire, similar to how I was attempting with MSFragger?

Thanks,
Josh

@fcyu
Copy link
Member

fcyu commented Jan 13, 2021

You need to check 'Enable DIA-Umpire`
image

Then, load 'DIA-Umpire_SpecLib` workflow:
image

Then, change the MSFragger and PeptideProphet setting to open search. You need to uncheck Crystal-C because it will crash with DIA-Umpire's outputted mzML, I think.

Best,

Fengchao

@anesvi
Copy link
Collaborator

anesvi commented Jan 13, 2021 via email

@jossmi
Copy link

jossmi commented Jan 13, 2021

Thanks so much for your help.

Josh

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants