Update GNPSExport.cpp #5594

eeko-kon · 2021-10-16T19:37:44Z

I added a few more details on the proposed workflow (command examples) mainly - I think it would be great to add a link to an emptyfile.idXML at the IDMapper Requirements section
"Even in untargeted metabolomics/proteomics, an empty idXML or mzid (peptide annotation format) file is needed as an input which can be found here link/to/emptyfile.idXML.

Description

Please include a summary of the change and which issue is fixed.

Checklist:

Make sure that you are listed in the AUTHORS file
Add relevant changes and new features to the CHANGELOG file
I have commented my code, particularly in hard-to-understand areas
New and existing unit tests pass locally with my changes
Updated or added python bindings for changed or new classes. (Tick if no updates were necessary.)

How can I get additional information on failed tests during CI:

If your PR is failing you can check out

http://cdash.openms.de/index.php?project=OpenMS and look for your PR. If you click in the column that lists the failed tests you will get detailed error messages.

Note:

Once you opened a PR try to minimize the number of pushes to it as every push will trigger CI (automated builds and test) and is rather heavy on our infrastructure (e.g., if several pushes per day are performed).

I added a few more details on the proposed workflow (command examples) mainly - I think it would be great to add a link to an emptyfile.idXML at the IDMapper Requirements section "Even in untargeted metabolomics/proteomics, an empty idXML or mzid (peptide annotation format) file is needed as an input which can be found here link/to/emptyfile.idXML.

timosachsenberg · 2021-10-18T08:00:25Z

src/topp/GNPSExport.cpp

 on the consensusXML file and corresponding mzML files to generate the files needed for FBMN on GNPS.
 These two files are:

 	- The MS/MS spectral data file (.MGF format) which is generated  with the GNPSExport util.
-	- The feature quantification table (.CSV format) which is generated with the TextExport util.
+	- The feature quantification table (.TXT format) which is generated with the TextExport util.


is .txt the correct output format (TextExporter supports tsv, csv and txt)?

When selecting OpenMS as a preprocessing tool in the FBMN workflow (GNPS), the required file format of the Feature Quant table is .txt (TextExporter supports that, yes).

is the .txt output of textexporter compatible with the FBMN workflow? Or expects it a txt file that has the format of the .csv file?

It should be compatible because I have used it repeatedly and it works very well. TextExporter generates a txt file that has the format of a tsv basically! So txt with tab-separated data.

Quoting the documentation from GNPS:

In brief, after running an OpenMS "metabolomics" pipeline, the GNPSExport TOPP tool can be used on the consensusXML file and corresponding mzML files to generate the files needed for FBMN on GNPS. These two files are: The feature quantification table (.TXT format) which is generated with the TextExport tool. The MS2 spectral summary file (.MGF format) which is generated with the GNPSExport tool.

ok great. is there a way we can test that the txt format works?

I literally just saw this comment- sorry! So I have used those txt files generated by GNPSexport (OpenMS) in FBMN-GNPS and it works perfectly - examples:
https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=441a29dc057747f094330148d40493e0
https://gnps.ucsd.edu/ProteoSAFe/status.jsp?task=af126b5cd46840b79acf4b58cece09ec#

src/topp/GNPSExport.cpp

timosachsenberg · 2021-10-18T08:04:52Z

src/topp/GNPSExport.cpp

+  	GNPSExport -ini iniFile-GNPSExport.ini -in_cm filtered.consensusXML -in_mzml inputFile0.mzML inputFile1.mzML -out GNPSExport_output.mgf
+  9. Run the @ref TOPP_TextExporter on the "filtered consensusXML file" to export an .TXT file.
+  	TextExporter -in FileFilter.consensusXML -out FeatureQuantificationTable.txt
+  10. Upload your files to GNPS and run the Feature-Based Molecular Networking workflow. Instructions are here:


is there a way to automatize that? e.g. we could in principle also upload the data from the tool and download results

That's a great idea. Should I ask Ming during GNPS office hours (tomorrow 6 pm)?

timosachsenberg · 2021-10-18T08:05:16Z

src/topp/GNPSExport.cpp

 Requirements:
 	- The IDMapper has to be run on the featureXML files, in order to associate MS2 scan(s) (peptide annotation) with each
-	features. These peptide annotations are used by the GNPSExport.
+	feature, using a peptide annotation file (idXML). Even in untargeted metabolomics/proteomics, an empty idXML or mzid (peptide annotation format) file is needed as an input. 


having the empty idXML seems awkward. Could we make this optional?

it is also confusing that it talks about protein / peptide annotations

Small corrections

timosachsenberg · 2021-10-18T18:02:58Z

rebuild jenkins

timosachsenberg · 2021-12-21T16:20:55Z

@eeko-kon can you check if this is up-to-date with what you are currently doing? I would give it another quick review so we can merge that.

A few changes to make the text clearer

This comment has been minimized.

Sign in to view

timosachsenberg reviewed Oct 18, 2021

View reviewed changes

Update GNPSExport.cpp

ceccab1

Small corrections

eeko-kon and others added 2 commits December 21, 2021 18:15

Update GNPSExport.cpp

3311fd4

A few changes to make the text clearer

Merge remote-tracking branch 'origin/develop' into patch-1

4b3e400

timosachsenberg approved these changes Jan 5, 2022

View reviewed changes

timosachsenberg merged commit 3c38f48 into OpenMS:develop Jan 5, 2022

eeko-kon deleted the patch-1 branch January 5, 2022 20:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update GNPSExport.cpp #5594

Update GNPSExport.cpp #5594

eeko-kon commented Oct 16, 2021

This comment has been minimized.

timosachsenberg Oct 18, 2021

eeko-kon Oct 18, 2021

timosachsenberg Oct 18, 2021

eeko-kon Oct 18, 2021

eeko-kon Oct 18, 2021

timosachsenberg Oct 18, 2021

eeko-kon Nov 15, 2021

timosachsenberg Oct 18, 2021

eeko-kon Nov 15, 2021

timosachsenberg Oct 18, 2021

timosachsenberg Oct 18, 2021

timosachsenberg commented Oct 18, 2021

timosachsenberg commented Dec 21, 2021

Update GNPSExport.cpp #5594

Update GNPSExport.cpp #5594

Conversation

eeko-kon commented Oct 16, 2021

Description

Checklist:

How can I get additional information on failed tests during CI:

Note:

This comment has been minimized.

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

timosachsenberg commented Oct 18, 2021

timosachsenberg commented Dec 21, 2021