# CARA-Metabolomics LC-MS/MS Annotation analysis - HILIC neg

**Author**: Louis-Felix Nothias, Feb 2021

**Objective**: 
- Explore the annotations with feature-based molecular networking (FBMN)
- Check whether SIRIUS annotations are consistent with spectral library matches at the molecular formula (MF) and chemical class levels. The same check can also be run against analogue library matches.

**Additional ideas**

In [1]:
import pandas as pd     
import numpy as np
import altair as alt
from get_stats_annotation import *
from check_annotation import *
from visualize_annotation import *
pd.set_option('mode.chained_assignment', None)

### Prepare input annotation files

In [2]:
# Import the FBMN feature metadata table (one row per feature, indexed by feature ID)
FBMN = pd.read_csv('input/FBMN/HILICneg_feature_metadata.tsv',  sep='\t', index_col=0, header=0, low_memory=False)

### Let's look at the annotation metadata per tool

The annotation metadata originating from a specific tool share the same column prefix:

**GNPS tools**:

- **Molecular networking** (column prefix: `'GNPS_'`).
- **Spectral library search** (column prefix: `'GNPS_LIB_'`).
- **Spectral library search in analogue mode** (column prefix: `'GNPS_LIBA_'`).
- **Passatutto FDR controlled spectral lib match** (column prefix: `'PASSA_FDR_'`).

**SIRIUS tools**:

- **ZODIAC**: Molecular formula annotation (column prefix: `'SIR_MF_Zod'`).
- **CSI:FingerID**: Putative structure annotation (column prefix: `'CSI_'`).
- **CANOPUS**: Putative chemical class annotation (column prefix: `'CAN_'`).
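Because `'GNPS_'` is itself a prefix of `'GNPS_LIB_'` and `'GNPS_LIBA_'`, a plain `startswith('GNPS_')` test also picks up the library-search columns. The following is a minimal sketch of one way to assign each column to its most specific tool. The helper `group_columns_by_prefix` and the toy column names `GNPS_componentindex` and `row m/z` are hypothetical and not part of the notebook's helper modules.

```python
import pandas as pd

# Prefixes as listed above. 'GNPS_' also matches the 'GNPS_LIB_' and
# 'GNPS_LIBA_' columns, so each column goes to its LONGEST matching prefix.
PREFIXES = ['GNPS_', 'GNPS_LIB_', 'GNPS_LIBA_', 'PASSA_FDR_',
            'SIR_MF_Zod', 'CSI_', 'CAN_']

def group_columns_by_prefix(columns, prefixes=PREFIXES):
    """Return a dict mapping each prefix to the columns it owns."""
    groups = {p: [] for p in prefixes}
    groups['other'] = []
    for col in columns:
        matches = [p for p in prefixes if str(col).startswith(p)]
        key = max(matches, key=len) if matches else 'other'
        groups[key].append(str(col))
    return groups

# Toy table: 'GNPS_componentindex' and 'row m/z' are made-up column names.
toy = pd.DataFrame(columns=['GNPS_componentindex', 'GNPS_LIB_MQScore',
                            'GNPS_LIBA_class', 'SIR_MF_Zod_ZodiacScore',
                            'CAN_class', 'row m/z'])
groups = group_columns_by_prefix(toy.columns)
print(groups['GNPS_'])      # only the networking column
print(groups['GNPS_LIB_'])  # only the regular library-search column
```

Picking the longest match keeps the three GNPS groups disjoint, which is convenient when counting annotations per tool.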


In [3]:
#### We can limit to a subset of features

In [4]:
features_of_interest_p_value = pd.read_csv('Input/FBMN/statistical_analysis_results/HILIC_neg.p_value.tsv',  sep='\t', index_col=0, header=0)

print('Features to filter: '+ str(features_of_interest_p_value.shape))
print('Features in the table before: '+ str(FBMN.shape[0]))
FBMN  = FBMN[FBMN.index.isin(features_of_interest_p_value.index)]
print('Features in the table after p value filter: '+ str(FBMN.shape[0]))

Features to filter: (6957, 14)
Features in the table before: 11230
Features in the table after p value filter: 6957
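The filter above keeps only the rows whose index (feature ID) also appears in the p-value table. Here the number of rows after filtering equals the number of features of interest, so every feature of interest was found. As a sketch on toy data (all IDs and values below are made up), the same pattern can also report which requested features are absent from the metadata table, something `isin` alone silently ignores:

```python
import pandas as pd

# Toy feature metadata and toy list of features of interest (made-up values).
feature_table = pd.DataFrame({'GNPS_LIB_MQScore': [0.90, 0.80, 0.70, 0.95]},
                             index=[101, 102, 103, 104])
features_of_interest = pd.DataFrame({'p_value': [0.01, 0.03, 0.04]},
                                    index=[102, 104, 999])

# Keep rows whose feature ID is in the list of interest.
mask = feature_table.index.isin(features_of_interest.index)
filtered = feature_table[mask]

# Feature IDs that were requested but do not exist in the metadata table.
missing = features_of_interest.index.difference(feature_table.index)

print('Kept features:', list(filtered.index))
print('Requested but missing:', list(missing))
```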


In [5]:
# Print the metadata columns of one annotation tool, selected by column prefix.
def show_metadata_tools(table, metadata_prefix):
    """Print the names of all columns in `table` that start with `metadata_prefix`."""
    metadata = [str(x) for x in table.columns if x.startswith(metadata_prefix)]
    print(metadata)

In [6]:
show_metadata_tools(FBMN,'GNPS_LIB_')

['GNPS_LIB_SpectrumID', 'GNPS_LIB_Compound_Name', 'GNPS_LIB_Ion_Source', 'GNPS_LIB_Instrument', 'GNPS_LIB_Compound_Source', 'GNPS_LIB_PI', 'GNPS_LIB_Data_Collector', 'GNPS_LIB_Adduct', 'GNPS_LIB_Precursor_MZ', 'GNPS_LIB_ExactMass', 'GNPS_LIB_Charge', 'GNPS_LIB_CAS_Number', 'GNPS_LIB_Pubmed_ID', 'GNPS_LIB_Smiles', 'GNPS_LIB_INCHI', 'GNPS_LIB_INCHI_AUX', 'GNPS_LIB_Library_Class', 'GNPS_LIB_IonMode', 'GNPS_LIB_UpdateWorkflowName', 'GNPS_LIB_LibraryQualityString', 'GNPS_LIB_SpectrumFile', 'GNPS_LIB_LibraryName', 'GNPS_LIB_MQScore', 'GNPS_LIB_Organism', 'GNPS_LIB_TIC_Query', 'GNPS_LIB_RT_Query', 'GNPS_LIB_MZErrorPPM', 'GNPS_LIB_SharedPeaks', 'GNPS_LIB_MassDiff', 'GNPS_LIB_LibMZ', 'GNPS_LIB_SpecMZ', 'GNPS_LIB_SpecCharge', 'GNPS_LIB_FileScanUniqueID', 'GNPS_LIB_NumberHits', 'GNPS_LIB_tags', 'GNPS_LIB_MoleculeExplorerDatasets', 'GNPS_LIB_MoleculeExplorerFiles', 'GNPS_LIB_InChIKey', 'GNPS_LIB_InChIKey-Planar', 'GNPS_LIB_superclass', 'GNPS_LIB_class', 'GNPS_LIB_subclass']


# Feature-Based Molecular Networking
## General annotation statistics

In [7]:
zodiac_score_thresh=0.7
ionisation_mode='neg'
ppm_error=25
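The `ppm_error` parameter is a mass tolerance in parts per million. As a reminder, the relative mass error between an observed and a reference m/z is conventionally computed as `(observed - reference) / reference * 1e6`. The sketch below illustrates that arithmetic; the function names are hypothetical, and whether the helper modules apply the tolerance in exactly this way (for example, on the absolute value) is an assumption.

```python
def mass_error_ppm(observed_mz, reference_mz):
    """Signed mass error in parts per million."""
    return (observed_mz - reference_mz) / reference_mz * 1e6

def within_tolerance(observed_mz, reference_mz, max_ppm=25):
    """True if the absolute ppm error does not exceed max_ppm (assumed inclusive)."""
    return abs(mass_error_ppm(observed_mz, reference_mz)) <= max_ppm

# A 0.003 Da deviation at m/z 300 corresponds to about 10 ppm.
print(round(mass_error_ppm(300.003, 300.0), 3))
print(within_tolerance(300.003, 300.0))  # about 10 ppm, inside a 25 ppm window
print(within_tolerance(300.009, 300.0))  # about 30 ppm, outside a 25 ppm window
```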

In [8]:
get_stats_annotation(FBMN, zodiac_score_thresh, ionisation_mode, ppm_error)

Features = 6957
 
==== GNPS =====
In networks = 1884
Number of networks = 396
Valid library annotations = 263
Library annotations in analogue mode= 658
PASSATUTTO FDR-controlled library annotations = 80
PASSATUTTO FDR-controlled library annotations at 20% FDR = 10
PASSATUTTO FDR-controlled library annotations at 10% FDR = 0
 
==== SIRIUS =====
Features with SIRIUS annotation = 3714
SIRIUS ZODIAC MF with ZodiacScore > 0.7 = 2200
CSIFingerID annotations = 198
CANOPUS annotations = 335
CANOPUS annotations at the subclass level= 330
CANOPUS annotations at the subclass level= 313
CANOPUS annotations at the level 5 = 258
 
==== General annotation statistics =====
Number of features = 6957
Annotated features = 3774
Annotated features or in network = 3955
Single nodes = 5073
Single nodes and unnnannotated = 3020
 


### View all annotations

In [9]:
get_stats_annotation.final_table_rel
get_stats_annotation.final_table

| | Annotation tool | Count |
|---|---|---|
| 0 | Features | 6957 |
| 1 | GNPS - in networks | 1884 |
| 2 | GNPS - lib. match | 263 |
| 3 | GNPS - lib. match analogue | 658 |
| 4 | PASSATUTTO FDR 20% | 10 |
| 5 | PASSATUTTO FDR 10% | 0 |
| 6 | SIRIUS - Annotated features | 3714 |
| 7 | SIRIUS - MF with ZodScore >0.7 | 2200 |
| 8 | SIRIUS - structure | 198 |
| 9 | SIRIUS - chemical class | 335 |


In [10]:
make_barchart(get_stats_annotation.final_table) # Absolute count
make_barchart_rel(get_stats_annotation.final_table_rel)

### View GNPS annotations

In [11]:
make_barchart(get_stats_annotation.table_gnps) # Absolute count
make_barchart_rel(get_stats_annotation.table_gnps_rel)

### View SIRIUS annotations

In [12]:
make_barchart(get_stats_annotation.table_sirius) # Absolute count
make_barchart_rel(get_stats_annotation.table_sirius_rel)

# Look at GNPS/SIRIUS annotation consistency

In [13]:
check_matching_annotations(FBMN, zodiac_score_thresh, ionisation_mode='neg', library_mode = 'regul', canopus_level= 'spec', 
                           cosine=0.7, shared_peaks=6, ppm_error=25)

=== Looking at match between GNPS library in REGULAR mode and SIRIUS annotation ===
Usable GNPS/SIRIUS annotations = 62
Usable GNPS/SIRIUS annot. w. ZodiacScore > 0.7 = 62
Check with CANOPUS SPECIFIC classification levels
 
MF match = 56
MF match score = 54
 
Classified pairs considered = 10
Superclass annotation pairs = 10
Superclass match all = 10, 1.00%
Class annotation pairs = 10
Class match = 10, 1.00%
Subclass annotation pairs = 10
Subclass match all = 10, 1.00%
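The call above passes three quality thresholds for the library matches: `cosine=0.7`, `shared_peaks=6` and `ppm_error=25`. The sketch below shows how such thresholds could be applied to the GNPS library columns. It is not the implementation of `check_matching_annotations`: the mapping of parameters to the columns `GNPS_LIB_MQScore`, `GNPS_LIB_SharedPeaks` and `GNPS_LIB_MZErrorPPM`, the inclusive comparisons, and the function name `filter_library_matches` are all assumptions.

```python
import pandas as pd

def filter_library_matches(table, cosine=0.7, shared_peaks=6, ppm_error=25):
    """Keep library matches passing all three thresholds (assumed inclusive)."""
    keep = (
        (table['GNPS_LIB_MQScore'] >= cosine)
        & (table['GNPS_LIB_SharedPeaks'] >= shared_peaks)
        & (table['GNPS_LIB_MZErrorPPM'].abs() <= ppm_error)
    )
    return table[keep]

# Features 9782 and 7794 use values shown later in this notebook;
# features 1 and 2 are made-up rows that each fail one threshold.
toy = pd.DataFrame(
    {
        'GNPS_LIB_MQScore': [0.875271, 0.731953, 0.65, 0.90],
        'GNPS_LIB_SharedPeaks': [9, 9, 8, 4],
        'GNPS_LIB_MZErrorPPM': [7.15122, 21.5932, 3.0, 2.0],
    },
    index=[9782, 7794, 1, 2],
)
usable = filter_library_matches(toy)
print(list(usable.index))  # features passing cosine, shared-peak and ppm filters
```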


### Molecular formula annotation consistency between GNPS/SIRIUS

In [14]:
check_matching_annotations.table_matching

| | Matching level | Count | Relative |
|---|---|---|---|
| 0 | Usable MF pairs | 62 | 1.05 |
| 1 | Usable MF pairs w. ZodiacScore>0.7 | 59 | 1.0 |
| 2 | Matching molecular formula | 56 | 0.95 |
| 3 | Matching molecular w. ZodiacScore>0.7 | 54 | 0.92 |
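The `Relative` column is consistent with each count being divided by the number of usable pairs with ZodiacScore > 0.7 (59), which is why the first row exceeds 1. This is inferred from the numbers shown rather than from the helper's source code. A short sketch reproducing the column under that assumption:

```python
import pandas as pd

# Counts copied from the table above.
table_matching = pd.DataFrame({
    'Matching level': ['Usable MF pairs',
                       'Usable MF pairs w. ZodiacScore>0.7',
                       'Matching molecular formula',
                       'Matching molecular w. ZodiacScore>0.7'],
    'Count': [62, 59, 56, 54],
})

# Assumed reference: the pairs that pass the ZodiacScore threshold.
is_reference = table_matching['Matching level'] == 'Usable MF pairs w. ZodiacScore>0.7'
reference = table_matching.loc[is_reference, 'Count'].iloc[0]

table_matching['Relative'] = (table_matching['Count'] / reference).round(2)
print(table_matching)
```

The same division by the thresholded row (10) reproduces the 1.1 shown for the 11 available pairs in the class-level table.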


In [15]:
make_barchart_match(check_matching_annotations.table_matching) ### This one for absolute values
make_barchart_match_rel(check_matching_annotations.table_matching)

### Chemical class annotation consistency between GNPS-SIRIUS/CANOPUS

In [16]:
check_matching_annotations.table_class_matching

| | Matching level | Count | Relative |
|---|---|---|---|
| 0 | Available pairs | 11 | 1.1 |
| 1 | Classified pairs w. ZodiacScore>0.7 | 10 | 1.0 |
| 2 | Matching superclass | 10 | 1.0 |
| 3 | Matching class | 10 | 1.0 |
| 4 | Matching subclass | 10 | 1.0 |


In [17]:
check_matching_annotations.table_class_matching
make_barchart_match(check_matching_annotations.table_class_matching) ### This one for absolute values
make_barchart_match_rel(check_matching_annotations.table_class_matching)

# Additional, detailed views (optional)
## The views below show the distribution of correct/incorrect annotations

### View Molecular Formula (only for REGULAR library search)

In [18]:
dist_plot(check_matching_annotations.MF_pairs,'MF_match', zodiac_score_thresh)

### View classification results

In [19]:
# Superclass level
#dist_plot(check_matching_annotations.superclass_match_all_total,'Match_GNPSsuperclass-SIRIUS',zodiac_score_thresh)
dist_plot(check_matching_annotations.class_match_all_total,'Match_GNPSclass-SIRIUS',zodiac_score_thresh)
#dist_plot(check_matching_annotations.subclass_match_all_total,'Match_GNPSsubclass-SIRIUS',zodiac_score_thresh)

### View details of incorrect MF annotations (only for REGULAR library search)

In [20]:
# This is used to display the entire table in the notebook
from IPython.display import display, HTML
show_non_matching_MF = check_matching_annotations.MF_no_match[['SIR_MF_Zod_ZodiacScore','GNPS_LIB_INCHI_MF','SIR_MF_Zod_molecularFormula',
                                                               'GNPS_LIB_Adduct','SIR_MF_Zod_adduct',
                                                               'GNPS_LIB_superclass','CAN_superclass',
                                                               'GNPS_LIB_class','CAN_class',
                                                               'GNPS_LIB_MQScore', 'GNPS_LIB_MZErrorPPM', 'GNPS_LIB_SharedPeaks', 'GNPS_LIB_SpecCharge','GNPS_LIB_SpecMZ']]
show_non_matching_MF.sort_values(['GNPS_LIB_SpecMZ','SIR_MF_Zod_ZodiacScore'], inplace=True, ascending = (False, False))

display(HTML(show_non_matching_MF.to_html()))

| #featureID | SIR_MF_Zod_ZodiacScore | GNPS_LIB_INCHI_MF | SIR_MF_Zod_molecularFormula | GNPS_LIB_Adduct | SIR_MF_Zod_adduct | GNPS_LIB_superclass | CAN_superclass | GNPS_LIB_class | CAN_class | GNPS_LIB_MQScore | GNPS_LIB_MZErrorPPM | GNPS_LIB_SharedPeaks | GNPS_LIB_SpecCharge | GNPS_LIB_SpecMZ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9782 | 0.903 | C41H78NO8P | C43H73N3O7 | M-H | [M - H]- | Lipids and lipid-like molecules | | Glycerophospholipids | | 0.875271 | 7.15122 | 9.0 | 1.0 | 742.544 |
| 7794 | 0.672 | C16H19N3O5 | C10H19N7O6 | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.731953 | 21.5932 | 9.0 | 1.0 | 332.132 |
| 2517 | 0.798 | C16H17N3O4 | C13H18FN3O5 | M-H | [M - H]- | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | 0.928006 | 2.81747 | 9.0 | 1.0 | 314.114 |
| 4599 | 0.923 | C15H18N4O3 | C12H20N3O6 | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.817218 | 5.3712 | 8.0 | 1.0 | 301.129 |
| 2602 | 0.996 | C13H16N2O5 | C11H16BN3O3S | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.860339 | 4.70176 | 10.0 | 1.0 | 279.097 |
| 6255 | 0.742 | C9H16N2O5S | C7H12F2N6O4 | M-H | [M - H2O - H]- | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | 0.899743 | 1.50807 | 7.0 | 1.0 | 263.07 |
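Each row pairs the formula derived from the GNPS library hit (`GNPS_LIB_INCHI_MF`) with the ZODIAC formula (`SIR_MF_Zod_molecularFormula`). A plain string comparison works when both formulas use the same element order, but it would flag `C2H6O` and `H6C2O` as different. The sketch below compares element counts instead. It is an illustration only, not the comparison used in `check_annotation`; the helper names are hypothetical and the parser ignores parentheses, charges and isotope labels.

```python
import re
from collections import Counter

def parse_formula(formula):
    """Count atoms per element in a simple formula such as 'C16H19N3O5'."""
    counts = Counter()
    for element, number in re.findall(r'([A-Z][a-z]?)(\d*)', formula):
        counts[element] += int(number) if number else 1
    return counts

def same_formula(formula_a, formula_b):
    """True if both formulas contain the same atoms, regardless of order."""
    return parse_formula(formula_a) == parse_formula(formula_b)

# Feature 9782 from the table above: the two tools disagree.
print(same_formula('C41H78NO8P', 'C43H73N3O7'))
# Feature 2505 from the table of matching formulas: the two tools agree.
print(same_formula('C16H19N3O5', 'C16H19N3O5'))
# Made-up example: same atoms written in a different order.
print(same_formula('C2H6O', 'H6C2O'))
```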


### View details of correct MF annotations (only for REGULAR library search)

In [21]:
# This is used to display the entire table in the notebook
from IPython.display import display, HTML
show_matching_MF = check_matching_annotations.MF_match[['SIR_MF_Zod_ZodiacScore','GNPS_LIB_INCHI_MF','SIR_MF_Zod_molecularFormula',
                                                               'GNPS_LIB_Adduct','SIR_MF_Zod_adduct',
                                                               'GNPS_LIB_superclass','CAN_superclass',
                                                               'GNPS_LIB_class','CAN_class',
                                                               'GNPS_LIB_MQScore', 'GNPS_LIB_MZErrorPPM', 'GNPS_LIB_SharedPeaks', 'GNPS_LIB_SpecCharge','GNPS_LIB_SpecMZ']]
show_matching_MF.sort_values(['GNPS_LIB_SpecMZ','SIR_MF_Zod_ZodiacScore'], inplace=True, ascending = (False, False))

display(HTML(show_matching_MF.to_html()))

| #featureID | SIR_MF_Zod_ZodiacScore | GNPS_LIB_INCHI_MF | SIR_MF_Zod_molecularFormula | GNPS_LIB_Adduct | SIR_MF_Zod_adduct | GNPS_LIB_superclass | CAN_superclass | GNPS_LIB_class | CAN_class | GNPS_LIB_MQScore | GNPS_LIB_MZErrorPPM | GNPS_LIB_SharedPeaks | GNPS_LIB_SpecCharge | GNPS_LIB_SpecMZ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5640 | 1.0 | C12H24O11 | C12H24O11 | M+Cl | [M + Cl]- | Lipids and lipid-like molecules | | Fatty Acyls | | 0.926844 | 6.60098 | 9.0 | 1.0 | 379.099 |
| 2396 | 1.0 | C12H24O11 | C12H24O11 | M-H | [M - H]- | Lipids and lipid-like molecules | | Fatty Acyls | | 0.770627 | 5.24747 | 8.0 | 1.0 | 343.126 |
| 2505 | 0.999 | C16H19N3O5 | C16H19N3O5 | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.782125 | 5.05372 | 11.0 | 1.0 | 332.127 |
| 2759 | 0.998 | C16H19N3O5 | C16H19N3O5 | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.797983 | 0.918858 | 10.0 | 1.0 | 332.125 |
| 2204 | 1.0 | C15H17N3O5 | C15H17N3O5 | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.836371 | 3.74143 | 7.0 | 1.0 | 318.108 |
| 5194 | 1.0 | C15H18N4O4 | C15H18N4O4 | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.81888 | 4.04174 | 8.0 | 1.0 | 317.124 |
| 2076 | 1.0 | C14H18N2O6 | C14H18N2O6 | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.905501 | 3.55419 | 11.0 | 1.0 | 309.108 |
| 4148 | 0.631 | C10H17N3O6S | C10H17N3O6S | M-H | [M - H]- | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | 0.820715 | 2.29323 | 9.0 | 1.0 | 306.076 |
| 2578 | 1.0 | C13H16N2O6 | C13H16N2O6 | M-H | [M - H]- | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | 0.817214 | 4.03325 | 8.0 | 1.0 | 295.092 |
| 2156 | 1.0 | C15H22N2O4 | C15H22N2O4 | M-H | [M - H]- | Organic acids and derivatives | | Carboxylic acids and derivatives | | 0.813241 | 3.43537 | 7.0 | 1.0 | 293.15 |


### View details of incorrect/correct class annotations (valid only for REGULAR library search)

In [22]:
from IPython.display import display, HTML
show_matching_class = check_matching_annotations.class_match_all_total[[
                                                               'GNPS_LIB_superclass','CAN_superclass',
                                                               'GNPS_LIB_class','CAN_class',
                                                               'GNPS_LIB_subclass','CAN_subclass',
                                                               'SIR_MF_Zod_ZodiacScore','SIR_MF_Zod_molecularFormula',
                                                               'GNPS_LIB_Adduct','SIR_MF_Zod_adduct',
                                                               'GNPS_LIB_MQScore', 'GNPS_LIB_MZErrorPPM', 'GNPS_LIB_SharedPeaks',
                                                               'GNPS_LIB_SpecMZ','Match_GNPSsuperclass-SIRIUS',
                                                               'Match_GNPSclass-SIRIUS','Match_GNPSsubclass-SIRIUS']]
show_matching_class.sort_values(['Match_GNPSsuperclass-SIRIUS','Match_GNPSclass-SIRIUS','Match_GNPSsubclass-SIRIUS',
                                 'GNPS_LIB_SpecMZ','SIR_MF_Zod_ZodiacScore'], inplace=True, ascending = (False,False,False,False, False))

display(HTML(show_matching_class.to_html()))

| #featureID | GNPS_LIB_superclass | CAN_superclass | GNPS_LIB_class | CAN_class | GNPS_LIB_subclass | CAN_subclass | SIR_MF_Zod_ZodiacScore | SIR_MF_Zod_molecularFormula | GNPS_LIB_Adduct | SIR_MF_Zod_adduct | GNPS_LIB_MQScore | GNPS_LIB_MZErrorPPM | GNPS_LIB_SharedPeaks | GNPS_LIB_SpecMZ | Match_GNPSsuperclass-SIRIUS | Match_GNPSclass-SIRIUS | Match_GNPSsubclass-SIRIUS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2517 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 0.798 | C13H18FN3O5 | M-H | [M - H]- | 0.928006 | 2.81747 | 9.0 | 314.114 | yes | yes | yes |
| 2578 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C13H16N2O6 | M-H | [M - H]- | 0.817214 | 4.03325 | 8.0 | 295.092 | yes | yes | yes |
| 6255 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 0.742 | C7H12F2N6O4 | M-H | [M - H2O - H]- | 0.899743 | 1.50807 | 7.0 | 263.07 | yes | yes | yes |
| 2144 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C9H16N2O6 | M-H | [M - H]- | 0.871602 | 3.64343 | 9.0 | 247.093 | yes | yes | yes |
| 2675 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 0.939 | C10H16N2O5 | M-H | [M - H]- | 0.911643 | 5.77465 | 7.0 | 243.097 | yes | yes | yes |
| 2555 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C7H12N2O6 | M-H | [M - H]- | 0.961894 | 0.905515 | 7.0 | 219.062 | yes | yes | yes |
| 3355 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C9H17NO5 | M-H | [M - H]- | 0.954453 | 1.81899 | 7.0 | 218.103 | yes | yes | yes |
| 2252 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C7H12N2O5 | M-H | [M - H]- | 0.967416 | 3.9825 | 8.0 | 203.066 | yes | yes | yes |
| 1717 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C6H12N2O5 | M-H | [M - H]- | 0.975915 | 0.559026 | 7.0 | 191.067 | yes | yes | yes |
| 4067 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C7H11NO5 | M-H | [M - H]- | 0.92544 | 0.486837 | 8.0 | 188.056 | yes | yes | yes |


### View details of correct/incorrect class annotations (valid only for ANALOGUE library search)

In [23]:
### View details of correct/incorrect class annotations (only for ANALOGUE library search)
from IPython.display import display, HTML
show_matching_class = check_matching_annotations.class_match_all_total[[
                                                               'GNPS_LIBA_superclass','CAN_superclass',
                                                               'GNPS_LIBA_class','CAN_class',
                                                               'GNPS_LIBA_subclass','CAN_subclass',
                                                               'SIR_MF_Zod_ZodiacScore','SIR_MF_Zod_molecularFormula',
                                                               'GNPS_LIB_Adduct','SIR_MF_Zod_adduct',
                                                               'GNPS_LIB_MQScore', 'GNPS_LIB_MZErrorPPM', 'GNPS_LIB_SharedPeaks',
                                                               'GNPS_LIB_SpecMZ','Match_GNPSsuperclass-SIRIUS',
                                                               'Match_GNPSclass-SIRIUS','Match_GNPSsubclass-SIRIUS']]
show_matching_class.sort_values(['Match_GNPSsuperclass-SIRIUS','Match_GNPSclass-SIRIUS','Match_GNPSsubclass-SIRIUS',
                                 'GNPS_LIB_SpecMZ','SIR_MF_Zod_ZodiacScore'], inplace=True, ascending = (False,False,False,False, False))

display(HTML(show_matching_class.to_html()))

| #featureID | GNPS_LIBA_superclass | CAN_superclass | GNPS_LIBA_class | CAN_class | GNPS_LIBA_subclass | CAN_subclass | SIR_MF_Zod_ZodiacScore | SIR_MF_Zod_molecularFormula | GNPS_LIB_Adduct | SIR_MF_Zod_adduct | GNPS_LIB_MQScore | GNPS_LIB_MZErrorPPM | GNPS_LIB_SharedPeaks | GNPS_LIB_SpecMZ | Match_GNPSsuperclass-SIRIUS | Match_GNPSclass-SIRIUS | Match_GNPSsubclass-SIRIUS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2517 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 0.798 | C13H18FN3O5 | M-H | [M - H]- | 0.928006 | 2.81747 | 9.0 | 314.114 | yes | yes | yes |
| 2578 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C13H16N2O6 | M-H | [M - H]- | 0.817214 | 4.03325 | 8.0 | 295.092 | yes | yes | yes |
| 6255 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 0.742 | C7H12F2N6O4 | M-H | [M - H2O - H]- | 0.899743 | 1.50807 | 7.0 | 263.07 | yes | yes | yes |
| 2144 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C9H16N2O6 | M-H | [M - H]- | 0.871602 | 3.64343 | 9.0 | 247.093 | yes | yes | yes |
| 2675 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 0.939 | C10H16N2O5 | M-H | [M - H]- | 0.911643 | 5.77465 | 7.0 | 243.097 | yes | yes | yes |
| 2555 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C7H12N2O6 | M-H | [M - H]- | 0.961894 | 0.905515 | 7.0 | 219.062 | yes | yes | yes |
| 3355 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C9H17NO5 | M-H | [M - H]- | 0.954453 | 1.81899 | 7.0 | 218.103 | yes | yes | yes |
| 2252 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C7H12N2O5 | M-H | [M - H]- | 0.967416 | 3.9825 | 8.0 | 203.066 | yes | yes | yes |
| 1717 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C6H12N2O5 | M-H | [M - H]- | 0.975915 | 0.559026 | 7.0 | 191.067 | yes | yes | yes |
| 4067 | Organic acids and derivatives | Organic acids and derivatives | Carboxylic acids and derivatives | Carboxylic acids and derivatives | Amino acids, peptides, and analogues | Amino acids, peptides, and analogues | 1.0 | C7H11NO5 | M-H | [M - H]- | 0.92544 | 0.486837 | 8.0 | 188.056 | yes | yes | yes |
