## Comparative Analysis of Root and Leaf Metabolomes in Protium Species

This notebook compares metabolite richness and structural diversity between root and leaf tissues of 31 Protium species using GC-MS data.

In [None]:
# Import necessary libraries
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from scipy.stats import pearsonr

# Load the metabolite data
root_data = pd.read_csv('protium_roots.csv')
leaf_data = pd.read_csv('protium_leaves.csv')

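# Display the first few rows of each table
# (a notebook cell only renders its last expression, so print both explicitly)
print(root_data.head())
print(leaf_data.head())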
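
The later cells rely on the columns `Species` and `Metabolite` being present in both tables. A minimal sketch of a check for this is shown below; the helper name `check_columns` and the toy table are hypothetical and only stand in for the real CSV files.

```python
import pandas as pd

# Columns that the later cells in this notebook read from each table
REQUIRED_COLUMNS = {'Species', 'Metabolite'}


def check_columns(df, name):
    """Hypothetical helper: raise a clear error if a required column is absent."""
    missing = REQUIRED_COLUMNS - set(df.columns)
    if missing:
        raise ValueError(f'{name} is missing columns: {sorted(missing)}')
    return df


# Toy table standing in for protium_roots.csv (values are made up)
toy_roots = pd.DataFrame({
    'Species': ['sp_A', 'sp_A', 'sp_B'],
    'Metabolite': ['m1', 'm2', 'm1'],
})
check_columns(toy_roots, 'toy_roots')
```

Running the same check on `root_data` and `leaf_data` right after loading would surface a renamed or missing column before the `groupby` calls fail with a less direct `KeyError`.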

### Calculate Metabolite Richness and Structural Diversity

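For each species, we count the distinct metabolites detected (richness) and compute a diversity index, separately for roots and leaves. The index used here is the Shannon index over how often each metabolite appears in the table, which serves as a stand-in for structural diversity rather than a direct measure of chemical structure.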

In [None]:
# Calculate richness
root_richness = root_data.groupby('Species')['Metabolite'].nunique()
leaf_richness = leaf_data.groupby('Species')['Metabolite'].nunique()

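# Assumption: structural diversity is approximated by the Shannon diversity index
from scipy.stats import entropy


def calculate_shannon(metabolites):
    # Count how many rows each metabolite has for this species
    counts = metabolites.value_counts()
    # entropy() normalizes the counts to proportions and uses the natural log
    return entropy(counts)

# Apply to the 'Metabolite' column only, so the grouping column is not passed in
root_struct_div = root_data.groupby('Species')['Metabolite'].apply(calculate_shannon)
leaf_struct_div = leaf_data.groupby('Species')['Metabolite'].apply(calculate_shannon)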

# Combine into a summary DataFrame
summary = pd.DataFrame({'Root Richness': root_richness,
                        'Root Structural Diversity': root_struct_div,
                        'Leaf Richness': leaf_richness,
                        'Leaf Structural Diversity': leaf_struct_div})
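# Name the index explicitly so it becomes the 'Species' column used for merging
summary = summary.rename_axis('Species').reset_index()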
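
To make the two measures concrete, here is a small worked sketch on made-up data. The table `toy` and its species and metabolite labels are hypothetical; it only assumes the same long format as above, with one row per detected metabolite.

```python
import numpy as np
import pandas as pd
from scipy.stats import entropy

# Toy long-format table (made-up values)
toy = pd.DataFrame({
    'Species': ['sp_A'] * 4 + ['sp_B'] * 4,
    'Metabolite': ['m1', 'm2', 'm3', 'm4',   # sp_A: four metabolites, once each
                   'm1', 'm1', 'm1', 'm2'],  # sp_B: dominated by m1
})

grouped = toy.groupby('Species')['Metabolite']

# Richness: number of distinct metabolites per species
toy_richness = grouped.nunique()

# Shannon index: entropy of the per-metabolite row counts (natural log)
toy_shannon = grouped.apply(lambda s: entropy(s.value_counts()))

toy_summary = pd.DataFrame({'Richness': toy_richness, 'Shannon': toy_shannon})
print(toy_summary)
```

In this toy case `sp_A` has four equally frequent metabolites, so its Shannon index is ln(4), about 1.386, the maximum for four categories. `sp_B` has both lower richness and a lower index because one metabolite dominates its rows.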

### Correlation Analysis

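We examine the correlation between phylogenetic distance and compound structural diversity in roots and leaves. The code below assumes `protium_phylogeny.csv` holds one `Phylogenetic Distance` value per species.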

In [None]:
# Load phylogenetic distance data
phylo_data = pd.read_csv('protium_phylogeny.csv')

# Merge with summary data (the default inner join keeps species present in both tables)
merged = summary.merge(phylo_data, on='Species')

# Calculate Pearson correlation between phylogenetic distance and each tissue's diversity
corr_root, p_root = pearsonr(merged['Phylogenetic Distance'], merged['Root Structural Diversity'])
corr_leaf, p_leaf = pearsonr(merged['Phylogenetic Distance'], merged['Leaf Structural Diversity'])

# Report rounded values for readability
print(f'Correlation in Roots: {corr_root:.3f}, p-value: {p_root:.3g}')
print(f'Correlation in Leaves: {corr_leaf:.3f}, p-value: {p_leaf:.3g}')
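
Phylogenetic distance is usually defined between pairs of species rather than per species. If the data were available as two square species-by-species distance matrices, one common alternative to the Pearson test above is a Mantel-style permutation test. The sketch below is not the method used in the cell above; the function `mantel_test` and the toy matrices `phylo_dist` and `chem_dist` are hypothetical and built from random numbers.

```python
import numpy as np


def mantel_test(dist_a, dist_b, n_perm=999, seed=0):
    """Hypothetical helper: Pearson correlation between two square distance
    matrices, with a one-sided permutation p-value."""
    dist_a = np.asarray(dist_a, dtype=float)
    dist_b = np.asarray(dist_b, dtype=float)
    n = dist_a.shape[0]
    upper = np.triu_indices(n, k=1)  # each species pair once, no diagonal

    observed = np.corrcoef(dist_a[upper], dist_b[upper])[0, 1]

    rng = np.random.default_rng(seed)
    exceed = 0
    for _ in range(n_perm):
        perm = rng.permutation(n)
        # Relabel species in one matrix: rows and columns move together
        shuffled = dist_a[np.ix_(perm, perm)]
        r = np.corrcoef(shuffled[upper], dist_b[upper])[0, 1]
        if r >= observed:
            exceed += 1

    # Count the observed arrangement itself so the p-value is never zero
    p_value = (exceed + 1) / (n_perm + 1)
    return observed, p_value


# Toy data: distances between random points, plus a noisy copy (made up)
rng = np.random.default_rng(42)
points = rng.normal(size=(8, 2))
phylo_dist = np.linalg.norm(points[:, None] - points[None, :], axis=-1)
noise = rng.normal(scale=0.05, size=phylo_dist.shape)
chem_dist = np.abs(phylo_dist + (noise + noise.T) / 2)  # keep it symmetric
np.fill_diagonal(chem_dist, 0.0)

r, p = mantel_test(phylo_dist, chem_dist)
print(f'Mantel r = {r:.3f}, p = {p:.3f}')
```

Permuting rows and columns together keeps each matrix internally consistent while breaking the pairing between the two, which is why the test does not treat the individual pairwise distances as independent observations.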

### Visualization

We plot phylogenetic distance against compound structural diversity for both tissues on shared axes.

In [None]:
# Plotting
sns.scatterplot(data=merged, x='Phylogenetic Distance', y='Root Structural Diversity', label='Roots')
sns.scatterplot(data=merged, x='Phylogenetic Distance', y='Leaf Structural Diversity', label='Leaves')
plt.title('Phylogenetic Distance vs Compound Structural Diversity')
plt.xlabel('Phylogenetic Distance')
plt.ylabel('Compound Structural Diversity')
plt.legend()
plt.show()

The resulting plot and correlation coefficients show how closely compound structural diversity tracks phylogenetic distance in roots versus leaves.




