Spectrum Plots by axelwalter · Pull Request #16 · OpenMS/pyopenms_viz

axelwalter · 2024-07-18T12:52:46Z

Added the full functionality for spectrum plots.

Due to time constraints there are no example notebooks yet (will be in a separate PR). However, I included a temporary streamlit app file showcasing everything (app-devel-spectrum.py).

Some questions / discussion points remain:

- new functionality to plot heatmaps in scatter (needed in feature heatmap, ion mobility spectrum, peak map...)
  - any comments on the implementation welcome, since I am not sure how this integrates with user passing backend specific parameters
- changed parameter names x to mz and y to intensity for spectra because of ion mobility plots, should it stay x and y for consistency with others?
- need help with Bokeh tooltips (works with ion mobility, but not with vline spectra)
- matplotlib grid lines on top of scatter in ion mobility plot
- how to update global settings (e.g. axis labels): setting self.xlabel = "new label" in plot methods does not transfer to _create_figure method; want to automatically set default axis labels if ion mobility is set

singjc · 2024-07-19T01:47:47Z

new functionality to plot heatmaps in scatter (needed in feature heatmap, ion mobility spectrum, peak map...)
any comments on the implementation welcome, since I am not sure how this integrates with user passing backend specific parameters

I'm not too sure I understand what is needed? Do we need to implement a different kind of heatmap plot?

changed parameter names x to mz and y to intensity for spectra because of ion mobility plots, should it stay x and y for consistency with others?

I personally would prefer to leave it as x and y for consistency with the other kinds of plots, when using the plot method. Otherwise, it could get confusing knowing which to use when wanting a different kind of plot?

I was looking into including a higher-level plot method for each specific kind, i.e. df.plot_spectrum, df.plot_chromatogram, etc. With these we could then do df.plot_spectrum(mz = "mz", intensity = "int"), df.plot_chromatogram(rt = "rt", intensity = "int"). Pandas can be extended with additional accessors, which is how we can attach these higher-level plot methods to the dataframe object. Still currently looking into this to see how easy/feasible it is.

See: https://pandas.pydata.org/pandas-docs/stable/development/extending.html
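
A minimal sketch of the accessor idea, for illustration only. The accessor name `ms_sketch`, the class name, and the return values are assumptions and not part of pyopenms_viz; the methods only map the domain-specific names onto generic x/y arguments and do not draw anything. `pandas.api.extensions.register_dataframe_accessor` is the extension point described on the linked page.

```python
import pandas as pd


# Hypothetical accessor name; not part of pyopenms_viz.
@pd.api.extensions.register_dataframe_accessor("ms_sketch")
class MassSpecAccessorSketch:
    def __init__(self, pandas_obj: pd.DataFrame):
        self._df = pandas_obj

    def _resolve(self, kind: str, x: str, y: str) -> dict:
        # Fail early if a requested column is missing.
        missing = [col for col in (x, y) if col not in self._df.columns]
        if missing:
            raise KeyError(f"columns not found: {missing}")
        # A real implementation would forward these to the generic plot call.
        return {"kind": kind, "x": x, "y": y}

    def plot_spectrum(self, mz: str = "mz", intensity: str = "int") -> dict:
        return self._resolve("spectrum", x=mz, y=intensity)

    def plot_chromatogram(self, rt: str = "rt", intensity: str = "int") -> dict:
        return self._resolve("chromatogram", x=rt, y=intensity)


df = pd.DataFrame({"mz": [100.0, 200.0], "rt": [1.0, 2.0], "int": [10.0, 50.0]})
spectrum_args = df.ms_sketch.plot_spectrum(mz="mz", intensity="int")
```

With this shape, the generic call can keep x and y while the higher-level methods carry the domain names.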

need help with Bokeh tooltips (works with ion mobility, but not with vline spectra)

I can try to look into this. Did you use the new app-devel-spectrum.py to test for this?

matplotlib grid lines on top of scatter in ion mobility plot

Do we need to draw the grid lines before drawing the scatter points?

how to update global settings (e.g. axis labels) setting self.xlabel = "new label" in plot methods does not transfer to _create_figure method: want to automatically set default axis labels if ion mobility is set

The base plot config is used to update all the defaults if a specific parameter (e.g. xlabel) is not passed directly to the plot method.

So if df.plot("rt", "int", "chromatogram", xlabel = "RT"), then self.xlabel should be "RT", otherwise if df.plot("rt", "int", "chromatogram") then self.xlabel should be "X-axis", because it gets updated by the config.

What we could do is add a mapping of default axis labels for each kind of plot, and then pass kind to the _BasePlotConfig to configure the default values for axis labels and other params that are plot specific.
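
A rough sketch of such a mapping, under stated assumptions: the class name `PlotConfigSketch`, the kind names, and the label strings are made up for illustration and do not reflect the actual _BasePlotConfig. Explicitly passed labels win; otherwise the kind-specific default is used, and unknown kinds fall back to the generic labels.

```python
from dataclasses import dataclass
from typing import Optional

# Illustrative defaults only; kind names and labels are assumptions.
DEFAULT_AXIS_LABELS = {
    "chromatogram": ("Retention time", "Intensity"),
    "spectrum": ("m/z", "Intensity"),
    "peakmap": ("Retention time", "m/z"),
}


@dataclass
class PlotConfigSketch:
    kind: str
    xlabel: Optional[str] = None
    ylabel: Optional[str] = None

    def __post_init__(self):
        # Look up kind-specific defaults, falling back to generic labels.
        default_x, default_y = DEFAULT_AXIS_LABELS.get(
            self.kind, ("X-axis", "Y-axis")
        )
        # Only fill in labels the caller did not pass explicitly.
        if self.xlabel is None:
            self.xlabel = default_x
        if self.ylabel is None:
            self.ylabel = default_y


explicit = PlotConfigSketch(kind="chromatogram", xlabel="RT")
implicit = PlotConfigSketch(kind="chromatogram")
```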

singjc

I briefly skimmed, but everything looks fine to me. I was wondering about two things though:

1. When a Spectrum plot is requested, and ion_mobility is set, then it generates an ion mobility vs m/z PeakMap. I think this can already be done by using the feature_heatmap (which we can rename to PeakMap) kind of plot instead? I just think it might be confusing, if you're asking for a spectrum plot, but provide an ion mobility column to generate a PeakMap plot?
2. For the scatter plot marker cycler, if we use by to plot by a grouped df, I'm not sure how this will look for a RT vs IM peak map, but I think it might look weird with all the different markers? The RT vs IM peak map is more dense than a MZ vs IM peak map, so all the different markers might make the RT vs IM peak map hard to visualize.

There is a README in test/test_data that is still in the pyopenms_viz folder; I don't think it got moved with the rest of test/test_data to the main project folder for some reason. Can you add some info in that README about the two new test tsv files, just so we know how they were generated / where they come from?

@jcharkow may have more suggestions / comments.

singjc · 2024-07-19T02:00:52Z

pyopenms_viz/_core.py

+        mz: str,
+        intensity: str,


I would prefer to keep these as x and y

Yes, that's what I would prefer too. I just did that for the ion mobility option to make things clear. Actually, I really like the suggestion to rename the FeatureHeatmapPlot to a generic PeakMap which can plot ion mobility spectra, MS experiments, feature heatmaps etc. Then we could keep it simple with x and y, and the PeakMap would have an optional "intensity".

Yes, I would keep "x" and "y" too. For 3D maps I would use "z" instead of intensity as well.

singjc · 2024-07-19T02:04:49Z

pyopenms_viz/_core.py

+        if self.ion_mobility is None:
+            self.plot(spectrum, reference_spectrum, mz, intensity, **kwargs)
+        else:
+            self.plot_ion_mobility(spectrum, mz, ion_mobility, intensity, **kwargs)


Couldn't this be done with the FeatureHeatmap (which we can rename to PeakMap) Plot? i.e. df.plot(kind='feature_heatmap', x='mz', y='im')?

The reason for implementing it in scatter was to plot universal peak maps with scatter. But renaming the FeatureHeatMap to PeakMap would be very nice and clear.

singjc · 2024-07-19T02:10:22Z

pyopenms_viz/_core.py

-    def plot(self, x, y, **kwargs):
+    def plot(self, spectrum, reference_spectrum, x, y, **kwargs):
+        """Standard spectrum plot with m/z on x-axis, intensity on y-axis and optional mirror spectrum."""
+        kwargs.pop("fig", None)  # remove figure from **kwargs if exists


Why does this occur?

Passing the spectrum and reference_spectrum? I modify them with the prepare_data method (relative intensities etc.) before calling either the normal spectrum plot method or the ion mobility spectrum plot.
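
For readers unfamiliar with the term, a small sketch of what a relative-intensity step could look like. This is an assumption about the general technique (scaling to percent of the most intense peak), not the actual prepare_data code; the function name is hypothetical.

```python
def to_relative_intensity(intensities):
    """Scale intensities so that the largest value becomes 100 (percent)."""
    if not intensities:
        return []
    max_intensity = max(intensities)
    if max_intensity == 0:
        # Avoid division by zero for an all-zero spectrum.
        return [0.0 for _ in intensities]
    return [value / max_intensity * 100 for value in intensities]


relative = to_relative_intensity([50.0, 200.0, 100.0])
```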

axelwalter · 2024-07-19T09:28:40Z

I briefly skimmed, but everything looks fine to me. I was wondering about two things though:

1. When a Spectrum plot is requested, and ion_mobility is set, then it generates an ion mobility vs m/z PeakMap. I think this can already be done by using the feature_heatmap (which we can rename to PeakMap) kind of plot instead? I just think it might be confusing, if you're asking for a spectrum plot, but provide an ion mobility column to generate a PeakMap plot?

Yes, best solution!

2. For the scatter plot marker cycler, if we use `by` to plot by a grouped df, I'm not sure how this will look for a RT vs IM peak map, but I think it might look weird with all the different markers? The RT vs IM peak map is more dense than a MZ vs IM peak map, so all the different markers might make the RT vs IM peak map hard to visualize.

Hm, it is optional, and in such a case the user doesn't need to use "by". It looks nice for scatter plots which are not too crowded. I get your concern but would leave it to the user to choose the right settings which look good.
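
To make the marker cycling concrete, a minimal sketch using `itertools.cycle`. The marker lists and the function name are illustrative assumptions; the shapes actually used per backend in this PR may differ.

```python
from itertools import cycle

# Illustrative marker names per backend; the lists used in the PR may differ.
MARKERS = {
    "matplotlib": ["o", "s", "^", "D"],
    "bokeh": ["circle", "square", "triangle", "diamond"],
    "plotly": ["circle", "square", "triangle-up", "diamond"],
}


def marker_cycle(engine):
    """Return an endless iterator over marker shapes for the given backend."""
    if engine not in MARKERS:
        raise ValueError(f"unknown plotting engine: {engine!r}")
    return cycle(MARKERS[engine])


# One marker per group; zip stops once the groups are exhausted.
groups = ["a", "b", "c", "d", "e"]
group_markers = dict(zip(groups, marker_cycle("matplotlib")))
```

With more groups than shapes the markers repeat, which is one reason a dense grouped plot can become hard to read.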

axelwalter · 2024-07-19T09:40:34Z

new functionality to plot heatmaps in scatter (needed in feature heatmap, ion mobility spectrum, peak map...)
any comments on the implementation welcome, since I am not sure how this integrates with user passing backend specific parameters

I'm not too sure I understand what is needed? Do we need to implement a different kind of heatmap plot?

The question was how the user can modify the markers in the current setup: when we create a marker_dict for the heatmap, it is more or less pre-defined. One way would be to update the marker_dict with values from a custom marker_dict passed by the user. But this is probably not very important in real use cases.

changed parameter names x to mz and y to intensity for spectra because of ion mobility plots, should it stay x and y for consistency with others?

I personally would prefer to leave it as x and y for consistency with the other kinds of plots, when using the plot method. Otherwise, it could get confusing knowing which to use when wanting a different kind of plot?

True, and if we plot ion mobility spectra as peak maps that will be perfectly fine.

I was looking into including a higher-level plot method for each specific kind, i.e. df.plot_spectrum, df.plot_chromatogram, etc. With these we could then do df.plot_spectrum(mz = "mz", intensity = "int"), df.plot_chromatogram(rt = "rt", intensity = "int"). Pandas can be extended with additional accessors, which is how we can attach these higher-level plot methods to the dataframe object. Still currently looking into this to see how easy/feasible it is.

See: https://pandas.pydata.org/pandas-docs/stable/development/extending.html

This would be nice to have, but the current method of choosing the kind of plot seems simple enough. I would set this as a potential future feature given the time constraints right now.

need help with Bokeh tooltips (works with ion mobility, but not with vline spectra)

I can try to look into this. Did you use the new app-devel-spectrum.py to test for this?

Thanks! Yes, I tested everything with app-devel-spectrum.py. I changed the method for getting tooltips a bit: you now pass a dict with the tooltip item names (as they appear in the tooltip) as keys and dataframe columns as values. This works fully in Plotly and in the Bokeh ion mobility plot, but not in the Bokeh spectrum plot.
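
A sketch of how such a label-to-column dict could be translated for the two backends. The helper names are hypothetical and this is not the code from the PR; it only builds the strings (Bokeh's `@{column}` field syntax and a Plotly hovertemplate reading from `customdata`) and does not explain the failure in the Bokeh spectrum plot.

```python
def bokeh_tooltips(tooltips):
    """Convert {label: column} into Bokeh-style (label, "@{column}") tuples."""
    return [(label, f"@{{{column}}}") for label, column in tooltips.items()]


def plotly_hovertemplate(tooltips):
    """Build a Plotly hovertemplate, assuming the columns are passed as
    customdata in the same order as the dict values."""
    lines = [
        f"{label}: %{{customdata[{index}]}}"
        for index, label in enumerate(tooltips)
    ]
    # <extra></extra> hides the secondary box with the trace name.
    return "<br>".join(lines) + "<extra></extra>"


tooltip_spec = {"m/z": "mz", "intensity": "int"}
bokeh_spec = bokeh_tooltips(tooltip_spec)
plotly_spec = plotly_hovertemplate(tooltip_spec)
```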

matplotlib grid lines on top of scatter in ion mobility plot

Do we need to draw the grid lines before drawing the scatter points?

We would need to change the order; that should work, yes. First update plot aes and then plot.
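
Independent of call order, matplotlib also offers `Axes.set_axisbelow`, which places grid lines and ticks below the plotted artists. A minimal sketch with made-up data, shown as one possible approach rather than what this PR ended up doing:

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend so the sketch runs headless
import matplotlib.pyplot as plt

fig, ax = plt.subplots()
ax.grid(True)
ax.set_axisbelow(True)  # draw grid lines and ticks below plotted artists

# Made-up m/z vs. ion mobility points, colored by intensity.
points = ax.scatter([100.0, 200.0, 300.0], [0.8, 1.0, 1.2], c=[10, 50, 30])
```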

how to update global settings (e.g. axis labels) setting self.xlabel = "new label" in plot methods does not transfer to _create_figure method: want to automatically set default axis labels if ion mobility is set

The base plot config is used to update all the defaults if a specific parameter (e.g. xlabel) is not passed directly to the plot method.

So if df.plot("rt", "int", "chromatogram", xlabel = "RT"), then self.xlabel should be "RT", otherwise if df.plot("rt", "int", "chromatogram") then self.xlabel should be "X-axis", because it gets updated by the config.

What we could do is add a mapping of default axis labels for each kind of plot, and then pass kind to the _BasePlotConfig to configure the default values for axis labels and other params that are plot specific.

If we have a generic PeakMap Plot which will be used to plot feature heatmaps, ion mobility spectra and experiments we can set sane defaults right away and don't need to change that (like now with spectrum and ion mobility spec).

axelwalter · 2024-07-19T09:42:51Z

@singjc thanks for the review and feedback! If @jcharkow agrees with changing FeatureHeatmap to PeakMap which will be used to plot the different kinds of peak maps (feature heatmaps, ion mobility spectra, MS experiments) I will implement that next week and push changes.

jcharkow · 2024-07-19T12:33:12Z

@singjc thanks for the review and feedback! If @jcharkow agrees with changing FeatureHeatmap to PeakMap which will be used to plot the different kinds of peak maps (feature heatmaps, ion mobility spectra, MS experiments) I will implement that next week and push changes.

Yes I agree with changing FeatureHeatMap to PeakMap. Will review the rest of the code shortly

jcharkow

I have not tested it but looks good. Just a few minor suggestions. I like the idea of combining FeatureHeatMap into a PeakMap object and am excited to see those changes.

pyopenms_viz/_bokeh/core.py

pyopenms_viz/_matplotlib/core.py

axelwalter · 2024-07-23T13:59:56Z

Thanks for the suggestions @singjc and @jcharkow ! Implemented the changes and added example notebooks, where the plotting backend can be selected in the first cell. The example streamlit app could be implemented in the OpenMS web app template, and we stick with notebooks here, what do you think?

singjc · 2024-07-23T14:04:00Z

Thanks for the suggestions @singjc and @jcharkow ! Implemented the changes and added example notebooks, where the plotting backend can be selected in the first cell. The example streamlit app could be implemented in the OpenMS web app template, and we stick with notebooks here, what do you think?

Great, thanks for making the changes! I think that sounds good and makes sense 👍. We can probably include a link to the OpenMS web app template repo in the README to redirect/showcase using the plotting framework in a web app?

axelwalter · 2024-07-23T14:07:19Z

Great, thanks for making the changes! I think that sounds good and makes sense 👍. We can probably include a link to the OpenMS web app template repo in the README to redirect/showcase using the plotting framework in a web app?

Sure sounds great! Will add a pyopenms-viz page to the template app and link here in the README once this repo is public / released.

axelwalter · 2024-07-23T14:09:30Z

One thing missing here which I will add eventually is the 3D peak map with matplotlib.

singjc · 2024-07-23T14:20:30Z

One thing missing here which I will add eventually is the 3D peak map with matplotlib.

Oh right, I think you did have code to do this in the MSExperimentPlotter in the initial version? Do you think we could re-use some of that code? Do we need to add another base class for 3D peak maps?

axelwalter · 2024-07-23T14:45:13Z

Oh right, I think you did have code to do this in the MSExperimentPlotter in the initial version? Do you think we could re-use some of that code? Do we need to add another base class for 3D peak maps?

Yes, the code is in the initial version. We could adapt the VLine Plot to color lines with intensity values. However, the figure itself needs to be initialized differently as a 3D figure.

axelwalter added 16 commits July 15, 2024 13:04

plotly tooltip for multiple traces

2e15aa1

- useful if each item to plot gets a separate trace in figure (e.g. peaks in a spectrum)

update tooltips for bokeh plot

beb0709

remove unused import

2ef45d3

add test dataframes for MSExperiment and Spectrum

1119c73

spectrum custom peak colors

78fc43e

custom colors for mirror spec

1b1b157

relative intensity

630b29a

update spectrum peak colors

22e5369

- colors based on group by
- custom colors for individual peaks

show legend in spectrum plots only if group by

aad0735

modify x-axis range

d79c883

don't show legend for vline traces in plotly by default

b713cad

never show legend for a single spectrum in plotly

e337da6

color generator for peaks and annotations

ef2165d

spectrum annotations

4bae4a2

- mz
- ion annotations (with colors)
- sequence
- custom annotation
- select top n intensity peaks for annotation or all

ion mobility spectrum plot

a160fe6

- includes major changes to scatter plot, now with heatmap functionality
- will be useful for feature heatmap, peak map, ...

temporary file for streamlit app showcasing new spectrum plots

cd71cf9

axelwalter requested review from jcharkow and singjc and removed request for singjc July 18, 2024 12:52

singjc approved these changes Jul 19, 2024

jcharkow approved these changes Jul 19, 2024

axelwalter added 3 commits July 22, 2024 09:07

use ColumnDataSource in Bokeh VLine Plot

6decf85

FeatureHeatmap to PeakMap

2490a0c

- can be used to plot feature heatmaps, MS experiment 2D peak maps, ion mobility spectra

remove ion mobility from spectrum plot, use x and y as parameters

7cb7858

axelwalter added 12 commits July 22, 2024 11:28

matplotlib scatter markers on top of grid lines

92e0d59

add MarkerShapeGenerator

29e5ea0

similar to ColorGenerator returns a generator with marker shapes based on the plotting engine

update plotting backend names according to pyproject.toml

813fb30

move add_annotations to VLinePlot

de84011

line color fixes

e40a083

- determine peak color in _get_colors only, not in _get_annotations
- bokeh peak color

add spectrum example notebook

8e7d3ff

peak map updates

9eace3d

- sort z values to plot highest on top
- relative intensity
- peak binning
- example notebook

peak map optional log intensity

3a7f0d5

hide legend in plotly peak map if not grouped

3ab11e4

copy peak map data to not modify original

a5a8012

peak map example notebook

4f760c2

remove spectrum streamlit app

170a2be

axelwalter merged commit 6fe4712 into OpenMS:main Jul 31, 2024

3 participants