Logger.report_plotly changes the original plot/ screws up formatting #247

lakshaykc · 2020-11-15T02:53:14Z

I'm trying to report plotly figures via Logger.report_plotly method. Many of the figure attributes are missing when I view in the web api and other elements (such as colors) change randomly.

Here is the original figure

and this is what happens in the web app

Below, I log the differences between the original and the web-app plot.

Line colors are specified in the script (i.e I'm not using default colors). The line colors randomly change in the web app plot. Every time I re-run the experiment, the color scheme changes on the web app.
Heading and Y-axes labels are missing.
Space formatting/ aspect ratio of the figure is not ok. I have to scroll left-right quite a bit. The figure should have better default sizing that fits into the window
Legend location is specified for the original figure but it gets changed for the web app. Also there is some strange string that appends to the label. Here the labels should be "train_mse", "valid_mse" but instead they are "train_mse.80e", "valid_mse.80e". I don't understand where the ".80e" is coming from.

I have following questions:

Does Logger.report_plotly not log all the figure attributes? Where does the plotly figure object change?
How can I improve the default formatting on the web app? I want to have multiple subplots.

Here is the script that I used to generate this example for reproducibility.

from trains import Task, Logger
from plotly.subplots import make_subplots
from trains.backend_api.session.client import APIClient
import plotly.graph_objects as go
import numpy as np

# Generate plots
n_cols = 2
n_rows = 2
N = 10

fig_1 = make_subplots(rows=n_rows, cols=n_cols,
                      shared_yaxes=True, shared_xaxes=True,
                      horizontal_spacing=0.005, vertical_spacing=0.15)

fig_2 = make_subplots(rows=n_rows, cols=n_cols,
                      shared_yaxes=True, shared_xaxes=True,
                      horizontal_spacing=0.005, vertical_spacing=0.15)

pe_settings = {'metrics': ['train_mse', 'valid_mse'],
               'colors': ['dodgerblue', 'darkorange']}

uq_settings = {'metrics': ['train_uq_loss', 'valid_uq_loss'],
               'colors': ['mediumblue', 'lightcoral']}

show_legend = True

for i in range(n_cols):
    for j in range(n_rows):
        # PLot 1
        for k, metric in enumerate(pe_settings['metrics']):
            trace = go.Scatter(x=np.arange(1, 10),
                               y=np.random.rand(10),
                               name=metric,
                               showlegend=show_legend,
                               line=dict(color=pe_settings['colors'][k]))

            fig_1.add_trace(trace, row=j + 1, col=i + 1)

        # plot 2
        for k, metric in enumerate(pe_settings['metrics']):
            trace = go.Scatter(x=np.arange(1, 10),
                               y=np.random.rand(10),
                               name=metric,
                               showlegend=show_legend,
                               line=dict(color=pe_settings['colors'][k]))

            fig_2.add_trace(trace, row=j + 1, col=i + 1)

        show_legend = False

margin_val = 100

fig_1.update_layout(title_text="Training history ",
                    xaxis_title='Epochs',
                    yaxis_title='Loss 1',
                    legend_orientation="h",
                    legend=dict(x=0, y=1.025),
                    margin=dict(l=margin_val, r=margin_val, t=margin_val, b=margin_val),
                    # height=200 * n_rows
                    )

fig_2.update_layout(title_text="Training history ",
                    xaxis_title='Epochs',
                    yaxis_title='Loss 2',
                    legend_orientation="h",
                    legend=dict(x=0, y=1.025),
                    margin=dict(l=margin_val, r=margin_val, t=margin_val, b=margin_val),
                    # height=200 * n_rows
                    )

# y axes titles i.e dates
# use last i_children and m_child to get iterative dates
for k in range(n_rows):
    y_axis_title = "{start_date} to {end_date}".format(start_date='201501',
                                                       end_date='201912')

    fig_1.update_yaxes(title_text=y_axis_title, row=k + 1, col=1)
    fig_2.update_yaxes(title_text=y_axis_title, row=k + 1, col=1)

fig_1.show()

# remove existing experiment if it exists
client = APIClient()
prev_task = Task.get_task(task_name='plotly_test', project_name='examples')
if prev_task:
    client.tasks.delete(task=prev_task.id, force=True)

# Log to trains
task = Task.init(project_name='examples', task_name='plotly_test')
Logger.current_logger().report_plotly(title='LOSS 1', series='Series 1', iteration=0, figure=fig_1)
Logger.current_logger().report_plotly(title='LOSS 2', series='Series 2', iteration=0, figure=fig_2)

The text was updated successfully, but these errors were encountered:

bmartinn · 2020-11-16T01:10:27Z

Hi @lakshaykc

Thank you for the code sample, this is great!

Regrading the colors, the UI enables to you change the colors in real time and it will remember the selected colors.
The colors selection is done based on the combination of title & series .
The idea is to give you the ability to control the colors while viewing (this is especially helpful when comparing multiple experiments).
The same goes for the legend, it's location is changed but in the UI when you hover over the plot, you can hide/show it (see the second icon from the right).

We are working on improving the automatic color selection, the new implementation will find a palette that separates different series inside the same plot (but colors on different plots could be shared)

Regrading (2) , I ran your code sample, and I have the Y-axis labels, that said the heading (plot title) is missing. I'll look into that.

Specifically to your example, I would actually report the 4 joined plots as 4 individual plots. This will allow the UI more flexibility in
layout optimization, don't you think ?

lakshaykc · 2020-11-16T01:52:05Z

Thanks for the response @bmartinn.

Having automatic color scheme is nice. But I notice the color scheme changes even if I run the same experiment again. I was wondering where that randomness is coming from. This is not a big issue, but having consistency helps to understand plots slightly faster. For my case, each experiment has between 20-50 plots, so that's why color default color scheme becomes important. But I do understand your point of having the ability to change the color for comparison. It is quite helpful. Not that our points are contradictory.

As an example, I use a plot like this where I'm looking at two different loss metrics. It is one experiment where the model is an ensemble of 10 models. So for each loss metric I have train-valid curve for each ensemble model member.

Specifically to your example, I would actually report the 4 joined plots as 4 individual plots. This will allow the UI more flexibility in
layout optimization, don't you think ?

I was thinking about it but it would be harder to quickly analyze the run for this use case. Here there is only one row for each loss metric. But in other cases I also have to plot multiple rows where each row signifies something (eg. time period of training). So the formatting might be better if I create individual plots but it would be more work to go through them. That being said, it is still quite doable. I was just wondering if using sub plots for such cases would be more optimal. Thoughts?

The same goes for the legend, it's location is changed but in the UI when you hover over the plot, you can hide/show it (see the second icon from the right).

Thanks for the tip on showing/hiding the legend. Did you notice the labels on the legend have random sting attached at the end of them? In the image in the original post the string is '.80e' but in other cases it goes as long as 10-12 characters (eg. train-use.80ejsh83si8dsh). I'm not sure where they get changed.

Regrading (2) , I ran your code sample, and I have the Y-axis labels, that said the heading (plot title) is missing. I'll look into that.

That's strange. I still don't see the y-axis labels. I'll try another sample and see if I can get it to show.

lakshaykc · 2020-11-16T19:00:16Z

Hi @bmartinn

I tried what you suggested of having individual plots instead of subplots. Here is a simple script to create this example.

from trains import Task, Logger
from plotly.subplots import make_subplots
from trains.backend_api.session.client import APIClient
import plotly.graph_objects as go
import numpy as np

n_figs = 4
N = 10

# remove existing experiment
client = APIClient()
prev_task = Task.get_task(task_name='plotly_test', project_name='examples')
if prev_task:
    client.tasks.delete(task=prev_task.id, force=True)

task = Task.init(project_name='examples', task_name='plotly_test')

for i in range(n_figs):
    trace_1 = go.Scatter(x=np.arange(N), y=np.random.rand(N), name='train_mse')
    trace_2 = go.Scatter(x=np.arange(N), y=np.random.rand(N), name='valid_mse')

    fig = go.Figure()
    fig.add_trace(trace_1)
    fig.add_trace(trace_2)

    fig.update_layout(title_text="Training history ",
                      xaxis_title='Epochs',
                      yaxis_title='Loss',
                      )
    fig.update_yaxes(title_text='2015 to 2019')

    Logger.current_logger().report_plotly(title='LOSS ' + str(i), series='Loss ' + str(i), iteration=0, figure=fig)

As you can see in the gif below, only the title of the first plot 'Loss 0' is shown. Other titles are missing. Also, y-axis titles are missing for all the four plots.

And finally it would be nice to have the figure fit into the plot window instead of scrolling left and right. What do you think?

bmartinn · 2020-11-16T22:39:21Z

@lakshaykc thank you for the screen garbs they are very useful !
I want to differentiate here between bugs, and UI improvements.
Let's start with the bugs (let me know If I missed any):

Plot title is missing when reporting a plotly figure with sub-plots
Plotly figure legend has the task id next to the series name (here you see train_mse.80e instead of trains_mse (This is the reason for the unstable colors, as they are derived form the title/series combination and the series name is constantly changing this causing a new random color to be picked)
Plotly plots are not constraint to the screen width (i.e. scroll issue)

Now for usability:

... It is one experiment where the model is an ensemble of 10 models. ...

Maybe it is just me, but I would put all 10 losses on the same graph, in the UI you can enable / disable each series by click on the name itself (double click to get a solo view of the series). At least in my mind it is a lot easier to compare, no?!
(BTW: In the next version we will have group/split toggle switch, so you could see all 10 series on the same graph or, with a single click, split them across 10 graphs)

As you can see in the gif below...

That is strange, I ran your code against the demo-server (see here)
No issue with scrolling (tested on both chrome / firefox)... I was also able to see the Y axis label (title is till missing though)
See screenshot:

lakshaykc · 2020-11-17T00:01:55Z

@bmartinn When I launch the demo-server link you put above on safari and chrome, I still don't see the y-axis labels. See the gifs below.

Safari's formatting is weird - too much scrolling to do and the window is only using half of the screen (bottom half of the window is dark). Chrome is at least getting those things right. Another difference between them is that [Object Object] appears on left top of chrome example and not on Safari. I'm guessing it is supposed to be series title?!

Safari

Chrome

lakshaykc · 2020-11-17T00:08:41Z

Plotly figure legend has the task id next to the series name (here you see train_mse.80e instead of trains_mse (This is the reason for the unstable colors, as they are derived form the title/series combination and the series name is constantly changing this causing a new random color to be picked)

Ok, that is good to know. Why is the series name constantly changing? In the code samples above, I set the series name to be constant ('Series 1', 'Series 2') for a figure. Why doesn't the demo-server example not have task id at the end?

Plotly plots are not constraint to the screen width (i.e. scroll issue)

Got it. Thanks. From the examples above, it seems more like a browser issue.

lakshaykc · 2020-11-17T00:29:07Z

Now for usability:

... It is one experiment where the model is an ensemble of 10 models. ...

Maybe it is just me, but I would put all 10 losses on the same graph, in the UI you can enable / disable each series by click on the name itself (double click to get a solo view of the series). At least in my mind it is a lot easier to compare, no?!
(BTW: In the next version we will have group/split toggle switch, so you could see all 10 series on the same graph or, with a single click, split them across 10 graphs)

Looking forward to the group/split toggle switch!

Regarding putting all 10 losses on the same graph - I should clarify the comparison first. Here the 10 models are not comparable to each other. They might have different validation sets. The comparison is done between the training loss and its respective validation loss. So comparing all the 10 models with each other is irrelevant.

Having all 10 models in the same plot would cause confusion. Manually selecting/deselecting the right curves would be some manual work. With the example figure I posted with 10 columns, I can get a good snapshot without much effort. The basic idea is to see how training is progressing for each of my models where these models are independent of each other. I hope this makes the use case a little more clear.

One thing that confuses me a little bit - if all trains cares about is a plotly figure, a normal figure or a subplot shouldn't make a difference right?

That being said, I do understand my use case might be niche and not as widely applicable wrt to number of plots. That's why I was focussing on doing all the work on plotly side rather than trains side.

NumesSanguis · 2020-11-17T01:38:34Z

Not related to any of the bugs mentioned here, but for UI plot consistency, it would be great if there is some name matching across different plots.

For example if I report the following values:

# 1st plot
"accuracy/train"
"accuracy/val"
"accuracy/test"

# 2nd plot
"auroc/train"
"auroc/val"
"auroc/test"

It would be great if "train" would be the same color in both plots, and also that this color stays the same for any experiment in the same project.

bmartinn · 2020-11-17T08:20:31Z

@lakshaykc I think figured what the issue is, go to the profile page (here) and disable the HiDPI support (I think there is a bug there with the plots and HiDPI, and I think this is the culprit of the scroll issue, Y-axis is a derivative as it is on the right of the plot which is hidden)

.. Here the 10 models are not comparable to each other. They might have different validation sets. ...

Understood, that makes sense. If this is the case maybe just 10 different graphs (no subplots) then you can just toggle them on/off in the left panel?!

One thing that confuses me a little bit - if all trains cares about is a plotly figure, a normal figure or a subplot shouldn't make a difference right?

In theory, definitely correct. In practice since we do change the plotly layout a bit (legend colors etc.) bugs might pop, as you have noticed :)

@NumesSanguis this is a good point. The color design for the next version is the following (I might be missing details but this is the general gist). Per plot (i.e. graph) select colors (from a pre-made color palette) based on the series names.
This means that in your examples trains/val/test will have the same colors in both plots. That said, if you have a third plot with 4 series (for example train/val/test/average), then the colors might not match the previous 3. The reason for that is the color matching will be done on a per plot basis and also based on the order of series names, so matching between the plots is out of scope. Do remember you will still be able to change colors per plot/series, so you could manually match the colors, and it is stored, so the next time you execute an experiment all colors will match.

NumesSanguis · 2020-11-17T09:41:54Z

@bmartinn Thank you for the breakdown of the new functionality. In my specific use-case this would still fall short (without manual modification), because for some metrics I would calculate "train", "val" and "test", but for others only "val" and "test". Just to let you know 1 user's use-case ^-^

bmartinn · 2020-11-17T10:20:23Z

Thanks @NumesSanguis !
If you have an improvement on the color selection algorithm, please feel free to suggest.
(The problem we have with just hashing the title.series then selecting based on the hash, there is no way to guarantee unique colors, and this is actually the problem with the current implementation that does exactly that)

lakshaykc · 2020-11-17T17:30:54Z

@bmartinn Bingo! That was it. The After disabling HiDPI, the UI is working as expected and the plots are looking sharp. I'm guessing HiDPI stands for Higher DPI for the image?! Do we need to create a bug report/new issue for this problem or this issue is good enough?

Also, I've noticed the UI on Safari is not as good as Chrome (haven't tested on Firefox). The formatting and layout is not optimal. Dimensions of the web pages (fonts, icons, etc.) are much smaller.. I will create a separate issue for that.

lakshaykc · 2020-11-17T17:37:09Z

@bmartinn looks great now!

I'm still seeing the task id appended to the legend label. Is that a bug too?

bmartinn · 2020-11-17T23:52:25Z

Yes @lakshaykc you are correct the "task id" appearing in the legend is a bug (Third one on my bug list)

Regrading browser support, we test on both Chrome and Firefox, together they cover almost the entire market, I guess these days even on MacOS Safari is left behind :)

BTW:
HiDPI High Dots Per Inch (i.e. auto scaling done by the OS based on the resolution / pixel size, we try to adjust the layout based on that, and scale back the OS "zoom" effect, sometimes with less success than we would have hoped...)

lakshaykc · 2020-11-18T05:58:09Z

Great, thanks!

Yes, I agree re Safari. I'm trying out the new Safari that comes with OS Big Sur. Will stick to Chrome when using trains :)

allegroai-git closed this as completed Jan 17, 2021

RobinBaumann mentioned this issue May 31, 2021

Direct Plotly reporting does not retain colorscale. #373

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Logger.report_plotly changes the original plot/ screws up formatting #247

Logger.report_plotly changes the original plot/ screws up formatting #247

lakshaykc commented Nov 15, 2020

bmartinn commented Nov 16, 2020 •

edited

lakshaykc commented Nov 16, 2020

lakshaykc commented Nov 16, 2020

bmartinn commented Nov 16, 2020

lakshaykc commented Nov 17, 2020

lakshaykc commented Nov 17, 2020

lakshaykc commented Nov 17, 2020

NumesSanguis commented Nov 17, 2020

bmartinn commented Nov 17, 2020

NumesSanguis commented Nov 17, 2020 •

edited

bmartinn commented Nov 17, 2020

lakshaykc commented Nov 17, 2020

lakshaykc commented Nov 17, 2020 •

edited

bmartinn commented Nov 17, 2020

lakshaykc commented Nov 18, 2020

Logger.report_plotly changes the original plot/ screws up formatting #247

Logger.report_plotly changes the original plot/ screws up formatting #247

Comments

lakshaykc commented Nov 15, 2020

bmartinn commented Nov 16, 2020 • edited

lakshaykc commented Nov 16, 2020

lakshaykc commented Nov 16, 2020

bmartinn commented Nov 16, 2020

lakshaykc commented Nov 17, 2020

lakshaykc commented Nov 17, 2020

lakshaykc commented Nov 17, 2020

NumesSanguis commented Nov 17, 2020

bmartinn commented Nov 17, 2020

NumesSanguis commented Nov 17, 2020 • edited

bmartinn commented Nov 17, 2020

lakshaykc commented Nov 17, 2020

lakshaykc commented Nov 17, 2020 • edited

bmartinn commented Nov 17, 2020

lakshaykc commented Nov 18, 2020

bmartinn commented Nov 16, 2020 •

edited

NumesSanguis commented Nov 17, 2020 •

edited

lakshaykc commented Nov 17, 2020 •

edited