Visualization bug #54

Closed

matthewcarbone opened this issue Aug 9, 2018 · 11 comments
Labels
priority: LOW lowest priority

Comments

@matthewcarbone
Collaborator

matthewcarbone commented Aug 9, 2018

I noticed something with the Reporting class. I'm not sure if reporting.data is supposed to print cleanly, but it doesn't for me:

[screenshot: reporting.data printing without clean formatting]

The standard output looks like this:

[screenshot: the standard output]

But I think an easy fix is to do this:

[screenshot: the proposed fix]

Thoughts?

@matthewcarbone matthewcarbone added the priority: LOW lowest priority label Aug 9, 2018
@mikkokotila
Contributor

Thanks! I have the new reporting almost rebuilt. It's now also more focused on giving us access to details that are vital for the next level of abstraction, i.e. optimizing the optimization. Once we get the current dev into master, I'll add the new reporting to dev. What do you think about merging with master now?

@matthewcarbone
Collaborator Author

I think we could, but if you're refactoring Reporting, we might as well wait until you're done with that. Is there any reason to merge sooner?

@mikkokotila
Contributor

OK great, yes, I think it's good we merge now first. As for reporting, I'm actually writing it from scratch on the basis of what I found useful. What features do you think would be useful to add to it?

@matthewcarbone
Collaborator Author

@mikkokotila There are definitely a few things from my experience with it that I would recommend:

  • Allow sorting by different quantities (validation accuracy, validation loss, training loss, etc.).
  • Definitely let the user easily trim the columns. In other words, maybe we don't care about anything other than the validation accuracy; in that case, let the user easily hide the training loss, validation loss and training accuracy. There is a very easy way to do this with pandas that I've used before. Let me know if you have any trouble finding/using it and I can dig it up.
  • Easy callbacks of various histories. Perhaps there's a way to get Scan to save all of those. This would be incredibly useful, I think.
  • Trimming rows as well, so that, say, only the top 5 permutations are displayed. This is also easy to do in pandas, I think.
  • Customizable display options (e.g. if the user is running in a terminal, let them set a flag so that instead of trying to display via pandas it just saves the .csv). A rough sketch of these last two points follows the list.
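
For those last two bullets, here's a minimal sketch of what I mean; the file name, the column names and the flag are all illustrative:

import pandas as pd
from IPython.display import display

# Load the experiment log (file name and column names are illustrative).
df = pd.read_csv('my_exp_1.csv')

# Trim rows: keep only the top 5 permutations by validation accuracy.
top5 = df.sort_values('val_acc', ascending=False).head(5)

# Customizable display: render inline in a notebook, or just save a .csv
# when running in a terminal.
use_notebook_display = True
if use_notebook_display:
    display(top5)
else:
    top5.to_csv('report.csv', index=False)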

Can't think of anything else off the top of my head, but yeah, that's a good place to start! Let me know if there's anything I can do. Otherwise, when you're done I'll try to help with comments/cleanup and whatnot. 👍

@mikkokotila
Contributor

@x94carbone thanks a lot. In terms of the callbacks, could you share some examples of the use cases? Storing the history is of course easy in general, and the data size should be totally fine as long as we handle it in an array the right way during the run and dump it to the df only at the end of the process.
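
Something like this is the pattern I mean; a minimal sketch, with dummy loops and values standing in for the real scan and metrics:

import pandas as pd

# During the scan: append one small dict per epoch instead of growing a df.
records = []
for round_id in range(3):          # stand-in for the scan's permutation loop
    for epoch in range(2):         # stand-in for the training epochs
        records.append({'round': round_id,
                        'epoch': epoch,
                        'loss': 0.5 / (epoch + 1),      # dummy metric values
                        'val_loss': 0.6 / (epoch + 1)})

# Only at the end of the process: dump everything into the df in one go.
df = pd.DataFrame(records)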

For the display side, I made the whole thing simpler: you have a class object that takes in one parameter, the log, and from there you have various properties (like the peak round). That makes it very easy to extend. The code is super clean, without any of the complexities of the current one.
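
Roughly this shape, as a minimal sketch of what I described; the property names and the 'val_acc' column are placeholders, not the actual new API:

import pandas as pd

class Reporting:

    def __init__(self, log):
        # log: path to the experiment .csv written during the scan
        self.data = pd.read_csv(log)

    @property
    def peak_round(self):
        # index of the round with the highest validation accuracy;
        # assumes the log has a 'val_acc' column (placeholder name)
        return self.data['val_acc'].idxmax()

    @property
    def rounds(self):
        # total number of rounds in the experiment
        return len(self.data)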

@matthewcarbone
Collaborator Author

Nice, that sounds good. By callbacks do you just mean how I manipulate the dataframe to trim columns and that sort of thing?

@matthewcarbone
Collaborator Author

Well, in case that was what you were referring to, here's the answer for the current state of Talos.

First, if you're using a notebook, these imports will make the output easier to deal with.

import talos as ta
import pandas as pd
from IPython.display import clear_output, display

Load the data from the run you just did with Talos, but suppress the output, since it's not always clearly formatted, depending on factors I haven't pinned down.

r = ta.Reporting('my_exp_1.csv')
clear_output()  # suppress the raw printout from Reporting

Here's an example of removing the columns in remove_columns and sorting the rows by sort_by.

# Columns to drop and the quantity to sort on.
remove_columns = ['round_epochs', 'acc', 'loss']
sort_by = 'val_fbeta_score'

# sort_values already returns a DataFrame, so no extra wrapping is needed.
rr = r.data.sort_values(sort_by, ascending=True).drop(remove_columns, axis=1)
display(rr)

Is this helpful?

@channhan007

Hi there,
For the format of the output, I don't know why the column names are not in the right order.
Do you guys know how to fix this?
Thanks
[screenshot: output with column names out of order]

@mikkokotila
Contributor

@channhan007 the whole reporting piece is going to be replaced with a new, much cleaner approach that also supports the various use cases, including far better integration with plots. In the meantime, for a simple table view, you might just use:

from pandas import read_csv

read_csv('experiment.csv')

@channhan007

@mikkokotila Thank you for your response. I looked at the .csv output file and the column labels are not in the right order, so I have to rearrange those column names myself.
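
For now I do the rearranging in pandas; a minimal sketch, with placeholder column names:

import pandas as pd

df = pd.read_csv('experiment.csv')

# Reindex with an explicit column list in the desired order
# (these names are placeholders for whatever the log actually contains).
ordered = ['round_epochs', 'loss', 'acc', 'val_loss', 'val_acc']
df = df[ordered]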

@mikkokotila
Contributor

The new reporting is now included in dev and the old one is removed, so I'm closing here. Later we can open separate issues for new Reporting() features. It lives in /utils/reporting.py.
