Robust handling of inconsistent TabularInput keys #45

naeioi · 2020-03-11T18:59:49Z

Currently, CsvOutput emits a warning if the keys of a TabularInput change after the first call to logger.log(TabularInput). A new key not seen before will be ignored and an old key not presented will be left blank. In other words, CsvOutput conservatively handles dynamic fieldnames.

This behaviour of CsvOutput makes it tricky to log performance of Multi- and Meta- ML algorithms, where there are usually per-task fields but not every task is presented in every iteration, resulting in missing of logs for some tasks.

The desired behaviour to handle inconsistent keys should be

When a new key is encountered
- Expand header with the new key.
- Expand old rows with empty cells for the new key.
If the value of any key is missing, leave the cell blank.

The text was updated successfully, but these errors were encountered:

avnishn · 2020-05-19T06:06:48Z

Introduction

Dowel is a tool that the garage Team uses for logging results from our various Reinforcement learning experiments.

Dowel can be used to log different types of data such as floats or strings. The logs can be logged to stdout (the console), CSV files, and Tensorboard.

You can check out an example of how Dowel is used here. In fact, almost all parts of the Dowel API are used in this example.

The problem

After statistics such as loss have been logged, and a call to logger.dump_all() is made for the first time, new tabular data can’t be written to a CSV output. This is because currently data cannot be inconsistently logged to CSV, meaning that on every single call to dump_all, the same logger keys must appear. Data that is inconsistently logged will not appear in the CSV output. This is a design flaw that we have been able to work around but affects our workflows.

Your goal is to solve the problem as well as introduce tests into our testing framework in order to verify your solution.

Some General Instructions

Fork Dowel and install all necessary dependencies.
Take a look at this toy example which when run exposes the bug and the accompanying issue mentioned above.
When you have finished writing your solution and tests, upload a PR onto your fork, not onto the upstream repository.
When you are done email us back with the link to your pull request.

If you have any questions, open an issue in your fork, and tag @avnishn and @zequnyu. Our preferred mode of communication on any questions that you have is through github issues and pull requests, as this is how the Garage team communicates generally. For this reason, we won’t respond to any direct emails with regards to help with your project. We will however respond to any other questions that you have via email (interview scheduling, etc).

Best of luck, and let us know if there are any issues as early on as possible

Before this commit, adding a new key in the tabular type after the first call would lead to a warning. Now, if a new key is added after the initial call CSV_output will rewrite the CSV file to include additional columns corresponding to the new keys. This allows for dynamic keys when using Tabulars. Resolves rlworkgroup#45

This commit adds robust handling of inconsistent TabularInput keys, with two implementations. The current behavior of ignoring new keys is kept as the default, but the users can now optionally specify how to record new keys and the corresponding values. (They must consider the trade-off between the two implementations.)

naeioi mentioned this issue Mar 11, 2020

Add offline meta-testing and per-task logging to MAML rlworkgroup/garage#1187

Closed

naeioi mentioned this issue Apr 21, 2020

Fix off-policy algos tabular record not working rlworkgroup/garage#1331

Merged

dxlin17 mentioned this issue May 19, 2020

Add Python 3.8 build #47

Closed

terickson87 mentioned this issue May 19, 2020

Fix heterogeneous key csv logging terickson87/dowel#1

Open

jialingt mentioned this issue May 20, 2020

Fix the inconsistent tabularinput key jialingt/dowel#1

Open

SuperElephant mentioned this issue May 20, 2020

enhance handling inconsistent tabularInput keys + test SuperElephant/dowel#1

Open

suprememichael added a commit to suprememichael/dowel that referenced this issue May 20, 2020

Fix rlworkgroup#45

24591b2

suprememichael mentioned this issue May 20, 2020

Fix #45 Robust handling of inconsistent TabularInput keys suprememichael/dowel#1

Open

lunjohnzhang mentioned this issue May 20, 2020

fix inconsistent TabularInput keys bug lunjohnzhang/dowel#1

Open

terickson87 added a commit to terickson87/dowel that referenced this issue May 20, 2020

Appears to have fixed issue rlworkgroup#45 (#1 locally)

6f11eb0

tinkerLiam mentioned this issue May 20, 2020

Starter project tinkerLiam/dowel#1

Open

irisliucy mentioned this issue May 21, 2020

Add Python 3.8 build #48

Closed

Adhyyan1252 mentioned this issue May 21, 2020

Added support for dynamic keys in CSV output Adhyyan1252/dowel#1

Closed

irisliucy mentioned this issue May 21, 2020

Fix #45(Inconsistent tabularInput Keys + test) irisliucy/dowel#1

Open

parul6695 mentioned this issue May 21, 2020

Modified csv_output and test_csv_output, Added dowelTest1 and dowelTest2 parul6695/dowel#1

Open

nicolengsy added a commit to nicolengsy/dowel that referenced this issue May 22, 2020

fixing issue rlworkgroup#45

bf0366a

ziyiwu9494 mentioned this issue May 22, 2020

Fix for #45 ziyiwu9494/dowel#1

Open

koverman47 mentioned this issue May 22, 2020

Fix to Bug 45 koverman47/dowel#1

Closed

GuanyangLuo mentioned this issue Jun 2, 2020

Handle inconsistent TabularInput (rlworkgroup#45) GuanyangLuo/dowel#1

Open

avnishn closed this as completed Jan 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Robust handling of inconsistent TabularInput keys #45

Robust handling of inconsistent TabularInput keys #45

naeioi commented Mar 11, 2020 •

edited

avnishn commented May 19, 2020 •

edited by zequnyu

Robust handling of inconsistent TabularInput keys #45

Robust handling of inconsistent TabularInput keys #45

Comments

naeioi commented Mar 11, 2020 • edited

avnishn commented May 19, 2020 • edited by zequnyu

Introduction

The problem

Some General Instructions

naeioi commented Mar 11, 2020 •

edited

avnishn commented May 19, 2020 •

edited by zequnyu