Improve Kedro-Viz docs for Experiment Tracking #2193

tynandebold · 2023-01-11T14:39:50Z

NOTE: Kedro datasets are moving from kedro.extras.datasets to a separate kedro-datasets package in
kedro-plugins repository. Any changes to the dataset implementations
should be done by opening a pull request in that repository.

Description

Resolves (part of) kedro-org/kedro-viz#1241.

Warning
This PR shouldn't be merged until version 5.2.0 of Kedro-Viz is released. This is because there's content in this PR that will only exist when that release is out.

Development notes

Taken from the comment here, I've done the following:

Ensure that the spaceflights tutorial works without issue. There was a problem with Plotly where the charts weren't rendering (relates to Make Kedro-Viz work with kedro-datasets kedro-viz#1205).
There are actually two pages for exp. tracking documentation. We should only have one.
- Copy the contents of the second, smaller page into the first and then delete that content from the second page.
Mention parallel coordinate and time-series plots.

A note on each checked item:

I checked this using Kedro 0.18.3, since 0.18.4 will prevent the charts from loading. There were a couple of minor hangups while running through the tutorial that I've remedied. All should work smoothly now.
I've deleted this secondary docs page with experiment tracking instructions and didn't move anything over into the main page because it was already well covered.
The new work is captured and takes precedent in the ordering on the page, meaning the metrics plots in the experiment tracking page shows up earlier in the documentation versus the metrics plot that can be viewed on the flowchart page.

Lastly, I've updated the gifs and images to reflect the changes in design on the experiment tracking page (now we have tabs for overview, metrics, and plots, and before we didn't).

Checklist

Read the contributing guidelines
Opened this PR as a 'Draft Pull Request' if it is work-in-progress
Updated the documentation to reflect the code changes
Added a description of this change in the RELEASE.md file
Added tests to cover my changes

…eded secondary exp tracking file

…cs-release

docs/source/visualisation/experiment_tracking.md

stichbury · 2023-01-12T11:01:03Z

This looks great, thanks @tynandebold.

I do have a query about Spaceflights. So this now works if you take the most recent spaceflights starter and add the code given above? There's no reliance on any previous part of the Kedro-Viz docs like the previous page on Plotly reporting? I would normally test it myself just to 'dogfood' the instructions, but I'm a bit tied up with the Academy decks today and don't want to hold you up.

I think we still need to introduce Kedro-Viz for anyone (unlikely enough) to be coming straight to this page without having previously installed Viz. We put this into the Plotly page. -- would it be possible to add to this file?

You could also move/edit a small section of the text from where it appears later in the file, so you test the version at the point you mention installing Viz:

"Here comes the fun part of accessing your run data on Kedro-Viz. Having ensured that you are using Kedro-Viz >=4.1.1 (you can confirm your Kedro-Viz version by running kedro info), run"

stichbury · 2023-01-12T11:05:33Z

This looks great to me @tynandebold. LMK when you've made those changes and I'll approve.

Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>

tynandebold · 2023-01-12T13:33:41Z

I've made the changes now @stichbury. I took your second suggestion and moved the testing of the Viz version to after the install step. Let me know how that reads and if it's in the correct place.

And to your query about Spaceflights: yes, all worked fine for me when I tested with the latest version of Spaceflights (using the kedro new command) and added the code from this PR. I used a fresh conda environment for testing this and followed the instruction from this page only, taking nothing from any other page in the docs.

Could there have been some leftover dependencies hanging around in my new conda environment that were populated from an older one? I'm not too familiar with this. Again, it worked fine for me with 0.18.3 of Kedro and the couple new lines that I added, so I'm quite confident it's solid. I do hope to test with Merel's fix though, to ensure it'll work with 0.18.4.

…cs-release

docs/source/visualisation/experiment_tracking.md

stichbury

LGTM, thanks for this! 🌟 🌟 🌟 🌟 🌟

docs/source/visualisation/experiment_tracking.md

merelcht

This looks good to me! And so exciting to see the time series and parallel coordinates! 🥳

Here's the list of changes in this commit: - Moved some of the "experiment tracking" explanation from the logging file into the current file - Added a "Why use Kedro experiment tracking?" section - Made sure users knew to install dependencies - Fixed formatting and language from the previous versions of the documentation

yetudada

Thank you so much for this stellar work @tynandebold! I've made some additions and I've left a bigger question which we might need @rashidakanchwala or @merelcht to help with.

docs/source/visualisation/experiment_tracking.md

yetudada · 2023-01-13T11:39:00Z

docs/source/visualisation/experiment_tracking.md


 ## Set up tracking datasets

 There are two types of tracking datasets: [`tracking.MetricsDataSet`](/kedro.extras.datasets.tracking.MetricsDataSet) and [`tracking.JSONDataSet`](/kedro.extras.datasets.tracking.JSONDataSet). The `tracking.MetricsDataSet` should be used for tracking numerical metrics, and the `tracking.JSONDataSet` can be used for tracking any other JSON-compatible data like boolean or text-based data.

-Set up two datasets to log `r2 scores` and `parameters` for each run by adding the following in the `conf/base/catalog.yml` file:
+Set up two datasets to log the columns used in the companies dataset (`companies_columns`) and experiment metrics for the `active_modelling_pipeline` (`active_modelling_pipeline.metrics`) like the coefficient of determination (`r2 score`),  max error (`me`) and mean absolute error (`mae`) by adding the following in the `conf/base/catalog.yml` file:

 ```yaml
 active_modelling_pipeline.metrics:


My only other comment is based on this and it might need to be a bigger change to how we structure this tutorial. We introduce namespacing here and in the plot comparison section (data_processing.confusion_matrix in catalog.yml) but we don't actually explain what the full stop means here.

It's important that we do because I got confused with the guidance in 113 - Note that the output dataset must exactly match the name of the tracking dataset specified in the catalog file. because users are asked to just specify metrics and not active_modelling_pipeline.metrics.

So my question would be:

Can we, ideally, do this tutorial without introducing the concept of namespaces too (it's an information overload)?

And if not, how do we introduce it?

Great spot @yetudada ! We do indeed not need the namespace here, as no namespacing has been applied to the pipelines.

Awesome let me update the other mentions of active_modelling_pipeline.metrics 😊

docs/source/visualisation/experiment_tracking.md

Co-authored-by: Yetunde Dada <43755008+yetudada@users.noreply.github.com>

docs/source/visualisation/experiment_tracking.md

merelcht · 2023-01-13T12:45:53Z

docs/source/visualisation/experiment_tracking.md


 ## Set up tracking datasets

 There are two types of tracking datasets: [`tracking.MetricsDataSet`](/kedro.extras.datasets.tracking.MetricsDataSet) and [`tracking.JSONDataSet`](/kedro.extras.datasets.tracking.JSONDataSet). The `tracking.MetricsDataSet` should be used for tracking numerical metrics, and the `tracking.JSONDataSet` can be used for tracking any other JSON-compatible data like boolean or text-based data.

-Set up two datasets to log `r2 scores` and `parameters` for each run by adding the following in the `conf/base/catalog.yml` file:
+Set up two datasets to log the columns used in the companies dataset (`companies_columns`) and experiment metrics for the `active_modelling_pipeline` (`active_modelling_pipeline.metrics`) like the coefficient of determination (`r2 score`),  max error (`me`) and mean absolute error (`mae`) by adding the following in the `conf/base/catalog.yml` file:

 ```yaml
 active_modelling_pipeline.metrics:


Great spot @yetudada ! We do indeed not need the namespace here, as no namespacing has been applied to the pipelines.

yetudada

I think we just need a general check to see if this tutorial runs from beginning to end in a clean virtual environment with all the changes but otherwise I think we're ready to go live.

yetudada · 2023-01-13T13:28:58Z

docs/source/visualisation/experiment_tracking.md


 ## Set up tracking datasets

 There are two types of tracking datasets: [`tracking.MetricsDataSet`](/kedro.extras.datasets.tracking.MetricsDataSet) and [`tracking.JSONDataSet`](/kedro.extras.datasets.tracking.JSONDataSet). The `tracking.MetricsDataSet` should be used for tracking numerical metrics, and the `tracking.JSONDataSet` can be used for tracking any other JSON-compatible data like boolean or text-based data.

-Set up two datasets to log `r2 scores` and `parameters` for each run by adding the following in the `conf/base/catalog.yml` file:
+Set up two datasets to log the columns used in the companies dataset (`companies_columns`) and experiment metrics for the `active_modelling_pipeline` (`active_modelling_pipeline.metrics`) like the coefficient of determination (`r2 score`),  max error (`me`) and mean absolute error (`mae`) by adding the following in the `conf/base/catalog.yml` file:

 ```yaml
 active_modelling_pipeline.metrics:


Awesome let me update the other mentions of active_modelling_pipeline.metrics 😊

docs/source/visualisation/experiment_tracking.md

Co-authored-by: Yetunde Dada <43755008+yetudada@users.noreply.github.com>

Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>

Co-authored-by: Yetunde Dada <43755008+yetudada@users.noreply.github.com>

docs/source/visualisation/experiment_tracking.md

stichbury · 2023-01-13T14:20:01Z

I think we just need a general check to see if this tutorial runs from beginning to end in a clean virtual environment with all the changes but otherwise I think we're ready to go live.

I'm happy to try this out as I want to test it out and learn about experiment tracking, but I'm not really able to right now. I think @tynandebold needs to hold off until the next viz release anyway, so hopefully this can wait until next week -- I'll get it done before the end of the sprint.

Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>

tynandebold · 2023-01-16T10:22:48Z

docs/source/visualisation/experiment_tracking.md

+You must update the `src/requirements.txt` file in your Kedro project by adding the following dataset to enable Matplotlib for your project:
+
+```bash
+kedro[matplotlib.MatplotlibWriter]==0.18.3


I think we can remove this now, can't we? Once kedro-org/kedro-viz#1214 is merged we shouldn't need to specify a Kedro version.

True! I have to make some wholesale changes to the other viz docs when that PR merges so can remove this then if you merge this PR ahead of #1214 but otherwise, if it won't merge in until later, you can certainly remove this.

I've removed it since this PR will be merged in the next couple of days.

tynandebold · 2023-01-16T10:25:31Z

@stichbury you'll have a couple days to test it out. Hopefully we release Viz tomorrow or Wednesday.

Separately, in this PR I think we should remove the references to Kedro version 0.18.3 in this file, since that problem should be solved when this PR is completed. What do you think?

stichbury · 2023-01-16T11:12:30Z

@stichbury you'll have a couple days to test it out. Hopefully we release Viz tomorrow or Wednesday.

Separately, in this PR I think we should remove the references to Kedro version 0.18.3 in this file, since that problem should be solved when this PR is completed. What do you think?

I'll prioritise this on Tuesday 17th.

…cs-release

… docs/improve-viz-docs-for-exp-tracking-metrics-release Signed-off-by: Tynan DeBold <thdebold@gmail.com>

… of https://github.com/quantumblacklabs/kedro into docs/improve-viz-docs-for-exp-tracking-metrics-release Signed-off-by: Tynan DeBold <thdebold@gmail.com>

Signed-off-by: Tynan DeBold <thdebold@gmail.com>

…cs-release

stichbury · 2023-01-17T14:43:38Z

@stichbury you'll have a couple days to test it out. Hopefully we release Viz tomorrow or Wednesday.
Separately, in this PR I think we should remove the references to Kedro version 0.18.3 in this file, since that problem should be solved when this PR is completed. What do you think?

I'll prioritise this on Tuesday 17th.

@tynandebold I've now tested the text and all is good. I made a couple of changes to the experiment tracking page to tweak it (nothing major) and also removed mention of Kedro 0.18.3 in the Viz docs about plotly charts, which I've sneaked into this PR. Hope that's OK!

tynandebold · 2023-01-17T15:35:09Z

Fantastic, @stichbury! Thank you 💥

…cs-release

Update experiment_tracking docs file and necessary media; delete unne…

87d50b8

…eded secondary exp tracking file

tynandebold requested review from stichbury and merelcht January 11, 2023 14:39

tynandebold requested a review from yetudada as a code owner January 11, 2023 14:39

Merge branch 'main' into docs/improve-viz-docs-for-exp-tracking-metri…

118284d

…cs-release

tynandebold commented Jan 11, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Show resolved Hide resolved

tynandebold commented Jan 11, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Show resolved Hide resolved

tynandebold mentioned this pull request Jan 11, 2023

Improve Kedro-Viz documentation for Experiment Tracking kedro-org/kedro-viz#1241

Closed

stichbury reviewed Jan 12, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Outdated Show resolved Hide resolved

stichbury reviewed Jan 12, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Outdated Show resolved Hide resolved

stichbury reviewed Jan 12, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Outdated Show resolved Hide resolved

stichbury reviewed Jan 12, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Outdated Show resolved Hide resolved

tynandebold and others added 5 commits January 12, 2023 13:15

Update docs/source/visualisation/experiment_tracking.md

b28dc72

Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>

Update docs/source/visualisation/experiment_tracking.md

e6bbf1f

Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>

Update docs/source/visualisation/experiment_tracking.md

967e101

Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>

Update docs/source/visualisation/experiment_tracking.md

5eb26a5

Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>

Ensure Viz version is correct after install

8ef4c3e

Merge branch 'main' into docs/improve-viz-docs-for-exp-tracking-metri…

d49fed0

…cs-release

tynandebold requested a review from stichbury January 12, 2023 13:50

stichbury reviewed Jan 12, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Show resolved Hide resolved

Update docs/source/visualisation/experiment_tracking.md

ce45380

stichbury approved these changes Jan 12, 2023

View reviewed changes

stichbury reviewed Jan 12, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Outdated Show resolved Hide resolved

Update docs/source/visualisation/experiment_tracking.md

dc012c1

stichbury reviewed Jan 12, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Outdated Show resolved Hide resolved

Update docs/source/visualisation/experiment_tracking.md

b445ff5

stichbury reviewed Jan 12, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Outdated Show resolved Hide resolved

Update docs/source/visualisation/experiment_tracking.md

ef78ff3

merelcht approved these changes Jan 12, 2023

View reviewed changes

yetudada requested changes Jan 13, 2023

View reviewed changes

Update docs/source/visualisation/experiment_tracking.md

e9cf853

Co-authored-by: Yetunde Dada <43755008+yetudada@users.noreply.github.com>

merelcht reviewed Jan 13, 2023

View reviewed changes

yetudada approved these changes Jan 13, 2023

View reviewed changes

tynandebold and others added 3 commits January 13, 2023 13:57

Update docs/source/visualisation/experiment_tracking.md

470bcdd

Co-authored-by: Yetunde Dada <43755008+yetudada@users.noreply.github.com>

Update docs/source/visualisation/experiment_tracking.md

1ad697d

Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>

Update docs/source/visualisation/experiment_tracking.md

8cc1381

Co-authored-by: Yetunde Dada <43755008+yetudada@users.noreply.github.com>

stichbury reviewed Jan 13, 2023

View reviewed changes

docs/source/visualisation/experiment_tracking.md Outdated Show resolved Hide resolved

tynandebold self-assigned this Jan 16, 2023

Update docs/source/visualisation/experiment_tracking.md

c1596e0

Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>

tynandebold commented Jan 16, 2023

View reviewed changes

tynandebold and others added 7 commits January 16, 2023 13:05

Merge branch 'main' into docs/improve-viz-docs-for-exp-tracking-metri…

abcb928

…cs-release

Merge branch 'main' of https://github.com/quantumblacklabs/kedro into…

394df58

… docs/improve-viz-docs-for-exp-tracking-metrics-release Signed-off-by: Tynan DeBold <thdebold@gmail.com>

Merge branch 'docs/improve-viz-docs-for-exp-tracking-metrics-release'…

a410e87

… of https://github.com/quantumblacklabs/kedro into docs/improve-viz-docs-for-exp-tracking-metrics-release Signed-off-by: Tynan DeBold <thdebold@gmail.com>

Remove specfifc Kedro version for install

63a63b7

Signed-off-by: Tynan DeBold <thdebold@gmail.com>

Minor tweaks

e097cc0

Fix some build issues

f997126

Merge branch 'main' into docs/improve-viz-docs-for-exp-tracking-metri…

ccd2081

…cs-release

Merge branch 'main' into docs/improve-viz-docs-for-exp-tracking-metri…

708c789

…cs-release

tynandebold merged commit 68bf9c9 into main Jan 18, 2023

tynandebold deleted the docs/improve-viz-docs-for-exp-tracking-metrics-release branch January 18, 2023 15:45

stichbury mentioned this pull request Jan 30, 2023

Update docs about experiment tracking to adjust version number and add more plotly guidance kedro-org/kedro-viz#1242

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Kedro-Viz docs for Experiment Tracking #2193

Improve Kedro-Viz docs for Experiment Tracking #2193

tynandebold commented Jan 11, 2023 •

edited

stichbury commented Jan 12, 2023 •

edited

stichbury commented Jan 12, 2023

tynandebold commented Jan 12, 2023

stichbury left a comment

merelcht left a comment

yetudada left a comment

yetudada Jan 13, 2023

merelcht Jan 13, 2023

yetudada Jan 13, 2023

merelcht Jan 13, 2023

yetudada left a comment

yetudada Jan 13, 2023

stichbury commented Jan 13, 2023

tynandebold Jan 16, 2023

stichbury Jan 16, 2023

tynandebold Jan 16, 2023

tynandebold commented Jan 16, 2023

stichbury commented Jan 16, 2023

stichbury commented Jan 17, 2023

tynandebold commented Jan 17, 2023

Improve Kedro-Viz docs for Experiment Tracking #2193

Improve Kedro-Viz docs for Experiment Tracking #2193

Conversation

tynandebold commented Jan 11, 2023 • edited

Description

Development notes

Checklist

stichbury commented Jan 12, 2023 • edited

stichbury commented Jan 12, 2023

tynandebold commented Jan 12, 2023

stichbury left a comment

Choose a reason for hiding this comment

merelcht left a comment

Choose a reason for hiding this comment

yetudada left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yetudada left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stichbury commented Jan 13, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tynandebold commented Jan 16, 2023

stichbury commented Jan 16, 2023

stichbury commented Jan 17, 2023

tynandebold commented Jan 17, 2023

tynandebold commented Jan 11, 2023 •

edited

stichbury commented Jan 12, 2023 •

edited