Refactor GraphQL backend #964

antonymilne · 2022-07-07T15:24:31Z

Description

Originally I was just doing a bit of tidying to enable #958, but the deeper I got the more I decided that our current experiment tracking backend was not up to scratch. It's still not perfect, but it's a lot better and we now have some clear steps to improve it further.

The only new functionality introduced here is being able to query by group, which isn't yet used on the fronted. But following this PR it should as a result be much easier to extend now, e.g. to add plots.

Renaming and restructuring

api has been broken into two new folders that reflect the GraphQL and REST parts of the app. All GraphQL related code was previously in graphql.py; this has now been broken down into several separate files.

api
├── __init__.py
├── apps.py
├── graphql
│   ├── __init__.py
│   ├── router.py
│   ├── schema.py
│   ├── serializers.py
│   └── types.py
└── rest
    ├── __init__.py
    ├── responses.py
    └── router.py

Some naming has been aligned (e.g. experiments_tracking.py ➡️ experiment_tracking.py, graph.py ➡️ flowchart.py).

Separation of responsibility

To separate out responsibilities better we now have some new components on the backend:

in the data access layer, a new TrackingDatasetRepository
in the domain models, TrackingDatasetModel and TrackingDatasetGroup.

This means that, e.g., data loading is no longer leaking out into the API as it was before.

The way that tracking data is populated and retrieved is worth documenting here:

initially populate_data when the app is first loaded, TrackingDatasetRepository is populated by data_access_manager.add_catalog
this adds tracking datasets for the whole pipeline (i.e. the registered pipelines dropdown menu and --pipeline argument affect only the flowchart view)
If you add a new tracked dataset and do a kedro run then the new dataset won't show up in experiment tracking unless --autoreload is set, because populate_data won't be called again
the data for a run_id is only loaded (through dataset._load) when the GraphQL query run_tracking_data is run with that run_id. Subsequent run_tracking_data queries will not perform the load from disk again but instead read out of memory

Query by group

The runTrackingData query now accepts a group argument with type TrackingDatasetGroup:

enum TrackingDatasetGroup {
  METRIC
  JSON
}

This is not used yet on the frontend but will be very useful to query without needing another layer in the tracking datasets hierarchy. e.g.

query QueryMetricsJSON {
  metric: runTrackingData(group: METRIC, runIds: ...) {
    data
    datasetName
    datasetType
  }
  json: runTrackingData(group: JSON, runIds: ...) {
    data
    datasetName
    datasetType
  }
}

Tidying implementation

Lots of small changes to implementation just to make things tidier, e.g. using json.JSONDataSet._load rather than re-writing it; removing unnecessary custom strawberry type JSONObject; putting strawberry resolvers directly as class methods rather than external functions.

Documentation

The backend (specifically strawberry) defines the GraphQL schema and writes it to schema.graphql and generates a png from it:

A new CI check ensures this is always up to date.

All parts of the schema now have descriptions. Unlike docstrings, these are rendered in GraphiQL:

Next steps

Write lots of tests. Improve the way we do GraphQL tests in general (various ideas here).

For adding plots to experiment tracking:

Add query by group as TrackingDatasetGroup and produce existing behaviour for metrics and JSON.
Try to align TrackingDatasetModel and TrackingDatset. Consider model for run which would simplify (maybe even remove) format_run_tracking_data. How to query by run_id correctly?
Might be worth doing a new model for each TrackingDatasetRun in strawberry but not a dataclass model. Maybe do this as GraphQL interface with different implementations, serializers for plots, etc.

Important other refactoring:

Reuse DataNode and DataNoteMetadata models. There's too much duplication between these are tracking datasets.
Better system for check_db_session, e.g. decorator argument that returns empty iterable (could be done automatically from type hint)? Null class?
Consider whether is_tracking_dataset should use isinstance instead, but be careful with imports
Think about serizalisers. Is format_runs needed? Should formatting go into constructor or class method? Are they needed at all?
Consider structure of GraphQL models and response. e.g. why isn't TrackedDataSets a field in Run?

QA notes

Extended Python GraphQL query tests to be more e2e and accommodate new structure ✅
Manually tested ✅
Tested schema generated and CI check works ✅

Checklist

Read the contributing guidelines
Opened this PR as a 'Draft Pull Request' if it is work-in-progress
Updated the documentation to reflect the code changes
Added new entries to the RELEASE.md file
Added tests to cover my changes

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

…re/refactor-graphql Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

This reverts commit 35a3d6a.

limdauto · 2022-07-07T15:37:04Z

package/kedro_viz/api/graphql.py

@@ -49,7 +54,7 @@ class JSONObject(dict):
        description="Generic scalar type representing a JSON object",
    )

-
+# TODO: where should format functions go?


hey so this is what I usually call serializers in API development. A serializer takes data from one format (domain model) and serializer it into another format (GraphQL type or FastAPI Response Model) so a file called graphql_serializers might be a good idea.

Thanks Lim! This is exactly what I was thinking.

My other idea was that it should live in the Run type itself. It seems that strawberry.scalar has a serialize option but strawberry.type doesn't, so would need to do this as a method:

@strawberry.type class Run: author: Optional[str] bookmark: Optional[bool] ... def format(self): ...

What do you think? Is this a horrible misunderstanding of the purpose of these objects (maybe there's a good reason that strawberry.type doesn't have serialize) or a good way to do it?

Yea, one down side I can think of is unit testing. Ideally you would want to be able to test the serialization logic which is mostly standalone and stateless without having to mock out the whole API

This is what I suspected, thank you for confirming!

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

antonymilne · 2022-07-08T17:00:28Z

Here's how I've reorganised things:

api
├── __init__.py
├── apps.py
├── graphql
│   ├── __init__.py
│   ├── router.py
│   ├── schema.py
│   ├── serializers.py
│   └── types.py
└── rest
    ├── __init__.py
    ├── responses.py
    └── router.py

My proposal for tests would be:

router is very small and doesn't need tests
types doesn't need testing
schema methods (query, mutation, subscription) largely delegate to data_access_manager.get... (already covered by unit tests) and then call format functions on the results if required. Tests are covered by e2e-style query tests that we already have
serializers should be unit tested - currently missing

What do you think? Please tell me if this sounds right to you 🙏 Thank you very much! @limdauto

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

…re/refactor-graphql Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

rashidakanchwala

thanks for this <3 -- amazinggggggg!

tynandebold

Legend! Thank you so much 💥 I didn't see any issues when testing the app. All worked as it should.

Does this work warrant a line in the release.md file?

tynandebold

Helping Rashida with her next-step PR, I'm seeing an issue: the metadata and tracking data aren't showing up in the proper order anymore. It may be difficult to explain here. Let's sync and I'll show you to see if you see it too.

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

tynandebold

Brilliant!

limdauto

Sorry I didn't have a lot of time to look into this in details but the new structure looks great to me. Thank you!

Re testing logic: +1 to your proposal.

limdauto · 2022-07-19T16:05:19Z

Makefile

@@ -37,12 +37,22 @@ lint-check:
 	flake8 --config=package/.flake8 package
 	mypy --config-file=package/mypy.ini package

+schema-fix:


curious: what's the need for this?

This is what automatically generates the graphql.schema file and png from the strawberry schema. It's called schema-fix (bit of a weird name I know) by analogy with make format-fix, which will make the make format-check CI pass. Here schema-fix will make the make schema-check CI that tests if the schema file and diagram is up to date pass.

limdauto · 2022-07-19T16:17:52Z

@AntonyMilneQB if you are fishing for next project, I'd recommend drilling on this one:

If you add a new tracked dataset and do a kedro run then the new dataset won't show up in experiment tracking unless --autoreload is set, because populate_data won't be called again

This is related to this issue that I opened a few weeks ago: #872 -- the ideal flow in my head is:

populate_data will populate a sqlite db through the repositories in the data access layer on first run. Think of the sqlite as a backend for the data access manager. Currently it's an in-memory backend.
Then we listen to changes on the Kedro project and add new entry to this database without having to re-run the whole populate_data like --autoreload is currently doing.
The API layer can keep polling this DB and return new data to the client via subscription.

antonymilne · 2022-07-19T16:45:28Z

@limdauto thanks so much for the comments, much appreciated! Makes sense about populate_data also - let me add that comment to #872 so I don't lose track of it.

This PR was created based on the backend refactor ticket. #964 The backend has now changed and for runTrackingData -- it will now get queries also based on the group (dataset type) the tracking data belongs to i.e. Metrics, JSON Data (and in future Plots) In this ticket, we have adjusted the front-end to get information from the backend for each dataset. If a particular group (Metrics, JSON Dataset) has no datasets, that group will not be shown in the front-end. Otherwise all run tracking data will now have another parent in the heirarchy/accordian i.e. type of dataset (Metrics, JSON Data, Plots)

antonymilne added 7 commits June 23, 2022 13:37

Create jupyter-server-proxy entrypoint

8d3ba79

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Merge remote-tracking branch 'origin/main' into main

da0bdcd

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Move some bits around

62fb03b

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Create jupyter-server-proxy entrypoint

35a3d6a

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Move some bits around

1bc7ac4

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Merge remote-tracking branch 'origin/chore/refactor-graphql' into cho…

b7ffbb9

…re/refactor-graphql Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Revert "Create jupyter-server-proxy entrypoint"

32adef7

This reverts commit 35a3d6a.

limdauto reviewed Jul 7, 2022

View reviewed changes

Break up api into packages

1099f06

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

antonymilne added 9 commits July 8, 2022 18:08

Lint

891f862

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Move schema check to Makefile

299940e

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Fix CI

ede7ef9

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Fix CI

217cbc1

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Fix schema

49aa608

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Fix schema

ebcaf57

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Generate schema.png

28a5a01

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Add schema viz to docs

7dfc79b

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Mass renaming

61d120f

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

tynandebold mentioned this pull request Jul 11, 2022

[KED-3008] Set up linter & linting rules for .graphql files #778

Closed

antonymilne added 4 commits July 11, 2022 20:45

Rename to DataSet

7420fb5

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Start to move get_tracking_datasets to data access layer

ee25e99

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Create new TrackingDatasetsRepository and models

243a613

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Move dataset_group to key rather than model property

f90b346

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

tynandebold added Enhancement Python Pull requests that update Python code labels Jul 15, 2022

tynandebold assigned antonymilne Jul 15, 2022

tynandebold mentioned this pull request Jul 15, 2022

[To be discarded] [KED-953] - Visualise Plots in Experiment Tracking #958

Closed

5 tasks

antonymilne added 2 commits July 15, 2022 17:38

Huge test fix

c043836

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Test coverage

fc7e23f

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

antonymilne requested review from rashidakanchwala, tynandebold and limdauto July 18, 2022 13:49

antonymilne and others added 4 commits July 18, 2022 14:53

Merge branch 'main' into chore/refactor-graphql

ff59015

Bug fix

596086c

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Merge remote-tracking branch 'origin/chore/refactor-graphql' into cho…

f69f2ea

…re/refactor-graphql Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Add TrackingDatasetGroup Enum

3092fa1

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

rashidakanchwala approved these changes Jul 19, 2022

View reviewed changes

tynandebold approved these changes Jul 19, 2022

View reviewed changes

tynandebold self-requested a review July 19, 2022 12:26

tynandebold requested changes Jul 19, 2022

View reviewed changes

antonymilne added 3 commits July 19, 2022 15:12

Probably fix bug

198a020

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Fix runsMetadata ordering

9d000f1

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

Now it's fixed

ad807ab

Signed-off-by: Antony Milne <antony.milne@quantumblack.com>

tynandebold self-requested a review July 19, 2022 16:06

tynandebold approved these changes Jul 19, 2022

View reviewed changes

limdauto approved these changes Jul 19, 2022

View reviewed changes

antonymilne mentioned this pull request Jul 19, 2022

[Refactor] Changing the repository layer from in-memory to sqlite #872

Closed

Merge branch 'main' into chore/refactor-graphql

91dfb46

antonymilne mentioned this pull request Jul 19, 2022

[Spike] Experiment tracking test refactoring #980

Closed

antonymilne merged commit 45d81ef into main Jul 19, 2022

antonymilne deleted the chore/refactor-graphql branch July 19, 2022 16:57

rashidakanchwala mentioned this pull request Jul 20, 2022

FE changes based on GraphQL Refactor #978

Merged

5 tasks

rashidakanchwala mentioned this pull request Jul 20, 2022

Feature/Visualise Plots in Experiment Tracking #984

Merged

5 tasks

tynandebold mentioned this pull request Aug 19, 2022

Matplotlib and Plotly visualizations aren't displaying in Metadata panel #1022

Closed

1 task

antonymilne mentioned this pull request Oct 12, 2022

Build out a query to return data for exp. tracking metrics plots #1133

Closed

1 task

ravi-kumar-pilla mentioned this pull request Oct 30, 2023

Fix dataset factory patterns in Experiment Tracking #1588

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor GraphQL backend #964

Refactor GraphQL backend #964

antonymilne commented Jul 7, 2022 •

edited

limdauto Jul 7, 2022

antonymilne Jul 8, 2022

limdauto Jul 8, 2022

antonymilne Jul 8, 2022

antonymilne commented Jul 8, 2022 •

edited

rashidakanchwala left a comment

tynandebold left a comment

tynandebold left a comment

tynandebold left a comment

limdauto left a comment

limdauto Jul 19, 2022

antonymilne Jul 19, 2022

limdauto commented Jul 19, 2022

antonymilne commented Jul 19, 2022

Refactor GraphQL backend #964

Refactor GraphQL backend #964

Conversation

antonymilne commented Jul 7, 2022 • edited

Description

Renaming and restructuring

Separation of responsibility

Query by group

Tidying implementation

Documentation

Next steps

QA notes

Checklist

limdauto Jul 7, 2022

Choose a reason for hiding this comment

antonymilne Jul 8, 2022

Choose a reason for hiding this comment

limdauto Jul 8, 2022

Choose a reason for hiding this comment

antonymilne Jul 8, 2022

Choose a reason for hiding this comment

antonymilne commented Jul 8, 2022 • edited

rashidakanchwala left a comment

Choose a reason for hiding this comment

tynandebold left a comment

Choose a reason for hiding this comment

tynandebold left a comment

Choose a reason for hiding this comment

tynandebold left a comment

Choose a reason for hiding this comment

limdauto left a comment

Choose a reason for hiding this comment

limdauto Jul 19, 2022

Choose a reason for hiding this comment

antonymilne Jul 19, 2022

Choose a reason for hiding this comment

limdauto commented Jul 19, 2022

antonymilne commented Jul 19, 2022

antonymilne commented Jul 7, 2022 •

edited

antonymilne commented Jul 8, 2022 •

edited