Simplify examples #456

RasmusOrsoe · 2023-03-19T15:38:27Z

This PR attempts to introduce a simplified training script (01_train_model_simple.py) where we showcase training syntax with minimal lines of code. What I did to simplify was:

1. Removed Weights & Biases from this example (it's been a showstopper for some people).
2. Removed the CLI for simplicity.
3. Hard-coded the configs for clarity.
4. Provided a default value for callbacks (automatically includes early stopping if a validation loader is provided).
5. Provided default values for target_labels and output_labels for all tasks, and made these available under model.output_labels and model.target_labels (closes Make prediction column names self-contained #175).

These changes have brought the training example down from 183 to 68 lines.
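
As a rough illustration of the default-callbacks idea described above (early stopping is only added when there is validation data to monitor), here is a minimal, self-contained sketch. The classes `EarlyStopping` and `ProgressBar` are hypothetical stand-ins defined here, and the function names `default_callbacks` and `resolve_callbacks` are assumptions for illustration; this is not the code added in the PR.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class EarlyStopping:
    """Hypothetical stand-in for a real early-stopping callback."""

    monitor: str = "val_loss"
    patience: int = 5


@dataclass
class ProgressBar:
    """Hypothetical stand-in for a progress-bar callback."""

    refresh_rate: int = 1


def default_callbacks(has_validation_loader: bool, patience: int = 5) -> List[object]:
    """Build a default callback list; early stopping needs validation data."""
    callbacks: List[object] = [ProgressBar()]
    if has_validation_loader:
        callbacks.append(EarlyStopping(monitor="val_loss", patience=patience))
    return callbacks


def resolve_callbacks(
    user_callbacks: Optional[List[object]], has_validation_loader: bool
) -> List[object]:
    """Use the caller's callbacks if given, otherwise fall back to defaults."""
    if user_callbacks is not None:
        return user_callbacks
    return default_callbacks(has_validation_loader)
```

The point of the `is not None` check is that an explicitly empty list still means "no callbacks", so the defaults only apply when the user passes nothing at all.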

In addition, I've made the default values for target_labels and output_labels more visible in the Tasks, and written them as @property such that we have to supply these default labels for tasks in the future. I think this makes it a bit easier to grasp what a given Task requires and returns.
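
One way to force every task to declare its default labels is an abstract property, sketched below. The names `BaseTask`, `EnergyTask`, `default_target_labels` and `default_output_labels` are assumptions made for this illustration; the actual Task classes in the repository may be structured differently.

```python
from abc import ABC, abstractmethod
from typing import List, Optional


class BaseTask(ABC):
    """Hypothetical base class: subclasses must declare default labels."""

    def __init__(
        self,
        target_labels: Optional[List[str]] = None,
        output_labels: Optional[List[str]] = None,
    ) -> None:
        # Fall back to the subclass defaults when nothing is passed.
        self.target_labels = (
            self.default_target_labels if target_labels is None else target_labels
        )
        self.output_labels = (
            self.default_output_labels if output_labels is None else output_labels
        )

    @property
    @abstractmethod
    def default_target_labels(self) -> List[str]:
        """Names of the truth columns the task is trained against."""

    @property
    @abstractmethod
    def default_output_labels(self) -> List[str]:
        """Names of the columns the task writes its predictions to."""


class EnergyTask(BaseTask):
    """Hypothetical concrete task with explicit defaults."""

    @property
    def default_target_labels(self) -> List[str]:
        return ["energy"]

    @property
    def default_output_labels(self) -> List[str]:
        return ["energy_pred"]
```

With this layout, a subclass that forgets to define either property cannot be instantiated (Python raises `TypeError`), which is what makes the defaults mandatory for future tasks.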

I have left the other training scripts untouched but renamed them to make the distinction easier.

README has been updated.

asogaard · 2023-03-21T09:32:49Z

Hi @RasmusOrsoe,

Thanks for this! Some initial thoughts before I go into review-mode:

As I see it, this PR does two things,

Adds a simplified training script (items 1-3 in your list above)
Addresses Make prediction column names self-contained #175 by adding default output names to each Task (items 4 and 5 in your list above)

The second point I totally get — it fixes a known and well-defined issue that we have discussed and agree is worth solving. 🚀

The first point I am not sure I understand the need for. If the point is to disable W&B, I figure we could just add a --wandb argument to the CLI, and have it be off by default. As for removing the CLI altogether and hard-coding the config file (as opposed to providing a default one in the CLI), I am not sure I see how that could improve the example. But I am open to convincing. :)
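
For reference, an opt-in flag of the kind suggested here could look like the following sketch using the standard-library `argparse`. The function names `build_parser` and `load_wandb` are hypothetical, and the example scripts use their own argument parser, so treat this as an illustration of the pattern rather than the change that was made.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Parser with W&B logging disabled unless --wandb is passed."""
    parser = argparse.ArgumentParser(description="Train an example model.")
    parser.add_argument(
        "--wandb",
        action="store_true",  # defaults to False, so logging is opt-in
        help="If set, log training metrics to Weights & Biases",
    )
    return parser


def load_wandb(enabled: bool):
    """Import wandb lazily so it is only required when requested."""
    if not enabled:
        return None
    import wandb  # only needed (and only imported) when --wandb is given

    return wandb
```

Because the import happens inside the enabled branch, users who never pass `--wandb` do not need a working W&B installation or login to run the script.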

RasmusOrsoe · 2023-03-21T10:10:53Z

Hey @asogaard

I definitely think there is a need for some resource that provides a simple point of entry into the code base in as few lines as possible.

I'd like the example script to default to no wandb (not because of personal feelings!) because a number of people have had trouble running the example scripts because of it. It was actually caught on a live stream here.

I hard-coded the configs because the example scripts (our only point of entry currently) hide these away in default arguments and don't really provide a transparent view of what is actually being configured.

Perhaps the conclusion is that an actual written tutorial is a better format than an example script. So we could proceed as:

Remove changes to example scripts
Add a written tutorial instead that explains each element in my suggested training script, such that new users are better equipped.

What do you think @asogaard?

asogaard · 2023-03-21T12:56:30Z

I completely agree that our main training example should be as simple as possible, but no simpler! 😊 So any ideas for making them a more friendly point of entry are more than welcome!

I agree with your proposed solution, but please do feel free to make wandb opt-in (or should I make a separate PR for that?). Regarding the config files: all default values are described when you run

$ python examples/04_training/01_train_model.py --help
usage: 01_train_model.py [-h] [--gpus GPUS [GPUS ...]] [--max-epochs MAX_EPOCHS] [--early-stopping-patience EARLY_STOPPING_PATIENCE]
                         [--batch-size BATCH_SIZE] [--num-workers NUM_WORKERS] [--dataset-config DATASET_CONFIG] [--model-config MODEL_CONFIG]
                         [--prediction-names PREDICTION_NAMES [PREDICTION_NAMES ...]] [--suffix SUFFIX]
(...)
  --dataset-config DATASET_CONFIG
                        Path to dataset config file (default: {CONFIG_DIR}/datasets/training_example_data_sqlite.yml)
  --model-config MODEL_CONFIG
                        Path to model config file (default: {CONFIG_DIR}/models/example_energy_reconstruction_model.yml)

and similarly for all example scripts. This is mentioned here, but if people are not familiar with the standard Python CLI, I think this would be good to point out in a tutorial or similar!
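
For readers less familiar with Python command-line tools, the sketch below shows how a parser can expose its defaults through `--help` using the standard-library `argparse.ArgumentDefaultsHelpFormatter`. The program name, arguments and default values here are made up for illustration and are not those of the example scripts.

```python
import argparse

# ArgumentDefaultsHelpFormatter appends "(default: ...)" to each help string.
parser = argparse.ArgumentParser(
    prog="train_example.py",  # hypothetical script name
    formatter_class=argparse.ArgumentDefaultsHelpFormatter,
)
parser.add_argument(
    "--max-epochs",
    type=int,
    default=5,  # illustrative default
    help="Maximum number of training epochs",
)
parser.add_argument(
    "--model-config",
    default="configs/model.yml",  # illustrative default
    help="Path to model config file",
)

# format_help() returns the same text that `--help` would print.
help_text = parser.format_help()
```

Running such a script with `--help` prints each option together with its default, so the configuration stays discoverable without hard-coding it in the body of the script.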

RasmusOrsoe · 2023-03-26T08:57:47Z

@asogaard OK - I think that's it.

asogaard

Some suggestions. :)

examples/04_training/01_train_model.py

examples/04_training/02_train_model_without_configs.py

src/graphnet/models/task/task.py

RasmusOrsoe · 2023-03-28T11:06:53Z

OK. I think this is it @asogaard

asogaard

I have added a few comments that I think might have been missed in the last batch, @RasmusOrsoe. :)

src/graphnet/models/model.py

src/graphnet/models/standard_model.py

src/graphnet/models/task/task.py

RasmusOrsoe · 2023-04-03T10:08:02Z

@asogaard Could we maybe go through these comments on zoom sometime?

asogaard · 2023-04-03T15:27:39Z

@RasmusOrsoe Absolutely!

RasmusOrsoe · 2023-04-13T14:38:46Z

@asogaard I think this is it. However, now I see that the data conversion unit tests fail even though no changes have been made to that part of the repo. I've scrolled through the error messages and I'm a bit puzzled. Do you have any idea what could be causing this?

asogaard · 2023-04-13T14:40:40Z

It's probably due to the pandas version problem I mention in the other PR. We'll rerun the tests once that is merged.

asogaard · 2023-04-14T05:50:10Z

Try merging main into this feature branch, following #476. 🤞

asogaard

Thanks so much for your hard work here, @RasmusOrsoe! 🙏 I think this looks great and is ready to merge. I have added a few minor suggestions that you can consider before hitting the big, green button. One of them, regarding warn_once, is actually a typo (probably mine), so at least that one should be applied.

src/graphnet/models/task/task.py

src/graphnet/models/model.py

RasmusOrsoe added 6 commits March 18, 2023 16:42

self contained output and target names

bb78249

polish

35b7b08

added explicit, default target and output labels

045ca1a

polish

4e2aca4

shell script rename

682056d

renamed files and updated readme

561ddb0

RasmusOrsoe requested a review from asogaard March 19, 2023 15:38

RasmusOrsoe added 2 commits March 19, 2023 16:39

further simplification

4a089e4

removed print message

8cc2426

RasmusOrsoe marked this pull request as draft March 20, 2023 07:59

asogaard assigned RasmusOrsoe Mar 21, 2023

RasmusOrsoe added 3 commits March 26, 2023 10:27

removed changes to example scripts

f6652b1

made wandb optional

332c586

made wandb optional

ffe7fb5

asogaard reviewed Mar 28, 2023

RasmusOrsoe and others added 11 commits March 28, 2023 11:20

Update examples/04_training/01_train_model.py

d9b25d7

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update examples/04_training/01_train_model.py

23c162c

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update examples/04_training/01_train_model.py

21618db

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update examples/04_training/02_train_model_without_configs.py

e902422

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update examples/04_training/02_train_model_without_configs.py

711612c

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update examples/04_training/01_train_model.py

2daec4d

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update examples/04_training/02_train_model_without_configs.py

b18b012

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update src/graphnet/models/task/task.py

7c9011d

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update src/graphnet/models/task/task.py

53b0abd

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update src/graphnet/models/task/task.py

bd3e41f

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update src/graphnet/models/task/task.py

a944453

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

RasmusOrsoe and others added 8 commits March 28, 2023 11:31

Update src/graphnet/models/task/reconstruction.py

3f86536

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update src/graphnet/models/task/reconstruction.py

b88fd78

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update src/graphnet/models/task/reconstruction.py

43e8bbe

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

refractor default callbacks

e0e7a52

prediction columns

8285c77

le docce strings

dd5b195

mypy....

eb31074

mypy..

2cb7dc7

asogaard reviewed Mar 31, 2023

src/graphnet/models/model.py (outdated)

src/graphnet/models/model.py (outdated)

src/graphnet/models/standard_model.py (outdated)

src/graphnet/models/task/task.py (outdated)

RasmusOrsoe added 6 commits April 13, 2023 14:42

added warning for patience & early stopping

4d446b7

typo fix

06e76bf

removed try except in predict_as_dataframe

fe8b547

added predict_as_dataframe to standardmodel

b952791

added predict_as_dataframe to standardmodel

0b99a38

prediction labels in task

38d0e96

asogaard approved these changes Apr 14, 2023

src/graphnet/models/task/task.py (outdated)

src/graphnet/models/model.py (outdated)

RasmusOrsoe and others added 5 commits April 18, 2023 13:49

Update src/graphnet/models/task/task.py

6e3f0c4

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

Update src/graphnet/models/model.py

9a6d3ea

Co-authored-by: Andreas Søgaard <andreas.sogaard@gmail.com>

black

0ef572f

flake8

f45c527

cmon

70a9e22

RasmusOrsoe marked this pull request as ready for review April 18, 2023 13:56

RasmusOrsoe merged commit 7ac51fe into graphnet-team:main Apr 18, 2023

RasmusOrsoe added a commit to RasmusOrsoe/graphnet that referenced this pull request Oct 25, 2023

Merge pull request graphnet-team#456 from RasmusOrsoe/simplify_examples

068c834

Simplify examples
