Add Pydantic V1 IO models for use with Hera Runner #920

elliotgunton · 2024-01-10T17:15:44Z

Pull Request Checklist

Part of Improve Hera I/O models #858
Tests added
~~Documentation/examples added~~ See Add pydantic IO docs #939
Good commit messages and/or PR title

Description of PR
Currently hera i/o with annotated params can become extremely verbose. The output syntax is especially error-prone.

This PR introduces custom Input/Output BaseModels for users to subclass, which allow a cleaner arrangement of inputs and outputs for functions. These are available under the script_pydantic_io experimental feature flag.

With these Pydantic input/output models, the following should be noted:

duplicated param names (for normal Parameters as well as the new models) are now detected in Hera rather than when linted by Argo (as well as duplicated artifact names). Parameters and Artifacts having the same name is legal in the Argo spec as they exist in different scopes e.g.

...
      inputs:
        parameters:
          - name: my-name
            default: test
        artifacts:
          - name: my-name
            path: /tmp
            optional: true
...

exit_code and result are reserved attributes for the RunnerOutput. A user trying to use their own parameters with these names would have to be specified with an annotated parameter e.g. my_exit_code: Annotated[int, Parameter(name="exit_code")] (TBC with a test)
Scripts cannot have a return tuple containing any RunnerOutput to avoid multiple exit_codes being specified. @samj1912 / @flaviuvadan this is up for debate but I think would encourage better practices to discourage tuples and have a single script template outputting a single RunnerOutput subclass, and it keeps the logic clearer from the Hera side. Users can still use inline output parameters alongside the RunnerOutput return annotation
Multiple input parameters when using a RunnerInput in the function params is not legal
A RunnerInput's __fields__ as defined by pydantic are used to "explode" the input class into constituent parameters for the Argo spec. i.e. using the following class as an input param to a script function:

class MyInput(RunnerInput):
     my_input_str: str
     my_input_int: int

@script(constructor="runner")
def my_func(my_input: MyInput):
    ...

will create the script template my_func in yaml with Parameters my_input_str and my_input_int, NOT my_input, see the example

codecov · 2024-01-12T14:17:42Z

Codecov Report

Attention: 9 lines in your changes are missing coverage. Please review.

Comparison is base (b3fc378) 80.7% compared to head (a0dd261) 81.2%.
Report is 1 commits behind head on main.

❗ Current head a0dd261 differs from pull request most recent head 221ecae. Consider uploading reports for the commit 221ecae to get more accurate results

Files	Patch %	Lines
src/hera/workflows/runner.py	93.8%	1 Missing and 3 partials ⚠️
src/hera/workflows/io.py	95.2%	0 Missing and 3 partials ⚠️
src/hera/workflows/script.py	94.1%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##            main    #920     +/-   ##
=======================================
+ Coverage   80.7%   81.2%   +0.4%     
=======================================
  Files         50      51      +1     
  Lines       3908    4045    +137     
  Branches     793     843     +50     
=======================================
+ Hits        3157    3287    +130     
- Misses       564     565      +1     
- Partials     187     193      +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

samj1912 · 2024-01-18T13:02:40Z

@elliotgunton can you provide a brief examples/details in the PR description of the functionality that is currently enabled?

Specifically I am curious how we handle things when a user -

has duplicate param names between a non annotated param/annotated param/normal param
has duplicate names between params/artifacts (this should be valid?)
has a parameter/artifact called output/exit code
has multiple output return types
has multiple input types
how is the input object constructed from argo params
I see we added support for defaults as well (is it also added back to the normal params apart from hera io models?)

Having that in the PR description would help future readers understand what is happening.

src/hera/shared/serialization.py

src/hera/workflows/io.py

elliotgunton · 2024-01-24T14:59:18Z

src/hera/workflows/script.py

+            if len(inspect.signature(source).parameters) != 1:
+                raise SyntaxError("Only one function parameter can be specified when using a RunnerInput.")


Only one function param that is a RunnerInput subclass is allowed

elliotgunton · 2024-01-24T15:01:28Z

src/hera/workflows/script.py

+            if isinstance(annotation, type) and issubclass(annotation, RunnerOutput):
+                raise ValueError("RunnerOutput cannot be part of a tuple output")


Only one RunnerOutput is allowed as the function return type. Note that we also haven't dealt with nested Annotated types above.

elliotgunton · 2024-01-24T15:17:40Z

@samj1912 ready for re-review! Self-reviewed to double check this PR stands on its own to implement for Pydantic V1 and signposted the single input/output we discussed 🚀

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

* Add annotations test only in this commit Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

* Tidy up kwarg mapping code - may need another refactor * TODO artifact outputs Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

* Add environment variable to script templates using the feature * Use actual error message for ValueError when flag not enabled Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

* Also allow setting defaults of a RunnerInput param via its func param default * Remove runner tests that use multiple inputs * Add script annotations test for using a RunnerInput in a List Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Part of #858, follow up to #920 * Enables users to use Pydantic V2 objects in their scripts while maintaining V1 usage internally for Hera * RunnerInput/Output classes are created in hera.workflows.io depending on the value of _PYDANTIC_VERSION - users will automatically get classes using their (possibly pinned) version of Pydantic * I have not yet managed to get an explicit test that uses the automatic import of V1 classes - we may just have to rely on the Pydantic V1 CI check. --------- Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

elliotgunton force-pushed the pydantic-io branch 3 times, most recently from b6af175 to 43a25f8 Compare January 12, 2024 14:14

elliotgunton added type:enhancement A general enhancement semver:minor A change requiring a minor version bump labels Jan 12, 2024

elliotgunton force-pushed the pydantic-io branch from 43a25f8 to 6c5b4f1 Compare January 12, 2024 14:15

elliotgunton force-pushed the pydantic-io branch 16 times, most recently from fe9ff79 to 3a0a39f Compare January 17, 2024 17:13

samj1912 reviewed Jan 18, 2024

View reviewed changes

src/hera/shared/serialization.py Outdated Show resolved Hide resolved

src/hera/workflows/io.py Outdated Show resolved Hide resolved

elliotgunton force-pushed the pydantic-io branch 3 times, most recently from 5bc252c to f37fbea Compare January 18, 2024 16:00

elliotgunton commented Jan 18, 2024

View reviewed changes

src/hera/workflows/io.py Outdated Show resolved Hide resolved

elliotgunton force-pushed the pydantic-io branch from 22fb851 to d13cb41 Compare January 18, 2024 17:02

elliotgunton requested a review from flaviuvadan as a code owner January 22, 2024 10:01

elliotgunton changed the title ~~[WIP] Add IO models for use with Hera Runner~~ Add IO models for use with Hera Runner Jan 22, 2024

elliotgunton force-pushed the pydantic-io branch 2 times, most recently from b553cbb to 66a4443 Compare January 24, 2024 14:53

elliotgunton changed the title ~~Add IO models for use with Hera Runner~~ Add Pydantic V1 IO models for use with Hera Runner Jan 24, 2024

elliotgunton force-pushed the pydantic-io branch from 66a4443 to a0dd261 Compare January 24, 2024 15:10

elliotgunton commented Jan 24, 2024

View reviewed changes

elliotgunton requested a review from samj1912 January 24, 2024 15:17

elliotgunton mentioned this pull request Jan 24, 2024

Add RunnerInput/Output Pydantic V2 classes #938

Merged

elliotgunton added 15 commits January 29, 2024 10:57

Add IO models for use with Hera Runner

5b6d06b

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Process RunnerInput/Outputs when running as script runner

372a38f

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Add Artifacts to pydantic io

b733aaa

* Add annotations test only in this commit Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Add runner test with pydantic io, fix serialization

7a38a40

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Add test for artifact inputs

3f2ab00

* Tidy up kwarg mapping code - may need another refactor * TODO artifact outputs Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Add parameter outputs tests

29c2154

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Add artifact input/output tests

d1b2e75

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Process exit_code and result

2d56e9a

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Remove tuple usage of RunnerOutput

13a984b

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Add duplicate named parameter/artifact detection

3f33f29

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Remove unused replace_keys function

3a3aa2e

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Add pydantic io example

a7e7744

* Add environment variable to script templates using the feature * Use actual error message for ValueError when flag not enabled Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Add call to serialize, add object to serialize in io example

f21c9ac

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

Update doc strings

9f93a2f

Signed-off-by: Elliot Gunton <egunton@bloomberg.net>

elliotgunton force-pushed the pydantic-io branch from a0dd261 to 221ecae Compare January 29, 2024 10:57

samj1912 approved these changes Jan 30, 2024

View reviewed changes

samj1912 merged commit 7998f2e into main Jan 30, 2024
20 checks passed

samj1912 deleted the pydantic-io branch January 30, 2024 14:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Pydantic V1 IO models for use with Hera Runner #920

Add Pydantic V1 IO models for use with Hera Runner #920

elliotgunton commented Jan 10, 2024 •

edited

Loading

codecov bot commented Jan 12, 2024 •

edited

Loading

samj1912 commented Jan 18, 2024 •

edited

Loading

elliotgunton Jan 24, 2024

elliotgunton Jan 24, 2024

elliotgunton commented Jan 24, 2024

		if len(inspect.signature(source).parameters) != 1:
		raise SyntaxError("Only one function parameter can be specified when using a RunnerInput.")

		if isinstance(annotation, type) and issubclass(annotation, RunnerOutput):
		raise ValueError("RunnerOutput cannot be part of a tuple output")

Add Pydantic V1 IO models for use with Hera Runner #920

Add Pydantic V1 IO models for use with Hera Runner #920

Conversation

elliotgunton commented Jan 10, 2024 • edited Loading

codecov bot commented Jan 12, 2024 • edited Loading

Codecov Report

samj1912 commented Jan 18, 2024 • edited Loading

elliotgunton Jan 24, 2024

Choose a reason for hiding this comment

elliotgunton Jan 24, 2024

Choose a reason for hiding this comment

elliotgunton commented Jan 24, 2024

elliotgunton commented Jan 10, 2024 •

edited

Loading

codecov bot commented Jan 12, 2024 •

edited

Loading

samj1912 commented Jan 18, 2024 •

edited

Loading