@andrew-coleman
Member

Adds the formal dialect definition for the Spark converter. The file is generated by code that extracts the relevant information from the existing FunctionMapper definitions. This ensures that the dialect YAML remains in sync with the code.
Tests have been introduced to ensure that the dialect conforms to the schema (defined in the base substrait library), and that the published dialect YAML is equivalent to the one that would be generated from the current code.
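
The second check described above is a golden-file comparison: regenerate the artifact from the code and fail if it differs from the committed copy. Below is a minimal, self-contained sketch of that pattern, not the PR's actual implementation. `generateDialect`, `checkPublished`, and the toy YAML layout are hypothetical stand-ins; the real generator reads the FunctionMapper definitions and the real dialect schema lives in the base substrait library.

```scala
import java.nio.charset.StandardCharsets
import java.nio.file.{Files, Path}

object DialectGoldenFileSketch {
  // Hypothetical generator: the real one derives its output from the
  // FunctionMapper definitions. The YAML layout here is illustrative only.
  def generateDialect(functions: Seq[String]): String =
    functions.sorted.map(f => s"- name: $f").mkString("scalar_functions:\n", "\n", "\n")

  // Returns None when the published file matches the freshly generated text,
  // otherwise a message telling the developer to regenerate the file.
  def checkPublished(published: Path, functions: Seq[String]): Option[String] = {
    val expected = new String(Files.readAllBytes(published), StandardCharsets.UTF_8)
    val generated = generateDialect(functions)
    if (expected == generated) None
    else Some(s"Published dialect $published is out of date; regenerate it.")
  }

  def main(args: Array[String]): Unit = {
    // Simulate a committed dialect file using a temporary file.
    val published = Files.createTempFile("dialect", ".yaml")
    Files.write(published, generateDialect(Seq("add", "subtract")).getBytes(StandardCharsets.UTF_8))

    // Same function set (order does not matter because the generator sorts).
    println(checkPublished(published, Seq("subtract", "add")))
    // A function was added in code but the file was not regenerated.
    println(checkPublished(published, Seq("add", "subtract", "multiply")))
  }
}
```

Sorting inside the generator keeps the output deterministic, so the comparison only fails on a real change to the function set rather than on iteration order.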

@andrew-coleman andrew-coleman marked this pull request as draft October 14, 2025 13:30
@andrew-coleman andrew-coleman force-pushed the dialect branch 3 times, most recently from d7727de to 1f8756f October 17, 2025 15:15
@andrew-coleman andrew-coleman marked this pull request as ready for review October 17, 2025 15:30
@andrew-coleman andrew-coleman force-pushed the dialect branch 3 times, most recently from 4dc3359 to 03da734 October 20, 2025 07:59
Adds the formal dialect definition for the Spark converter.
The file is generated by code that extracts the relevant
information from the existing FunctionMapper definitions.
This ensures that the dialect YAML remains in sync with the
code.
Tests have been introduced to ensure that the dialect conforms
to the schema (defined in the base substrait library), and that
the published dialect YAML is equivalent to the one that would
be generated from the current code.

Signed-off-by: Andrew Coleman <andrew_coleman@uk.ibm.com>
Comment on lines +54 to +57
// The following will fail if the generated dialect differs from the published one.
// If this is caused by an intentional change, the published dialect should be regenerated using:
// `./gradlew dialect`
assertResult(publishedDialect)(genDialect)
Member

great that you thought of that

Member

@nielspardon nielspardon left a comment

LGTM

@andrew-coleman andrew-coleman merged commit c4c6153 into substrait-io:main Oct 21, 2025
12 checks passed
@andrew-coleman andrew-coleman deleted the dialect branch October 21, 2025 07:29
2 participants