
feat: spark sql schema translator #54

Merged
gummiorri merged 4 commits into main from 37-create-translator-for-spark-ddl
Jan 30, 2026

Conversation

@giuseppegrieco
Member

No description provided.

@giuseppegrieco giuseppegrieco linked an issue Jan 29, 2026 that may be closed by this pull request
@giuseppegrieco giuseppegrieco changed the title from "feat: add spark sql schema translator" to "feat: spark sql schema translator" on Jan 29, 2026
@giuseppegrieco
Member Author

@cubic-dev-ai review this

@cubic-dev-ai
Contributor

cubic-dev-ai Bot commented Jan 30, 2026

@cubic-dev-ai review this

@giuseppegrieco I have started the AI code review. It will take a few minutes to complete.

Contributor

@cubic-dev-ai cubic-dev-ai Bot left a comment


1 issue found across 5 files

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="internal/translate/sparksql/translator.go">

<violation number="1" location="internal/translate/sparksql/translator.go:57">
P2: The `inlineStruct` function can cause infinite recursion if the schema contains circular type references. Consider adding a `visited` set to detect cycles and return an error instead of crashing.</violation>
</file>
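The fix the reviewer suggests can be sketched as a `visited` set threaded through the recursion. This is a minimal illustration, not the actual `translator.go` code: the `defs` shape, the `inlineStruct` signature, and the field representation are all assumptions; the same pattern also addresses the later `databrickssql` cycle finding, where detecting `visited[name]` should return an error rather than silently continuing.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// defs is a hypothetical stand-in for the schema's $defs section:
// definition name -> field name -> type string.
type defs map[string]map[string]string

// inlineStruct renders a definition as a Spark SQL STRUCT<...> string.
// The visited set breaks cycles: revisiting a name in the current path
// returns an error instead of recursing until the stack overflows.
func inlineStruct(d defs, name string, visited map[string]bool) (string, error) {
	if visited[name] {
		return "", fmt.Errorf("circular type reference involving %q", name)
	}
	visited[name] = true
	defer delete(visited, name) // the same def may appear on sibling branches

	fields := d[name]
	keys := make([]string, 0, len(fields))
	for k := range fields {
		keys = append(keys, k)
	}
	sort.Strings(keys) // deterministic field order

	parts := make([]string, 0, len(keys))
	for _, k := range keys {
		t := fields[k]
		if _, isDef := d[t]; isDef {
			inner, err := inlineStruct(d, t, visited)
			if err != nil {
				return "", err
			}
			t = inner
		}
		parts = append(parts, k+": "+t)
	}
	return "STRUCT<" + strings.Join(parts, ", ") + ">", nil
}

func main() {
	d := defs{
		"Person":  {"name": "STRING", "address": "Address"},
		"Address": {"city": "STRING"},
	}
	s, err := inlineStruct(d, "Person", map[string]bool{})
	fmt.Println(s, err)

	// A self-referential definition now errors instead of crashing.
	cyc := defs{"Node": {"next": "Node"}}
	_, err = inlineStruct(cyc, "Node", map[string]bool{})
	fmt.Println(err)
}
```

Clearing the name on unwind (`defer delete`) keeps diamond-shaped references legal while still rejecting true cycles.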

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

Comment thread: internal/translate/sparksql/translator.go (Outdated)
Member

@gummiorri gummiorri left a comment


LGTM

@gummiorri gummiorri merged commit 8573956 into main Jan 30, 2026
9 checks passed
@gummiorri gummiorri deleted the 37-create-translator-for-spark-ddl branch January 30, 2026 15:45
Contributor

@cubic-dev-ai cubic-dev-ai Bot left a comment


6 issues found across 41 files (changes from recent commits).

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="internal/translate/databrickssql/databrickssql.go.tmpl">

<violation number="1">
P2: Databricks SQL identifiers are emitted without quoting/escaping. Schema property names (and port names) can include characters or reserved words that are invalid as bare identifiers, which will make the generated `CREATE TABLE` fail. Consider quoting identifiers with backticks and escaping any backticks in the name.</violation>
</file>
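The quoting fix the reviewer describes is small: wrap every identifier in backticks and double any embedded backtick, which is how Databricks SQL escapes them. A sketch of such a helper (the name `quoteIdent` is illustrative, not the project's API):

```go
package main

import (
	"fmt"
	"strings"
)

// quoteIdent wraps a Databricks SQL identifier in backticks, doubling any
// embedded backticks, so reserved words ("order") and names with spaces
// or punctuation remain valid in generated DDL.
func quoteIdent(name string) string {
	return "`" + strings.ReplaceAll(name, "`", "``") + "`"
}

func main() {
	for _, n := range []string{"order", "my column", "weird`name"} {
		fmt.Printf("CREATE TABLE t (%s STRING);\n", quoteIdent(n))
	}
}
```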

<file name="cmd/daco/internal/app.go">

<violation number="1" location="cmd/daco/internal/app.go:35">
P2: The "spark-scala" registry entry is wired to the PySpark translator, so Spark Scala requests will be translated using PySpark logic. This likely produces incorrect output. Map it to the Scala translator instead.</violation>
</file>

<file name="internal/translate/gotypes/resolver.go">

<violation number="1">
P2: Non-acronym words are not properly lowercased, causing incorrect PascalCase conversion when input contains uppercase characters. For example, `"HTTP_SERVER"` produces `"HTTPSERVER"` instead of `"HTTPServer"`.</violation>
</file>
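The resolver bug the reviewer points at comes from title-casing words without lowercasing them first. A minimal sketch of the corrected conversion, assuming a snake/SCREAMING_SNAKE input split on underscores and a small acronym allowlist (the acronym set and function name are illustrative; the real `resolver.go` may differ):

```go
package main

import (
	"fmt"
	"strings"
)

// acronyms kept fully uppercase in PascalCase output (illustrative set).
var acronyms = map[string]bool{"HTTP": true, "ID": true, "URL": true, "API": true}

// toPascalCase converts snake_case or SCREAMING_SNAKE to PascalCase.
// Each non-acronym word is lowercased before capitalizing its first
// letter, so "HTTP_SERVER" becomes "HTTPServer", not "HTTPSERVER".
// ASCII-only for brevity.
func toPascalCase(s string) string {
	var b strings.Builder
	for _, w := range strings.Split(s, "_") {
		if w == "" {
			continue
		}
		if upper := strings.ToUpper(w); acronyms[upper] {
			b.WriteString(upper)
			continue
		}
		lower := strings.ToLower(w)
		b.WriteString(strings.ToUpper(lower[:1]) + lower[1:])
	}
	return b.String()
}

func main() {
	fmt.Println(toPascalCase("HTTP_SERVER")) // HTTPServer
	fmt.Println(toPascalCase("user_id"))     // UserID
}
```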

<file name="internal/translate/databrickssql/translator.go">

<violation number="1">
P2: Circular type references silently produce invalid SQL output. When `visited[name]` is true (indicating a cycle), the code `continue`s without modifying the type or returning an error. This leaves the field's Type as the definition name (e.g., "Person"), which is not a valid Databricks SQL type. Consider returning an error when a circular reference is detected.</violation>
</file>

<file name="internal/translate/prepare.go">

<violation number="1">
P2: ToSnakeCase doesn’t handle camelCase boundaries, so strings like "myFieldName" become "myfieldname" instead of "my_field_name", contradicting the function’s documented snake_case contract.</violation>
</file>
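Handling camelCase boundaries means inserting an underscore wherever a lowercase-to-uppercase transition (or the start of a new word inside an uppercase run) occurs before lowercasing. A sketch of that rule, not the project's actual `ToSnakeCase`:

```go
package main

import (
	"fmt"
	"strings"
	"unicode"
)

// toSnakeCase inserts underscores at word boundaries before lowercasing:
// a boundary is an uppercase rune preceded by a lowercase rune
// ("myField" -> "my_field") or starting the last word of an uppercase
// run ("HTTPServer" -> "http_server").
func toSnakeCase(s string) string {
	var b strings.Builder
	runes := []rune(s)
	for i, r := range runes {
		if unicode.IsUpper(r) && i > 0 &&
			(unicode.IsLower(runes[i-1]) ||
				(i+1 < len(runes) && unicode.IsLower(runes[i+1]))) {
			b.WriteRune('_')
		}
		b.WriteRune(unicode.ToLower(r))
	}
	return b.String()
}

func main() {
	fmt.Println(toSnakeCase("myFieldName")) // my_field_name
	fmt.Println(toSnakeCase("HTTPServer"))  // http_server
}
```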

<file name="internal/translate/databrickspyspark/databrickspyspark.go.tmpl">

<violation number="1">
P2: Definition names are emitted as Python identifiers without sanitizing `$defs` names, so schemas with hyphens/spaces/digit-leading names will produce invalid Python code. Consider normalizing `$defs` names (e.g., ToSnakeCase) in the resolver before rendering the template.</violation>
</file>
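Normalizing `$defs` names before rendering could look like the helper below: replace characters that are invalid in a Python identifier with underscores and guard against a leading digit. The function name and exact policy are assumptions for illustration; the reviewer's suggestion of reusing `ToSnakeCase` in the resolver would achieve the same end.

```go
package main

import (
	"fmt"
	"strings"
	"unicode"
)

// sanitizePyIdent turns an arbitrary $defs name into a valid Python
// identifier: letters/digits are lowercased, every other character
// becomes an underscore, and a leading digit gets an underscore prefix.
func sanitizePyIdent(name string) string {
	var b strings.Builder
	for _, r := range name {
		if unicode.IsLetter(r) || unicode.IsDigit(r) {
			b.WriteRune(unicode.ToLower(r))
		} else {
			b.WriteRune('_')
		}
	}
	out := b.String()
	if out == "" || unicode.IsDigit(rune(out[0])) {
		out = "_" + out
	}
	return out
}

func main() {
	fmt.Println(sanitizePyIdent("my-def name")) // my_def_name
	fmt.Println(sanitizePyIdent("1stSchema"))   // _1stschema
}
```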

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

Comment thread: cmd/daco/internal/app.go
@@ -34,6 +35,7 @@ func registerTranslators() translate.Register {
translators["spark-scala"] = &pyspark.Translator{}
Contributor


P2: The "spark-scala" registry entry is wired to the PySpark translator, so Spark Scala requests will be translated using PySpark logic. This likely produces incorrect output. Map it to the Scala translator instead.

Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At cmd/daco/internal/app.go, line 35:

<comment>The "spark-scala" registry entry is wired to the PySpark translator, so Spark Scala requests will be translated using PySpark logic. This likely produces incorrect output. Map it to the Scala translator instead.</comment>

<file context>
@@ -9,13 +9,32 @@ import (
+	translators["python"] = &python.Translator{}
+	translators["databricks-sql"] = &databrickssql.Translator{}
+	translators["scala"] = &scala.Translator{}
+	translators["spark-scala"] = &pyspark.Translator{}
+	translators["databricks-scala"] = &databricksscala.Translator{}
+	translators["protobuf"] = &protobuf.Translator{}
</file context>
Suggested change
translators["spark-scala"] = &pyspark.Translator{}
translators["spark-scala"] = &scala.Translator{}



Development

Successfully merging this pull request may close these issues.

Create translator for spark-DDL

2 participants