[SEDONA-511] Fix reading/writing geoparquet metadata for snake_case or camelCase column names #1270

Merged

Kontinuation
Member

Did you read the Contributor Guide?

Is this PR related to a JIRA ticket?

What changes were proposed in this PR?

The current GeoParquet implementation converts the geo metadata to camel or underscore style during parsing and serialization. Because geometry column names are part of that metadata, the conversion can rewrite them, so the metadata no longer matches the schema of the Parquet file. This patch resolves the issue by skipping the style conversion for column names. Reading and writing should now work correctly with geometry column names such as geom_column or geomColumn.
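To illustrate the kind of mismatch described above, here is a minimal Python sketch. It is not Sedona's actual code (which is Scala); `camelize`, `camelize_keys`, and the `skip_children_of` parameter are hypothetical names made up for this example. It assumes that a blanket key-style conversion is applied to the whole `geo` metadata object, and shows how skipping the conversion for the keys of the `columns` map keeps column names intact.

```python
def camelize(key: str) -> str:
    """Convert snake_case to camelCase, e.g. 'primary_column' -> 'primaryColumn'."""
    head, *rest = key.split("_")
    return head + "".join(part[:1].upper() + part[1:] for part in rest)


def camelize_keys(node, skip_children_of=()):
    """Recursively camelize dict keys.

    Keys found directly under a map whose (original) key is listed in
    skip_children_of are kept verbatim, because they are column names
    rather than metadata field names. Hypothetical helper for illustration.
    """
    if isinstance(node, dict):
        out = {}
        for key, value in node.items():
            if key in skip_children_of and isinstance(value, dict):
                # Keep column names as-is; still convert the per-column fields.
                out[camelize(key)] = {
                    name: camelize_keys(child, skip_children_of)
                    for name, child in value.items()
                }
            else:
                out[camelize(key)] = camelize_keys(value, skip_children_of)
        return out
    if isinstance(node, list):
        return [camelize_keys(item, skip_children_of) for item in node]
    return node


# Example geo metadata with a snake_case geometry column name.
geo = {
    "version": "1.0.0",
    "primary_column": "geom_column",
    "columns": {
        "geom_column": {"encoding": "WKB", "geometry_types": ["Point"]},
    },
}

# Blanket conversion renames the column key, so it no longer matches
# the primary column value or the Parquet schema.
naive = camelize_keys(geo)

# Skipping the conversion for column names keeps them consistent.
fixed = camelize_keys(geo, skip_children_of=("columns",))
```

With the blanket conversion, `naive["columns"]` is keyed by `geomColumn` while `naive["primaryColumn"]` still says `geom_column`. With the skip in place, `fixed["columns"]` keeps the `geom_column` key and only the field names inside it are converted.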

How was this patch tested?

Added tests for the geoparquet and geoparquet.metadata data sources.

Did this PR include necessary documentation updates?

  • No, this PR does not affect any public API so no need to change the docs.

@Kontinuation Kontinuation marked this pull request as ready for review March 6, 2024 08:13
@jiayuasu jiayuasu added this to the sedona-1.6.0 milestone Mar 6, 2024
@jiayuasu jiayuasu merged commit fea229f into apache:master Mar 6, 2024
48 checks passed
jiayuasu pushed a commit that referenced this pull request Apr 28, 2024
…r camelCase column names (#1270)

* Fix geoparquet metadata for snake_case and camelCase geometry column names

* Apply the change to Spark 3.4 and 3.5

* Fix binary compatibility issue for Spark 3.0.x