Exported Parquet Files are incompatible with Hive due to capital letters in column names #37
Labels
feature
Product feature
timeline:long-term
Marker for tickets that are unlikely to be implemented in the near future
When using the script
EXPORT_PATH
to export an eXasol Table, the generated parquet files have a schema with columns names in capital letters. The reason is probably that EXASOL uses upper case metadata.This is not a problem by itself, but when it comes to store these files as a hive table, where Hive and Spark share the common meta-store, new issues appear.
As explained here, Hive is case insensitive, while Parquet is not.
Therefore, as a user of cloud-storage-etl-udfs,
I want to be able to export parquet files with column names in lower case to maximize compatibility with Hive and Spark.
The text was updated successfully, but these errors were encountered: