Make Iceberg support case insensitivity #83

xabriel · 2019-01-17T23:35:56Z

Iceberg currently resolves column names case-sensitively, which hinders usability because most SQL users expect case insensitivity by default. A query like the following succeeds in other Spark Readers but fails on Iceberg:

SELECT COUNT(*)
FROM iceTable
WHERE year = 2017
  AND MONTH = 11 -- Notice how MONTH has different casing than other predicates
  AND day = 01

This will fail with a stack trace similar to:

com.google.common.util.concurrent.UncheckedExecutionException: com.netflix.iceberg.exceptions.ValidationException: Cannot find field 'MONTH' in struct: struct<...>
...
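
To make the failure mode concrete, here is a minimal sketch of column-name resolution with a case sensitivity flag. It is written in Python for brevity and is not Iceberg's Java code: `find_field`, `ValidationError`, and the `case_sensitive` parameter are hypothetical names used only for illustration, and rejecting ambiguous matches is an assumption of this sketch rather than documented Iceberg behavior.

```python
from typing import List


class ValidationError(Exception):
    """Raised when a column reference cannot be resolved (hypothetical name)."""


def find_field(columns: List[str], name: str, case_sensitive: bool = True) -> str:
    """Resolve a column reference against the schema's column names.

    Returns the column name exactly as it is spelled in the schema.
    """
    if case_sensitive:
        # Exact match only: 'MONTH' does not match 'month'.
        if name in columns:
            return name
        raise ValidationError(f"Cannot find field '{name}' in struct: {columns}")

    # Case-insensitive: compare lowercased names.
    matches = [c for c in columns if c.lower() == name.lower()]
    if len(matches) == 1:
        return matches[0]
    if not matches:
        raise ValidationError(f"Cannot find field '{name}' in struct: {columns}")
    # Assumption of this sketch: a schema containing both 'month' and 'MONTH'
    # makes a case-insensitive reference ambiguous, so it is rejected.
    raise ValidationError(f"Ambiguous reference '{name}': matches {matches}")
```

With `columns = ["year", "month", "day"]`, resolving `MONTH` raises `ValidationError` when `case_sensitive=True` and returns `month` when `case_sensitive=False`, which mirrors the difference between the failing query above and the behavior most SQL users expect.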

PR to solve this issue at iceberg-api level: #82

More PRs that use the new case sensitivity flag will follow.

xabriel · 2019-01-23T18:30:41Z

PR #82 solves this issue at iceberg-api level.

We still need follow-up PRs to:

Expose the new caseSensitive flag as a configuration, perhaps by introducing the use of org.apache.hadoop.conf.Configuration.
Address a comment from @rdblue on #82 (Make expression binding support a case sensitivity flag):

I think some of these Evaluators will also need case sensitivity options. It doesn't do much good to support it in expression binding if it isn't also exposed when working with expressions in other ways. Can you also open a follow-up issue?

xabriel · 2019-03-19T01:06:18Z

PR #89, just merged, solves this problem all the way to the Spark Reader.

Jotting down some minor follow-up items here so that we don't forget:

We need to address two review comments: #89 (comment) and #89 (comment)

xabriel · 2019-03-25T20:28:32Z

There's another issue with Filterables, described in #145.

rdblue · 2019-07-06T19:13:25Z

I'm going to close this because #89 and #141 fixed the original problem. #145 would be nice to fix, but the code works as it is right now.

xabriel added a commit to xabriel/incubator-iceberg that referenced this issue Jan 19, 2019

Make expression binding support a case sensitivity flag. (apache#83)

606ab12

xabriel changed the title ~~Make expression binding case insensitive~~ Make expression binding support a case sensitivity flag Jan 19, 2019

xabriel mentioned this issue Jan 23, 2019

Make expression binding support a case sensitivity flag #82

Merged

xabriel added a commit to xabriel/incubator-iceberg that referenced this issue Jan 23, 2019

Allow existing tests to use a package-private Binder#bind. (apache#83)

1d18549

xabriel mentioned this issue Jan 30, 2019

Make read-path Evaluators honor case sensitivity flag. Expose flag in Spark Reader. #89

Merged

xabriel changed the title ~~Make expression binding support a case sensitivity flag~~ Make Iceberg support case insensitivity Mar 19, 2019

rdblue mentioned this issue Mar 20, 2019

Add case insensitive support to Parquet. #141

Merged

rdblue closed this as completed Jul 6, 2019
