[SPARK-2190][SQL] Specialized ColumnType for Timestamp#1440
[SPARK-2190][SQL] Specialized ColumnType for Timestamp#1440liancheng wants to merge 5 commits intoapache:masterfrom
Conversation
|
QA tests have started for PR 1440. This patch merges cleanly. |
|
QA results for PR 1440: |
|
Hmm, just realized |
There was a problem hiding this comment.
Hi, SimpleDateFormat is not thread-safe, so def should be used instead of val.
There was a problem hiding this comment.
Just checked TimestampWritable.java, it's indeed handled with a thread local variable. Thanks for pointing this out!
|
Confirmed that the following test cases are timezone sensitive and blacklisted them (by first remove all timestamp related golden answers, run them in my local timezone to generate new golden answers, then manually change my timezone settings and rerun these tests):
[*] Reverted from #1396. |
|
QA tests have started for PR 1440. This patch merges cleanly. |
|
QA tests have started for PR 1440. This patch merges cleanly. |
|
QA results for PR 1440: |
|
QA results for PR 1440: |
There was a problem hiding this comment.
Nit: Spark style would probably prefer != here.
There was a problem hiding this comment.
I think I read somewhere ne has better performance in certain cases...
There was a problem hiding this comment.
Oh, I see... I forgot that ne/eq do reference equality... in the case of null I would imagine there is no difference, but this is probably fine then.
|
QA tests have started for PR 1440. This patch merges cleanly. |
|
QA results for PR 1440: |
|
QA tests have started for PR 1440. This patch DID NOT merge cleanly! |
|
Rebased to the most recent master, generated all golden answer files after setting timezone of my local machine to |
|
QA tests have started for PR 1440. This patch merges cleanly. |
|
QA results for PR 1440: |
|
QA results for PR 1440: |
|
Thanks! I've merged this into master. |
JIRA issue: [SPARK-2190](https://issues.apache.org/jira/browse/SPARK-2190) Added specialized in-memory column type for `Timestamp`. Whitelisted all timestamp related Hive tests except `timestamp_udf`, which is timezone sensitive. Author: Cheng Lian <lian.cs.zju@gmail.com> Closes apache#1440 from liancheng/timestamp-column-type and squashes the following commits: e682175 [Cheng Lian] Enabled more timezone sensitive Hive tests. 53a358f [Cheng Lian] Fixed failed test suites 01b592d [Cheng Lian] Fixed SimpleDateFormat thread safety issue 2a59343 [Cheng Lian] Removed timezone sensitive Hive timestamp tests 45dd05d [Cheng Lian] Added Timestamp specific in-memory columnar representation
JIRA issue: SPARK-2190
Added specialized in-memory column type for
Timestamp. Whitelisted all timestamp related Hive tests excepttimestamp_udf, which is timezone sensitive.