[WIP] druid-hive integration #2880

navis · 2016-04-25T07:16:09Z

Based on #2282. This is the first try to integrate druid with hive.

create table <table-name> (<column-schema>...)
STORED BY "io.druid.hive.DruidHiveStorageHandler" 
TBLPROPERTIES (
  "druid.broker.address"="<broker-address>", 
  "druid.datasource"="<datasource>"
);

tested on hive-2.0 with MR/TEZ. needs configuration hive.optimize.index.filter=true (hive-site.xml) and mapreduce.job.user.classpath.first=true (hadoop mapred-site.xml)

jaehc · 2016-04-25T09:19:54Z

indexing-hadoop/src/main/java/io/druid/indexer/hadoop/QueryBasedInputFormat.java

+    long maxSize = conf.getLong(CONF_MAX_SPLIT_SIZE, DEFAULT_MAX_SPLIT_SIZE);
+
+    if (maxSize > 0) {
+      Collections.shuffle(segments);


I am just wondering it is necessary to shuffle the segments list?

xvrl · 2016-04-25T22:38:03Z

@navis it's a bit confusing to have the same code changes in two different PRs. We're getting comments in both, which makes it hard to track the ones that are addressed or not. Maybe we should close one of the PRs?

navis · 2016-04-26T00:02:50Z

@xvrl Sorry, I thought WIP in title can make others skip reviewing. I'll close this, for now.

navis added 7 commits April 25, 2016 16:10

Show candidate hosts for the given query

ad59a6b

added get method with datasource/intervals param

4063c06

provide approximated size of target segment

718da13

Support queried input format for hive integration

319f9fa

support filter pushdown on dimensions

bb9378d

Add storage handler

ade75cf

fix log

25cc89b

navis force-pushed the hive-druid-integration branch from ba464b5 to 25cc89b Compare April 25, 2016 07:44

jaehc reviewed Apr 25, 2016
View reviewed changes

navis closed this Apr 26, 2016

gianm mentioned this pull request Jun 17, 2016

Add segment pruning based on secondary partition dimension #2982

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] druid-hive integration #2880

[WIP] druid-hive integration #2880

navis commented Apr 25, 2016

jaehc Apr 25, 2016 •

edited

xvrl commented Apr 25, 2016

navis commented Apr 26, 2016

[WIP] druid-hive integration #2880

[WIP] druid-hive integration #2880

Conversation

navis commented Apr 25, 2016

jaehc Apr 25, 2016 • edited

Choose a reason for hiding this comment

xvrl commented Apr 25, 2016

navis commented Apr 26, 2016

jaehc Apr 25, 2016 •

edited