You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am using the elasticsearch-hadoop-2.2.0-beta1.jar to move data from hive to elasticsearch service on amazon AWS.
I want to use the uid as an index but not to also include it in the _source of the document.
This doesn't work.
DROP TABLE IF EXISTS corpusElasticSearch;
CREATE EXTERNAL TABLE corpusElasticSearch (
country STRING, uid STRING, gender INT, age INT,
education INT, employment INT, income INT, householdsize INT, children INT,
domains ARRAY<STRING>, devices ARRAY<STRING>)
STORED BY 'org.elasticsearch.hadoop.hive.EsStorageHandler'
TBLPROPERTIES( 'es.mapping.id' = 'uid',
'es.mapping.exclude' = 'uid', --this should exclude the uid from the _source mapping. For some reason it doesn't. Might only work for JSON ?
'es.resource' = 'audiencereport/testHive21',
'es.nodes' = 'Amazon ES endpoint',
'es.nodes.wan.only' = 'true',
'es.index.auto.create' = 'true');
INSERT OVERWRITE TABLE corpusElasticSearch select * from corpus limit 10;
After more testing neither es.mapping.exclude nor es.mapping.include work in this scenario.
The text was updated successfully, but these errors were encountered:
Finally got around looking at this bug. It was caused by Hive stripping the real names of the columns which meant the filtering had completely different values. Fixed in master (and will be back ported to 2.1.3).
A nightly build will soon ensue.
I am using the elasticsearch-hadoop-2.2.0-beta1.jar to move data from hive to elasticsearch service on amazon AWS.
I want to use the uid as an index but not to also include it in the _source of the document.
This doesn't work.
After more testing neither es.mapping.exclude nor es.mapping.include work in this scenario.
The text was updated successfully, but these errors were encountered: