Multiple indexes setting for 'es.resource' #289
Comments
Could you explain what the expectation is and what the actual result is? es-hadoop does minimal interpretation of the index/type and feeds the information directly to Elasticsearch.
The Hive table I created is as below: CREATE EXTERNAL TABLE test. I then ran 'select count(*) from test;', a Hive query that counts the total number of rows in the table. Environment information
Hi, sorry for the delay in picking this up. I've tried reproducing this but can't - maybe it has something to do with the dataset or potentially the way the counting is done.
Thanks!
Hi, I created new indexes with a small amount of data for easy testing.
POST /cars-01/transactions/_bulk
POST /cars-02/transactions/_bulk
'GET cars-01/_search?search_type=count' returns 3 hits, and I created a table as below:
CREATE EXTERNAL TABLE cars
'select * from cars' and 'select count(*) from cars' return a different result every time.
10000 red honda 2014-10-28 00:00:00
example 2 (10 rows)
10000 red honda 2014-10-28 00:00:00
I uploaded two logs. Thanks.
I get this issue as well, but from a Spark perspective.
If I set es.resource to "index1,index2/mydoc", things get a bit weird:
Moving forward, if I add an additional index, "index1,index2,index3/mydoc", the count will be 36, with 3 identical partitions of size 12 each.
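To make the symptom above concrete, here is a minimal sketch in plain Python (no Spark or Elasticsearch required) of one way to tell duplicated partitions apart from genuinely larger data: compare the total row count with the number of distinct document ids. The helper name `summarize_partitions` and the ids `doc-0` to `doc-11` are made up for illustration; only the shape (3 identical partitions of size 12) comes from the report.

```python
from typing import Iterable, List, Tuple


def summarize_partitions(partitions: Iterable[List[str]]) -> Tuple[int, int]:
    """Return (total rows, distinct document ids) across all partitions.

    Hypothetical helper: each partition is modelled as a list of document ids.
    """
    partitions = list(partitions)
    total = sum(len(part) for part in partitions)
    distinct = len({doc_id for part in partitions for doc_id in part})
    return total, distinct


# Simulated ids (assumption): 12 documents, each partition returning all of them.
doc_ids = [f"doc-{i}" for i in range(12)]
identical_partitions = [list(doc_ids) for _ in range(3)]

total, distinct = summarize_partitions(identical_partitions)
print(total, distinct)  # total is 3 * 12, distinct stays at 12
```

If the total is a whole multiple of the distinct count, as here, the rows are being read more than once rather than coming from additional documents.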
Folks, can you try the latest Beta (4) and see whether it addressed your issue? There have been several updates on this front. Thanks,
It is working now.
Closing the issue...
'es.resource' = 'apache-2014.09./apache-access' or
'es.resource' = 'apache-2014.09.29,apache-2014.09.30/apache-access'
do not work correctly for the HiveQL query 'select count(*) from test'.
The count result is wrong.
The 'es.resource' configuration should support specifying multiple indexes,
or at least give an error message.
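As an illustration of the request above, the following is a minimal sketch of how a multi-index resource string could be split and validated so that a malformed value produces an error instead of a silent wrong count. This is not es-hadoop's actual parsing code; the function name `split_resource` and the error messages are assumptions made for this example.

```python
from typing import List, Tuple


def split_resource(resource: str) -> Tuple[List[str], str]:
    """Split '<index>[,<index>...]/<type>' into (index names, type).

    Hypothetical helper for illustration; raises ValueError on malformed input.
    """
    index_part, sep, doc_type = resource.partition("/")
    if not sep or not doc_type:
        raise ValueError(
            f"expected '<index>[,<index>...]/<type>', got {resource!r}"
        )
    indices = [name.strip() for name in index_part.split(",")]
    if any(not name for name in indices):
        raise ValueError(f"empty index name in {resource!r}")
    return indices, doc_type


indices, doc_type = split_resource(
    "apache-2014.09.29,apache-2014.09.30/apache-access"
)
print(indices, doc_type)
```

A value such as `"a,,b/t"` or one with no `/type` part raises `ValueError` in this sketch, which corresponds to the "at least give an error message" behaviour asked for.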