Elasticsearch+Hadoop read and write #45

marcelopaesrech · 2013-05-13T12:53:01Z

Hi, I want to use Elasticsearch+Hadoop MapReduce to read from Elasticsearch and write to Elasticsearch in the same MapReduce job. The sample shows only one direction (either read or write, because there is only one es.resource). What I want is something like the following:

Read from:
/radio/artists/_search?q=me*

Write to :
/radio/statistics

Best regards.

tzolov · 2013-05-13T14:59:09Z

@marcelopaesrech this looks like the same issue as #26. IMO there is no technical reason prohibiting this case.

Furthermore, I've implemented a simple fix that adds ES_QUERY in addition to ES_RESOURCE, and it works fine.

marcelopaesrech · 2013-05-13T20:03:16Z

Yes, I want the input of the Map phase to be /radio/artists/_search?q=me* and the Reducers to write to /radio/statistics (or maybe another index). Any intermediate data that Hadoop generates can be stored in HDFS, I don't care. But I want to store the final result in an index on ES.

I read your issue and I think it is the same thing.

costin · 2014-03-10T15:53:51Z

@tzolov @marcelopaesrech hey, finally got around to fixing this. Cascading, Hive and Pig set the read/write resource automatically. For MapReduce jobs, one can use the es.resource.read and es.resource.write properties. es.resource is still supported and is used as a fall-back if the aforementioned properties are not defined.
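
A minimal sketch of the fall-back rule described above, using the resources from the original request. The `EsResourceSketch` class and its `readResource`/`writeResource` helpers are hypothetical and are not part of elasticsearch-hadoop. Plain `java.util.Properties` stands in for the Hadoop job configuration so that the example runs on its own. Only the three property names come from this thread.

```java
import java.util.Properties;

// Hypothetical helper, not part of elasticsearch-hadoop: it only mirrors the
// fall-back rule described in this thread, using java.util.Properties in
// place of the Hadoop job configuration.
class EsResourceSketch {

    static final String ES_RESOURCE = "es.resource";
    static final String ES_RESOURCE_READ = "es.resource.read";
    static final String ES_RESOURCE_WRITE = "es.resource.write";

    // Resource to read from: es.resource.read, else es.resource.
    static String readResource(Properties conf) {
        return conf.getProperty(ES_RESOURCE_READ, conf.getProperty(ES_RESOURCE));
    }

    // Resource to write to: es.resource.write, else es.resource.
    static String writeResource(Properties conf) {
        return conf.getProperty(ES_RESOURCE_WRITE, conf.getProperty(ES_RESOURCE));
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        // Values taken from the original request in this issue.
        conf.setProperty(ES_RESOURCE_READ, "/radio/artists/_search?q=me*");
        conf.setProperty(ES_RESOURCE_WRITE, "/radio/statistics");

        System.out.println("read from: " + readResource(conf));
        System.out.println("write to:  " + writeResource(conf));
    }
}
```

In an actual MapReduce job the same two property names would presumably be set on the job's configuration. A job that defines only es.resource would resolve both directions to that single value.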

Improve conf to allow for dedicated read and write resources as opposed to a single, unified resource used for both. This allows different ES indices to be used in the same job, one as a source and the other as a sink. 'es.resource' is still supported and used as a fall-back. Higher-level abstractions, such as Cascading, Hive and Pig, set the proper property automatically. fix #156 fix #45 fix #26

costin removed the v1.3.0.M2 label Feb 6, 2014

costin added the enhancement label Mar 10, 2014

costin closed this as completed in 68cd50e Mar 10, 2014
