Using the EsSpark.esRDD method to read from Elasticsearch does not honor size parameters in either URI or DSL #469

sherry-ger · 2015-06-05T22:34:49Z

The size option is ignored. Both methods returned all results.

//conf.set("es.query", "?q=text_entry:rebel&size=1") or

val q1 = "{\"query\": {\"filtered\" : {\"query\" : {\"term\" : { \"text_entry\": \"rebel\" }}}},\"size\" : 1}"
conf.set("es.query", q1)
val esRDD = sc.newAPIHadoopRDD(conf, classOf[EsInputFormat[Text, MapWritable]], classOf[Text], classOf[MapWritable])

May be this is similar to #444

The text was updated successfully, but these errors were encountered:

costin · 2015-06-12T16:43:13Z

This is actually on purpose and needs to be documented. Since the connector does a parallel query, it also looks at the number of documents being returned so if the user specifies a parameter, it will overwrite it according to the batch.size setting (see the configuration option).
In other words, if you want to control the size, do so through that setting as it will always take precedence.

apatrida · 2015-07-12T00:21:45Z

its nice to have docs...

costin · 2015-10-28T18:58:28Z

relates #546

costin · 2015-10-29T11:35:10Z

Fixed through #546

costin added v2.1.0.rc1 :Rest doc v2.1.0.Beta4 labels Jun 8, 2015

costin added v2.2.0-m1 and removed v2.1.0.Beta4 v2.2.0-m1 v2.1.0.rc1 labels Jun 12, 2015

costin added v2.2.0-beta1 and removed v2.2.0-m1 labels Aug 27, 2015

kim333 mentioned this issue Sep 8, 2015

elasticsearch spark size option to limit the number of documents returned #546

Closed

costin closed this as completed Oct 29, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using the EsSpark.esRDD method to read from Elasticsearch does not honor size parameters in either URI or DSL #469

Using the EsSpark.esRDD method to read from Elasticsearch does not honor size parameters in either URI or DSL #469

sherry-ger commented Jun 5, 2015

costin commented Jun 12, 2015

apatrida commented Jul 12, 2015

costin commented Oct 28, 2015

costin commented Oct 29, 2015

Using the EsSpark.esRDD method to read from Elasticsearch does not honor size parameters in either URI or DSL #469

Using the EsSpark.esRDD method to read from Elasticsearch does not honor size parameters in either URI or DSL #469

Comments

sherry-ger commented Jun 5, 2015

costin commented Jun 12, 2015

apatrida commented Jul 12, 2015

costin commented Oct 28, 2015

costin commented Oct 29, 2015