
Object field named 'properties' breaks parsing of ES mapping #809

Closed
1 of 2 tasks
andregarcia opened this issue Jul 20, 2016 · 0 comments

Comments

@andregarcia
Contributor

What kind of an issue is this?

  • Bug report. If you’ve found a bug, please provide a code snippet or test to reproduce it below.
    The easier it is to track down the bug, the faster it is solved.
  • Feature Request. Start by telling us what problem you’re trying to solve.
    Often a solution already exists! Don’t send pull requests to implement new features without
    first getting our support. Sometimes we leave features out on purpose to keep the project small.

Issue description

The parser for ES type mappings does not work properly when the mapping contains a field of type object named 'properties'. I checked the code and verified that the parser cannot distinguish between an object field named 'properties' and the 'properties' key that ES itself uses to define an object's sub-fields.

Steps to reproduce

Code:

Create the ES index, mapping, and a document (the mapping is created dynamically on first index):

curl -XPOST 'localhost:9200/sample_index/sample_type/5123' -d '{"name":"value0","properties":{"x":"value1","y":"value2"},"title":"value3"}'
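For reference, the document body indexed above nests x and y under a top-level field literally named 'properties', which collides with the key name ES itself uses in mappings (plain stdlib inspection):

```python
import json

# Exact document body from the curl command above.
doc = json.loads('{"name":"value0","properties":{"x":"value1",'
                 '"y":"value2"},"title":"value3"}')

# 'properties' is an ordinary object field of this document, colliding
# with the key ES mappings use to hold sub-field definitions.
print(sorted(doc))                # ['name', 'properties', 'title']
print(sorted(doc["properties"]))  # ['x', 'y']
```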

Query the index using elasticsearch-hadoop (Python code):

from pyspark import SparkContext, SparkConf

conf = SparkConf().setAppName('app').setMaster('local')
sc = SparkContext(conf=conf)

# Point elasticsearch-hadoop at the index/type created above.
es_hadoop_conf = {
        'es.nodes' : 'localhost',
        'es.port' : '9200',
        'es.resource' : 'sample_index/sample_type'
}

# Read the index as an RDD of (document id, document fields) pairs.
rdd = sc.newAPIHadoopRDD(
        inputFormatClass="org.elasticsearch.hadoop.mr.EsInputFormat",
        keyClass="org.apache.hadoop.io.NullWritable",
        valueClass="org.elasticsearch.hadoop.mr.LinkedMapWritable",
        conf=es_hadoop_conf
)
print rdd.collect()[0]

The output shows an empty document:

(u'5123', {})

Stack trace:

No exceptions are thrown, but the DEBUG log shows that although the mapping was retrieved correctly, it was not parsed correctly:

16/07/20 05:00:13 DEBUG header: >> "GET /sample_index/sample_type/_mapping HTTP/1.1[\r][\n]"
16/07/20 05:00:13 DEBUG HttpMethodBase: Adding Host request header
16/07/20 05:00:13 DEBUG header: >> "User-Agent: Jakarta Commons-HttpClient/3.1[\r][\n]"
16/07/20 05:00:13 DEBUG header: >> "Host: 127.0.0.1:9200[\r][\n]"
16/07/20 05:00:13 DEBUG header: >> "[\r][\n]"
16/07/20 05:00:13 DEBUG header: << "HTTP/1.1 200 OK[\r][\n]"
16/07/20 05:00:13 DEBUG header: << "HTTP/1.1 200 OK[\r][\n]"
16/07/20 05:00:13 DEBUG header: << "Content-Type: application/json; charset=UTF-8[\r][\n]"
16/07/20 05:00:13 DEBUG header: << "Content-Length: 187[\r][\n]"
16/07/20 05:00:13 DEBUG header: << "[\r][\n]"
16/07/20 05:00:13 DEBUG content: << "{"sample_index":{"mappings":{"sample_type":{"properties":{"name":{"type":"string"},"properties":{"properties":{"x":{"type":"string"},"y":{"type":"string"}}},"title":{"type":"string"}}}}}}"
16/07/20 05:00:13 INFO EsInputFormat: Discovered mapping {sample_type=[x=STRING, y=STRING]} for [sample_index/sample_type]

Version Info

OS : Ubuntu 14.04 LTS
JVM : java version "1.8.0_91"
Hadoop/Spark: spark-1.6.2-bin-hadoop2.6
ES-Hadoop : elasticsearch-hadoop-5.0.0-alpha4
ES : elasticsearch-2.2.0
