Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spark SQL 1.3 support - esDF function incorrectly maps column names #451

Closed
difin opened this issue May 14, 2015 · 3 comments
Closed

Spark SQL 1.3 support - esDF function incorrectly maps column names #451

difin opened this issue May 14, 2015 · 3 comments

Comments

@difin
Copy link

difin commented May 14, 2015

Hi,

Environment:
Spark 1.3.0
elasticsearch 1.4.4
elasticsearch-hadoop 2.1.0.Beta4

When using esDF function to load data from Elasticsearch into Spark, columns get assigned to incorrect column names in the resulted DataFrame. The columns names that get assigned, they do exist in the index/type, but the mapping between the columns and column names is totally incorrect.

This bug doesn't happen when using the general Spark SQL 1.3 "load" function, columns are mapped to correct column names, but when using the load function instead of esDF and then doing JOIN with DataFrame loaded from Oracle, an exception happens (see issue #449: #449).

@jmahonin
Copy link

I ran into this same issue myself using 2.1.0-Beta4. It seems to be resolved in the master branch, possibly by this commit:
279e893

It's been some time, and a number of changes since the Beta4 release. A Beta5 (or full) 2.1.0 release would be great!

@costin
Copy link
Member

costin commented Jun 3, 2015

@FDmitriy @jmahonin Indeed, it has. Can you please try out the snapshot/dev build and report back?

Thanks,

@costin
Copy link
Member

costin commented Jun 3, 2015

Duplicate of #441

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants