Reader parses schema but returns no rows #49

fzqneo · 2018-11-25T22:23:13Z

The result of the first command suggests it connects to dynamodb and parses the schema. But there are no rows in the dataframe, while I'm sure the dynamodb table is not empty.

scala> val users = spark.sqlContext.read.dynamodb("ap-southeast-1", "user")
users: org.apache.spark.sql.DataFrame = [account_name: string, avatar: string ... 24 more fields]

scala> users.count()
res15: Long = 0

The text was updated successfully, but these errors were encountered:

protometa · 2019-03-15T00:16:24Z

Damn, I'm just now running into this. Did you find any solution?

protometa · 2019-03-19T20:23:48Z

This seemed to be related to having a nested json format. If I used json that wasn't nested or left out the nested fields in my schema the records showed up. I was able to work around by reading as rdd and then reading the rdd with dataframe json reader as suggested in #46

PowerToThePeople111 · 2019-03-25T16:28:30Z

I got the same problem with this dynamodb exporter. This one works fine though and also seems to have been updated more recently.

fzqneo · 2019-04-01T13:28:21Z

I switched to something else and didn't continue with the issue. Please refer to the comments above for possible solution.

fzqneo closed this as completed Apr 1, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reader parses schema but returns no rows #49

Reader parses schema but returns no rows #49

fzqneo commented Nov 25, 2018

protometa commented Mar 15, 2019

protometa commented Mar 19, 2019 •

edited

Loading

PowerToThePeople111 commented Mar 25, 2019

fzqneo commented Apr 1, 2019

Reader parses schema but returns no rows #49

Reader parses schema but returns no rows #49

Comments

fzqneo commented Nov 25, 2018

protometa commented Mar 15, 2019

protometa commented Mar 19, 2019 • edited Loading

PowerToThePeople111 commented Mar 25, 2019

fzqneo commented Apr 1, 2019

protometa commented Mar 19, 2019 •

edited

Loading