twitter_classifier/Collect.scala: Pending TODO can be completed: SPARK-3390 was fixed in Spark 1.2.0. #50

MiguelPeralvo · 2015-01-04T11:39:05Z

Line 42 of reference-apps/twitter_classifier/scala/src/main/scala/com/databricks/apps/twitter_classifier/Collect.scala can now be safely removed, as SPARK-3390 was fixed in pull request #2364 for Apache 1.2.0.

If you use Spark 1.2.0, this is the code that can be removed:

.filter(!_.contains("boundingBoxCoordinates")) // TODO(vida): Remove this workaround when SPARK-3390 is fixed.

If you remove it for Spark 1.1.0, Collect.java won't break when run, but ExamineAndTrain.scala will do, with a "scala.MatchError: StructType(List())" exception. It will be caused by the "boundingBoxCoordinates" json entries, as Spark 1.1.0 doesn't handle them properly.

The text was updated successfully, but these errors were encountered:

Fixes [issue databricks#50: Pending TODO can be completed: SPARK-3390 was fixed] (databricks#50). I've tested it in Spark 1.1.0 and 1.2.0 and it works, as expected.

MiguelPeralvo added a commit to MiguelPeralvo/reference-apps that referenced this issue Jan 4, 2015

Update Collect.scala

7251316

Fixes [issue databricks#50: Pending TODO can be completed: SPARK-3390 was fixed] (databricks#50). I've tested it in Spark 1.1.0 and 1.2.0 and it works, as expected.

MiguelPeralvo mentioned this issue Jan 4, 2015

Update Collect.scala #51

Merged

MiguelPeralvo changed the title ~~twitter_classifier/Collect.scala: Pending TODO can be completed: SPARK-3390 was fixed.~~ twitter_classifier/Collect.scala: Pending TODO can be completed: SPARK-3390 was fixed in Spark 1.2.0. Jan 4, 2015

vidaha closed this as completed in #51 Jan 14, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

twitter_classifier/Collect.scala: Pending TODO can be completed: SPARK-3390 was fixed in Spark 1.2.0. #50

twitter_classifier/Collect.scala: Pending TODO can be completed: SPARK-3390 was fixed in Spark 1.2.0. #50

MiguelPeralvo commented Jan 4, 2015

twitter_classifier/Collect.scala: Pending TODO can be completed: SPARK-3390 was fixed in Spark 1.2.0. #50

twitter_classifier/Collect.scala: Pending TODO can be completed: SPARK-3390 was fixed in Spark 1.2.0. #50

Comments

MiguelPeralvo commented Jan 4, 2015