
Boosting Spark to 2.1.0. Adding tests that include Java types encoded … #105

Closed · wants to merge 2 commits

Conversation

imarios
Contributor

@imarios imarios commented Jan 29, 2017

…using injection.

@imarios imarios changed the title Boosting Spark to 2.1.0. Adding tests that include Java type encoded … Boosting Spark to 2.1.0. Adding tests that include Java types encoded … Jan 29, 2017
@codecov-io

codecov-io commented Jan 29, 2017

Codecov Report

Merging #105 into master will increase coverage by 0.09%.
The diff coverage is 100%.


@@            Coverage Diff             @@
##           master     #105      +/-   ##
==========================================
+ Coverage   90.72%   90.82%   +0.09%     
==========================================
  Files          25       25              
  Lines         507      512       +5     
  Branches        7        7              
==========================================
+ Hits          460      465       +5     
  Misses         47       47
Impacted Files Coverage Δ
core/src/main/scala/frameless/Injection.scala 100% <ø> (ø) ⬆️
dataset/src/main/scala/frameless/implicits.scala 100% <100%> (ø) ⬆️

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f3ad8d3...59c6a61. Read the comment docs.

@OlivierBlanvillain
Contributor

Apparently Spark's Scala 2.10 jar contains an old version of shapeless, introduced by apache/spark#14150. I hope this won't cause binary incompatibilities...

@@ -32,4 +32,12 @@ object implicits {
// implicit def floatToDouble[T](col: TypedColumn[T, Float]): TypedColumn[T, Double] = col.cast[Double]
// implicit def floatToBigDecimal[T](col: TypedColumn[T, Float]): TypedColumn[T, BigDecimal] = col.cast[BigDecimal]
}

object injections {
implicit val javaBoolean: Injection[java.lang.Boolean, Boolean] = Injection(a => a, b => b)
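The `javaBoolean` instance above relies on Scala's implicit (un)boxing between `java.lang.Boolean` and `Boolean`, so identity functions suffice. A self-contained sketch, using a minimal stand-in that mirrors the shape of `frameless.Injection` (in the repo you would use the real trait):

```scala
// Minimal stand-in for frameless.Injection so the example runs on its own.
trait Injection[A, B] extends Serializable {
  def apply(a: A): B   // encode A as B
  def invert(b: B): A  // decode B back to A
}

object Injection {
  def apply[A, B](f: A => B, g: B => A): Injection[A, B] =
    new Injection[A, B] {
      def apply(a: A): B = f(a)
      def invert(b: B): A = g(b)
    }
}

// The instance from the diff: Scala auto-(un)boxes between the Java wrapper
// and the primitive, so both directions are just identity.
val javaBoolean: Injection[java.lang.Boolean, Boolean] =
  Injection(a => a, b => b)
```

Note that `apply` throws a `NullPointerException` if handed a `null` wrapper, which is what the review comment below the diff is getting at.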
Contributor

Did you consider Injection[java.lang.Boolean, Option[Boolean]]? Often when I see Java boxed primitives, it's Java-style Scala code using nulls.
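A sketch of that suggestion, with a stand-in trait included so it runs standalone (the instance name and trait name are illustrative, not from the PR): encoding the Java wrapper as `Option[Boolean]` maps a `null` to `None` instead of failing at encoding time.

```scala
// Stand-in mirroring the shape of frameless.Injection, for a runnable sketch.
trait NullSafeInjection[A, B] extends Serializable {
  def apply(a: A): B
  def invert(b: B): A
}

// Hypothetical null-safe instance: java.lang.Boolean round-trips through
// Option[Boolean], so Java-style nulls become None.
val javaBooleanOpt: NullSafeInjection[java.lang.Boolean, Option[Boolean]] =
  new NullSafeInjection[java.lang.Boolean, Option[Boolean]] {
    def apply(a: java.lang.Boolean): Option[Boolean] =
      Option(a).map(_.booleanValue)                 // null  => None
    def invert(b: Option[Boolean]): java.lang.Boolean =
      b.map(java.lang.Boolean.valueOf).orNull       // None => null
  }
```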

Contributor Author
Hmm, I didn't think about this. From what I tested, using Option here doesn't come for free in terms of the extra code that gets generated. My main goal with the Java types here was completeness and enriching some of the tests.

@imarios
Contributor Author

imarios commented Jan 31, 2017

@OlivierBlanvillain looks to be OK for now! I didn't see any serious issues with 2.1.0.

@OlivierBlanvillain
Contributor

@imarios
Contributor Author

imarios commented Feb 1, 2017

@OlivierBlanvillain I didn't really see any problems in our code. We can try to add an exclusion rule on the Spark dependency to make sure our own shapeless is used. Otherwise we will need to publish shapeless under a separate Maven coordinate so the classpaths don't collide. Shading will not really work for us here.
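The exclusion-rule idea could look roughly like this in sbt. This is a hypothetical fragment: the module that drags shapeless in and the exact coordinates are assumptions, not verified against Spark's POMs.

```scala
// Hypothetical build.sbt fragment: drop Spark's transitive shapeless so
// frameless's own shapeless is the only copy on the classpath.
libraryDependencies ++= Seq(
  ("org.apache.spark" %% "spark-mllib" % "2.1.0" % "provided")
    .exclude("com.chuusai", "shapeless_2.11"),
  "com.chuusai" %% "shapeless" % "2.3.2"
)
```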

@julien-truffaut
Contributor

Will there be a way to get support for several Spark versions, e.g. 2.0 and 2.1?

@imarios
Contributor Author

imarios commented Feb 1, 2017

@julien-truffaut I think the best approach is to release a new version of Frameless for every new version of Spark and have a table saying: for Spark 2.0.x use Frameless 0.2.0, for Spark 2.1.0 use Frameless 0.3.0, etc. Supporting multiple Spark versions in every release would be a lot of extra work for us (testing and validating all new features across different Spark versions). We might want to do that later if we see people asking for it and we have more contributors to actually take on the challenge.

@kanterov
Contributor

kanterov commented Feb 2, 2017

Fixed problems with UDFs in #107

@imarios
Contributor Author

imarios commented Feb 19, 2017

Hey @kanterov, let me know if you want any changes for this PR. thanks!

@jeremyrsmith
Contributor

@imarios hate to be the one to say it, but I really want to use frameless at $work (hence the sudden flurry of PRs) and we're stuck on 2.0 for the time being. If someone needs to maintain cross-building for 2.0 and 2.1 then I'll volunteer, as long as we can drop Scala 2.10 (hopefully nobody is using Spark 2 on Scala 2.10?)
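Cross-building along those lines could be sketched in sbt like so. The property name and version numbers are hypothetical, not taken from the repo's build.

```scala
// Hypothetical build.sbt fragment: pick the Spark version at build time,
// e.g.  sbt -DsparkVersion=2.0.2 test
val sparkVersion = sys.props.getOrElse("sparkVersion", "2.1.0")

libraryDependencies ++= Seq(
  "org.apache.spark" %% "spark-core" % sparkVersion % "provided",
  "org.apache.spark" %% "spark-sql"  % sparkVersion % "provided"
)

// Dropping Scala 2.10, as proposed: cross-build only for 2.11.
crossScalaVersions := Seq("2.11.8")
```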

@imarios
Contributor Author

imarios commented Feb 19, 2017

@jeremyrsmith so glad that you guys use it at work! I think it's a good idea to move Frameless together with Spark. Maybe not same-day releases, but if we are making a new release it should be against the latest Spark. The assumption here is that anyone using frameless in production is also brave enough to update their Spark dependencies as new versions come out :D

@OlivierBlanvillain
Contributor

@imarios Could you explain the need for this injections object? If it's only for testing I would rather have it moved to tests...

@imarios
Contributor Author

imarios commented Apr 24, 2017

@OlivierBlanvillain I will actually revisit this PR. Maybe close it and open two separate ones: one for boosting Spark and one for the Injection tests. Right now I wouldn't consider this for 0.3.0.

@imarios imarios closed this May 27, 2017
6 participants