Fix spray-json unmarshalling #699

jrudolph · 2016-12-30T14:26:50Z

Changes:

upgrade to spray-json 1.3.3 (which has working 4-byte UTF-8 character decoding for all Scala versions)
use new IndexedBytesParserInput to avoid copied code from spray-json
refactor unmarshallers to base all on a single implementation
fix tests to actually test more kinds of UTF-8 characters

Fixes #691.

akka-ci · 2016-12-30T14:36:06Z

Test FAILed.

jrudolph · 2016-12-30T14:53:28Z

Needed to mima around some things... ;)

akka-ci · 2016-12-30T14:55:53Z

Test PASSed.

jonas · 2016-12-30T16:39:34Z

...src/main/scala/akka/http/scaladsl/marshallers/sprayjson/SprayJsonByteStringParserInput.scala

@@ -70,6 +28,7 @@ final class SprayJsonByteStringParserInput(bytes: ByteString) extends DefaultPar
    StandardCharsets.UTF_8.decode(bytes.slice(start, end).asByteBuffer).array()
 }

+@deprecated("Not needed any more. Should have been private.", "10.0.2")
 object SprayJsonByteStringParserInput {
  private final val EOI = '\uFFFF'


Would it make sense to remove the private members?

jonas · 2016-12-30T16:56:49Z

...tp-spray-json/src/main/scala/akka/http/scaladsl/marshallers/sprayjson/SprayJsonSupport.scala

-  implicit def sprayJsonByteStringUnmarshaller[T](implicit reader: RootJsonReader[T]): FromByteStringUnmarshaller[T] =
+  implicit def sprayJsValueUnmarshaller: FromEntityUnmarshaller[JsValue] =
+    Unmarshaller.byteStringUnmarshaller
+      .forContentTypes(`application/json`)


If I understand correctly this forces the content type to avoid errors but keeps the charset attribute, which the old code explicitly handled. With the move to use SprayJsonByteStringParserInput is this explicit decoding no longer needed?

Content type json includes it being UTF-8 actually so this forces that.

(as according to the JSON spec)

So do we need a test that it will re-encode the original content? I'm mainly concerned that this could introduce a regression in the sense that the original special case must have been put in place for a reason.

According to http://www.iana.org/assignments/media-types/application/json adding charset parameter should have no effect:

Note: No "charset" parameter is defined for this registration. Adding one really has no effect on compliant recipients.

So, yes, we might introduce a regression for people relying on non-standard behavior. In reality, it might not be a big deal because almost everyone seems to be on UTF-8 nowadays. It is still possible to create a custom unmarshaller if you need to deal with non-standard data. If it turns out that this is a pain point for lots of people we can still revert to more relaxed behavior.

Thanks for the clarification.

ktoso · 2017-01-02T10:48:43Z

...src/main/scala/akka/http/scaladsl/marshallers/sprayjson/SprayJsonByteStringParserInput.scala

-      } else EOI
-    }
-  }
+@deprecated("Will be made private.", "10.0.2")


ktoso · 2017-01-02T10:55:57Z

akka-http/src/main/scala/akka/http/javadsl/unmarshalling/Unmarshaller.scala

+   * Deprecated in favor of [[unmarshal]].
+   */
+  @deprecated("Use unmarshal instead.", "10.0.2")
+  def unmarshall(a: A, ec: ExecutionContext, mat: Materializer): CompletionStage[B] = unmarshal(a, ec, mat)


ktoso · 2017-01-02T10:56:28Z

project/Dependencies.scala

@@ -57,7 +57,7 @@ object Dependencies {
    val alpnApi     = "org.eclipse.jetty.alpn"        % "alpn-api"                     % "1.1.3.v20160715" // ApacheV2

    object Docs {
-      val sprayJson   = "io.spray"                   %%  "spray-json"                  % "1.3.2"             % "test"
+      val sprayJson   = Compile.sprayJson % "test"


align the "test" please?

ktoso

Had a minor comment but LGTM in general 👍

Changes: * upgrade to spray-json 1.3.3 (which has working 4-byte UTF-8 character decoding for all Scala versions) * use new IndexedBytesParserInput to avoid copied code from spray-json * refactor unmarshallers to base all on a single implementation * fix tests to actually test more kinds of UTF-8 characters

akka-ci · 2017-01-02T14:53:24Z

Test PASSed.

jrudolph added 2 commits December 30, 2016 15:17

!htp deprecate wrongly spelled method

d310a0d

+htp akka#691 add Unmarshaller.andThen to combine two Unmarshallers

cb795f0

akka-ci added the validating PR that is currently being validated by Jenkins label Dec 30, 2016

jrudolph force-pushed the jr/w/691-spray-json-UTF-8-decoding branch from 948d7c4 to dd3c33f Compare December 30, 2016 14:49

akka-ci added the validating PR that is currently being validated by Jenkins label Dec 30, 2016

akka-ci added tested PR that was successfully built and tested by Jenkins and removed validating PR that is currently being validated by Jenkins labels Dec 30, 2016

jonas reviewed Dec 30, 2016

View reviewed changes

ktoso reviewed Jan 2, 2017

View reviewed changes

ktoso approved these changes Jan 2, 2017

View reviewed changes

jonas mentioned this pull request Jan 2, 2017

Version 1.3.2 for Scala 2.12 is different from the one for Scala 2.11 and 2.10 spray/spray-json#213

Closed

jrudolph added 2 commits January 2, 2017 15:42

=pro realign a few dependencies

d67300e

jrudolph force-pushed the jr/w/691-spray-json-UTF-8-decoding branch from dd3c33f to d67300e Compare January 2, 2017 14:43

akka-ci added validating PR that is currently being validated by Jenkins tested PR that was successfully built and tested by Jenkins and removed tested PR that was successfully built and tested by Jenkins validating PR that is currently being validated by Jenkins labels Jan 2, 2017

jrudolph merged commit 0b227d7 into akka:master Jan 2, 2017

jrudolph deleted the jr/w/691-spray-json-UTF-8-decoding branch February 20, 2017 16:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix spray-json unmarshalling #699

Fix spray-json unmarshalling #699

jrudolph commented Dec 30, 2016

akka-ci commented Dec 30, 2016

jrudolph commented Dec 30, 2016

akka-ci commented Dec 30, 2016

jonas Dec 30, 2016

jonas Dec 30, 2016

ktoso Jan 2, 2017

ktoso Jan 2, 2017

jonas Jan 2, 2017

jrudolph Jan 2, 2017

jonas Jan 2, 2017

ktoso Jan 2, 2017

ktoso Jan 2, 2017

ktoso Jan 2, 2017

ktoso left a comment

akka-ci commented Jan 2, 2017

Fix spray-json unmarshalling #699

Fix spray-json unmarshalling #699

Conversation

jrudolph commented Dec 30, 2016

akka-ci commented Dec 30, 2016

jrudolph commented Dec 30, 2016

akka-ci commented Dec 30, 2016

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ktoso left a comment

Choose a reason for hiding this comment

akka-ci commented Jan 2, 2017