[text-to-speech]  Spaces in text are encoded as +

- [ ] Steps to reproduce:
`TSS.synthesize("Hello Bob", Voice.EN_LISA`);

- [ ] Expected behavior
Audio for "Hello Bob"

- [ ] Actual behavior
Audio for "Hello+Bob"

- [ ] JDK version: OpenJDK 1.8.0_121

- [ ] java-sdk version: 3.7.1

With commit 7d9bbd7dc18b80483b73cf25d398d0ed07b9cbbc, to resolve #602, the text is now url-encoded before being passed off to okhttp.

Unfortunately, `RequestUtils.encode` calls `URLEncoder.encode()`, which performs form-encoding instead of %-encoding. okhttp then does proper %-encoding, which results in requests for synthesizing "Hi Bob" becoming:

    https://stream.watsonplatform.net/text-to-speech/api/v1/synthesize?text=Hi%252C2BBob&voice=en-US_LisaVoice&accept=audio/l16;%20rate%3D48000

Issue #602 seems to be caused by the `é` character being encoded as UTF-8 by okhttp (`0xC3 0xA9`) but decoded as ASCII by the backend, hence the `BadRequestException: 'ascii' codec can't decode byte 0xc3` error.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[text-to-speech] Spaces in text are encoded as + #635

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[text-to-speech] Spaces in text are encoded as + #635

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions