Automatically convert TTS audio to MP3 on demand #102814

synesthesiam · 2023-10-25T19:38:24Z

Breaking change

The ATTR_AUDIO_OUTPUT attribute is deprecated. This previously told the TTS system what audio format to generate. It has been superseded by ATTR_PREFERRED_FORMAT (see below).

Unless ATTR_PREFERRED_FORMAT is given, TTS audio will always be converted to (or kept in) MP3 format.

All TTS audio generation is now non-blocking. A media source id/URL will be returned immediately while TTS audio is generated in the background. Resolving the media/fetching the URL will block until generation is finished.
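The non-blocking behavior described above can be pictured with a small asyncio sketch. This is only an illustration under stated assumptions: `BackgroundTTSCache`, `request_audio`, and `resolve` are hypothetical names, not Home Assistant's actual classes or methods, and the "engine" is a placeholder that just encodes the text.

```python
import asyncio
import uuid


class BackgroundTTSCache:
    """Toy stand-in for a TTS manager (hypothetical, not Home Assistant's class)."""

    def __init__(self) -> None:
        self._tasks = {}  # media id -> asyncio.Task producing audio bytes

    async def _generate(self, message: str) -> bytes:
        # Placeholder for a slow TTS engine call.
        await asyncio.sleep(0.01)
        return message.encode("utf-8")

    def request_audio(self, message: str) -> str:
        """Start generation in the background and return an id right away."""
        media_id = uuid.uuid4().hex
        self._tasks[media_id] = asyncio.create_task(self._generate(message))
        return media_id

    async def resolve(self, media_id: str) -> bytes:
        """Block (asynchronously) until the audio for media_id is ready."""
        return await self._tasks[media_id]


async def demo() -> bytes:
    cache = BackgroundTTSCache()
    media_id = cache.request_audio("hello")  # returns immediately
    return await cache.resolve(media_id)  # waits for generation to finish
```

The caller gets an id before any audio exists; only the code that actually fetches the audio pays the waiting cost.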

Proposed change

Different TTS systems produce audio in different formats, some of which are not compatible with many media players. Some TTS systems support the ATTR_AUDIO_OUTPUT option to change the output format, but there is no guarantee that the TTS system can generate the requested audio format. Wyoming, for example, can only generate WAV files.

This PR adds several things to TTS:

A new ATTR_PREFERRED_FORMAT option lets the caller select a different audio format than what the TTS natively generates, such as "wav". Unless provided, it defaults to MP3.
Two additional options, ATTR_PREFERRED_SAMPLE_RATE and ATTR_PREFERRED_SAMPLE_CHANNELS, allow the caller to control the exact details of the final audio. This is required for ESPHome to stream audio to speakers.
All TTS audio generation is now non-blocking.
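As a rough sketch of how the three preferred-audio options could map onto an ffmpeg conversion, the helper below builds an argument list from an options dictionary. The string values of the constants and the `build_ffmpeg_args` helper are assumptions made for illustration; the PR text names the constants but does not show their values or the exact ffmpeg invocation.

```python
# Assumed string values; the PR description only names the constants.
ATTR_PREFERRED_FORMAT = "preferred_format"
ATTR_PREFERRED_SAMPLE_RATE = "preferred_sample_rate"
ATTR_PREFERRED_SAMPLE_CHANNELS = "preferred_sample_channels"


def build_ffmpeg_args(input_path: str, output_path: str, options: dict) -> list:
    """Build an ffmpeg argument list from the preferred-audio options."""
    # MP3 is the default when no preferred format is given.
    audio_format = options.get(ATTR_PREFERRED_FORMAT, "mp3")
    args = ["ffmpeg", "-y", "-i", input_path, "-f", audio_format]

    sample_rate = options.get(ATTR_PREFERRED_SAMPLE_RATE)
    if sample_rate is not None:
        args += ["-ar", str(sample_rate)]  # output sample rate in Hz

    channels = options.get(ATTR_PREFERRED_SAMPLE_CHANNELS)
    if channels is not None:
        args += ["-ac", str(channels)]  # number of output channels

    args.append(output_path)
    return args
```

With an empty options dictionary the result requests MP3 and leaves the sample rate and channel count to ffmpeg's defaults.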

Lastly, the ESPHome voice assistant code has been updated to request 16 kHz 16-bit mono WAV audio when it will be streamed back to the client. This should now work with any TTS system.
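To make the "16 kHz 16-bit mono WAV" request concrete, here is a minimal sketch using only Python's standard `wave` module. It writes a silent clip in that format, then checks the format and strips the WAV header to get raw samples. The helper names are hypothetical and this is not the ESPHome integration's code.

```python
import io
import wave

SAMPLE_RATE = 16000  # 16 kHz
SAMPLE_WIDTH = 2  # 16-bit samples are 2 bytes each
CHANNELS = 1  # mono


def make_silent_wav(seconds: float) -> bytes:
    """Return a WAV file of silence in 16 kHz 16-bit mono format."""
    num_frames = int(SAMPLE_RATE * seconds)
    with io.BytesIO() as buffer:
        with wave.open(buffer, "wb") as wav_file:
            wav_file.setnchannels(CHANNELS)
            wav_file.setsampwidth(SAMPLE_WIDTH)
            wav_file.setframerate(SAMPLE_RATE)
            wav_file.writeframes(bytes(num_frames * SAMPLE_WIDTH * CHANNELS))
        return buffer.getvalue()


def wav_to_raw_pcm(wav_bytes: bytes) -> bytes:
    """Verify the expected format and return samples without the WAV header."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav_file:
        actual = (
            wav_file.getframerate(),
            wav_file.getsampwidth(),
            wav_file.getnchannels(),
        )
        if actual != (SAMPLE_RATE, SAMPLE_WIDTH, CHANNELS):
            raise ValueError("Expected 16 kHz 16-bit mono audio")
        return wav_file.readframes(wav_file.getnframes())
```

At this format one second of audio is 16000 × 2 × 1 = 32000 bytes of raw samples, which is useful when sizing chunks for streaming.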

Type of change

Dependency upgrade
Bugfix (non-breaking change which fixes an issue)
New integration (thank you!)
New feature (which adds functionality to an existing integration)
Deprecation (breaking change to happen in the future)
Breaking change (fix/feature causing existing functionality to break)
Code quality improvements to existing code or addition of tests

Additional information

This PR fixes or closes issue: fixes "Piper TTS not working with DLNA connected speakers" (addons#3030) and "Wyoming integration returning incorrect URLs from piper" (#92969)
This PR is related to issue:
Link to documentation pull request:

Checklist

The code change is tested and works locally.
Local tests pass. Your PR cannot be merged unless tests pass
There is no commented out code in this PR.
I have followed the development checklist
I have followed the perfect PR recommendations
The code has been formatted using Black (black --fast homeassistant tests)
Tests have been added to verify that the new code works.

If user exposed functionality or configuration variables are added/changed:

Documentation added/updated for www.home-assistant.io

If the code communicates with devices, web services, or third-party tools:

The manifest file has all fields filled out correctly.
Updated and included derived files by running: python3 -m script.hassfest.
New or updated dependencies have been added to requirements_all.txt.
Updated by running python3 -m script.gen_requirements_all.
For the updated dependencies - a link to the changelog, or at minimum a diff between library versions is added to the PR description.
Untested files have been added to .coveragerc.

To help with the load of incoming pull requests:

I have reviewed two other open pull requests in this repository.

home-assistant · 2023-10-25T19:38:32Z

Hey there @balloob, mind taking a look at this pull request as it has been labeled with an integration (wyoming) you are listed as a code owner for? Thanks!

Code owner commands

Code owners of wyoming can trigger bot actions by commenting:

@home-assistant close Closes the pull request.
@home-assistant rename Awesome new title Renames the pull request.
@home-assistant reopen Reopen the pull request.
@home-assistant unassign wyoming Removes the current integration label and assignees on the pull request, add the integration domain after the command.

home-assistant · 2023-10-25T19:38:33Z

Hey there @home-assistant/core, @pvizeli, mind taking a look at this pull request as it has been labeled with an integration (tts) you are listed as a code owner for? Thanks!

Code owner commands

Code owners of tts can trigger bot actions by commenting:

@home-assistant close Closes the pull request.
@home-assistant rename Awesome new title Renames the pull request.
@home-assistant reopen Reopen the pull request.
@home-assistant unassign tts Removes the current integration label and assignees on the pull request, add the integration domain after the command.


synesthesiam · 2023-10-27T20:10:05Z

TODO: The TTS memory and file caches need to know about the multiple formats available. At the moment, MP3 replaces the cache after conversion. This means the original files will not be accessible without probing the cache directory.
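One way to picture the problem in this TODO is a cache keyed by both the cache key and the audio format, so a converted MP3 does not overwrite the original. The sketch below is purely illustrative: `FormatAwareCache` is a hypothetical class and is not how the PR or Home Assistant implements its caches.

```python
class FormatAwareCache:
    """In-memory cache that stores each audio format separately (hypothetical)."""

    def __init__(self) -> None:
        self._entries = {}  # (cache_key, audio_format) -> audio bytes

    def put(self, cache_key: str, audio_format: str, data: bytes) -> None:
        # Normalize the format so "MP3" and "mp3" share one entry.
        self._entries[(cache_key, audio_format.lower())] = data

    def get(self, cache_key: str, audio_format: str):
        """Return cached audio for this key and format, or None if absent."""
        return self._entries.get((cache_key, audio_format.lower()))


cache = FormatAwareCache()
cache.put("hello_en", "wav", b"original-wav-bytes")
cache.put("hello_en", "mp3", b"converted-mp3-bytes")
```

Both entries stay retrievable because the format is part of the key.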


MartinHjelmare · 2023-11-06T22:50:32Z

homeassistant/components/tts/__init__.py

+            if proc.returncode != 0:
+                _LOGGER.error(stderr.decode())
+                raise RuntimeError(
+                    f"Unexpected error while running ffmpeg with arguments: {command}. See log for details."


Please break long strings around max 88 characters per line.
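A common way to follow this kind of request in Python is implicit concatenation of adjacent string literals inside parentheses. The helper below is a hypothetical sketch of that technique using the message from the quoted diff; it is not the change that was eventually made in the repository.

```python
def ffmpeg_error_message(command: list) -> str:
    """Build the error text from adjacent string literals."""
    # Adjacent literals are joined at compile time, so each source line can
    # stay under the 88-character limit without changing the message.
    return (
        f"Unexpected error while running ffmpeg with arguments: {command}. "
        "See log for details."
    )
```

Only the first literal needs the `f` prefix because it is the only one containing a placeholder.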

home-assistant bot added cla-signed core Hacktoberfest has-tests integration: tts integration: wyoming new-feature labels Oct 25, 2023

home-assistant bot assigned balloob and pvizeli Oct 25, 2023

home-assistant bot added by-code-owner Quality Scale: No score labels Oct 25, 2023

home-assistant bot added the Quality Scale: internal label Oct 25, 2023

synesthesiam mentioned this pull request Oct 25, 2023

Default to MP3 for Wyoming TTS #92867

Closed

20 tasks

synesthesiam marked this pull request as ready for review October 26, 2023 02:20

synesthesiam requested review from balloob, pvizeli and a team as code owners October 26, 2023 02:20

balloob reviewed Oct 26, 2023

homeassistant/components/wyoming/tts.py (outdated)

balloob reviewed Oct 27, 2023

homeassistant/components/assist_pipeline/pipeline.py

balloob reviewed Oct 27, 2023

homeassistant/components/tts/__init__.py (outdated)

balloob reviewed Oct 27, 2023

homeassistant/components/tts/__init__.py (outdated)

balloob reviewed Oct 27, 2023

homeassistant/components/assist_pipeline/pipeline.py (outdated)

synesthesiam changed the title ~~Add ATTR_PREFERRED_FORMAT to TTS for auto-converting audio~~ Automatically convert TTS audio to MP3 on demand Oct 27, 2023

synesthesiam marked this pull request as draft October 27, 2023 20:07

balloob reviewed Oct 29, 2023

tests/components/tts/test_init.py (outdated)

balloob reviewed Oct 30, 2023

homeassistant/components/tts/__init__.py (outdated)

home-assistant bot added the cla-error label Nov 2, 2023

synesthesiam added 8 commits November 2, 2023 21:04

Prefer MP3 in pipelines

25fff0e

Automatically convert to mp3 on demand

35cd120

Add preferred audio format

6c0da27

Break out preferred format

9293ec2

Add ATTR_BLOCKING to allow async fetching

9c68cff

Make a copy of supported options

b6c4d46

Fix MaryTTS tests

6183bc5

Update ESPHome to use "wav" instead of "raw"

6eaf9bc

synesthesiam force-pushed the synestheisam-20231025-tts-autoconvert branch from 88c0629 to 6eaf9bc Compare November 3, 2023 02:22

synesthesiam requested review from OttoWinter, jesserockz and bdraco as code owners November 3, 2023 02:22

synesthesiam mentioned this pull request Nov 3, 2023

Use ffmpeg to output raw audio to voice assistant in ESPHome #102869

Closed

20 tasks

bdraco added cla-recheck and removed cla-error labels Nov 3, 2023

home-assistant bot removed the cla-recheck label Nov 3, 2023

balloob reviewed Nov 3, 2023

homeassistant/components/esphome/voice_assistant.py

balloob reviewed Nov 3, 2023

homeassistant/components/tts/__init__.py

balloob reviewed Nov 3, 2023

homeassistant/components/tts/__init__.py

balloob reviewed Nov 3, 2023

homeassistant/components/tts/__init__.py (outdated)

synesthesiam added 4 commits November 3, 2023 17:37

Clean up tests, remove blocking

ee61b98

Clean up rest of TTS tests

423aeef

Fix ESPHome tests

cc16d47

More test coverage

bad035b

balloob approved these changes Nov 6, 2023

balloob merged commit ae516ff into dev Nov 6, 2023
53 checks passed

balloob deleted the synestheisam-20231025-tts-autoconvert branch November 6, 2023 20:26

MartinHjelmare reviewed Nov 6, 2023

github-actions bot locked and limited conversation to collaborators Nov 7, 2023
