feat:add tts-streaming config and future #5492

ic-xu · 2024-06-21T14:54:25Z

Description

Add TTS streaming configuration and feature support, add text to speech in the feature configuration options, click on configure to start

Fixes #5251

Type of Change

New feature (non-breaking change which adds functionality)

How Has This Been Tested?

Simply configure the streaming TTS option and test it

crazywoola · 2024-06-22T03:55:10Z

@charli117 Please review this pr as well.

charli117 · 2024-06-22T16:57:52Z

@crazywoola ic-xu is a colleague of our team, and the version submitted now is the version we are currently using. There is also an optimization point that requires adding a blocking mode output logic for models that are not compatible and do not support streaming tts. We have communicated with each other and merged later

api/controllers/web/audio.py

api/core/app/apps/advanced_chat/generate_task_pipeline.py

api/core/app/apps/base_app_queue_manager.py

…e of streaming to implement TTS functionality

It has been extracted as a separate module, and processing logic has been added to the corresponding module in GenerateTask Pipeline. process(). At the same time, the expiration time has been set to be small. At the same time, Redis will not store all message objects, but only a portion of the required data, so the data will not be very large

web/types/app.ts

web/i18n/zh-Hans/app-debug.ts

…io messages

web/models/debug.ts

web/app/components/base/features/types.ts

crazywoola

See comments

takatost · 2024-07-05T06:31:03Z

I tested the manual play mode and found it to be unresponsive, with the chrome console reporting the error message as:

Uncaught (in promise) SyntaxError: Unexpected token '�', "�����bt9��"... is not valid JSON

takatost · 2024-07-05T10:34:34Z

Maybe it's because they've all switched to streaming mode now, and the web front-end isn't compatible with manual playback mode.

ic-xu · 2024-07-05T11:09:53Z

Maybe it's because they've all switched to streaming mode now, and the web front-end isn't compatible with manual playback mode.

Could you provide a test browser? We are using Chrome for testing, which works well and has a fast response speed.

takatost

Ops, now in the basic chatbot, clicking the play button does not send a request.
Additionally, if the TTS function is enabled for the first time in the Chatflow, clicking the play button will result in an error. This might be due to using a draft version instead of a published one. The error message is as follows:

Traceback (most recent call last):
  File "/Users/takatost/Projects/dify/api/controllers/console/app/audio.py", line 97, in post
    text_to_speech = app_model.workflow.features_dict.get('text_to_speech')
AttributeError: 'NoneType' object has no attribute 'features_dict'

…w, clicking the play button will result in an error.

Fixed audio playback lag, console prompts json parsing error, automatic playback failure, etc. bugs

takatost

thx!

* refs/heads/main: (51 commits) feat: tailwind related improvement (#6085) feat: support AnalyticDB vector store (#5586) feat:add tts-streaming config and future (#5492) Feat: add index bar to select tool panel of workflow (#6066) bump to 0.6.13 (#6078) fix: Inconsistency Between Actual and Debug Input Variables (#6055) refactor: revamp picker block (#4227) chore: remove tsne unused code (#6077) fix: relative in overflow div (#5998) chore(action): move docker login above Set up QEMU in build-push action workflow (#6073) remove clunky welcome message (#6069) feat: add request_params field to jina_reader tool (#5610) fix azure stream download (#6063) chore: hide tracing introduce detail (#6049) Address the issue of the absence of poetry in the development container. (#6036) Fix authorization header validation to handle bearer types correctly - "authorization config header is required" error (#6040) Fix/6034 get random order of categories in explore and workflow is missing in zh hant (#6043) Modify slack webhook url validation to allow workflow (#6041) (#6042) fix(configs): Update pydantic settings in config files (#6023) Fix/incorrect parameter extractor memory (#6038) ...

feat:add tts-streaming config and future

fb0437f

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. 💪 enhancement New feature or request labels Jun 21, 2024

ic-xu added 2 commits June 21, 2024 22:59

feat:add tts-streaming config and future

6b6fca7

feat:fix return type

ec4255f

crazywoola requested a review from takatost June 22, 2024 03:54

takatost requested changes Jun 23, 2024

View reviewed changes

api/controllers/web/audio.py Outdated Show resolved Hide resolved

api/core/app/apps/advanced_chat/generate_task_pipeline.py Show resolved Hide resolved

api/core/app/apps/base_app_queue_manager.py Outdated Show resolved Hide resolved

charli117 mentioned this pull request Jun 24, 2024

When requesting /v1/text-to-audio with the streaming parameter set to true, get the TypeError: 'function' object is not iterable #5251

Closed

4 tasks

Merge remote-tracking branch 'main/main' into stream_tts

e5a70f7

takatost requested changes Jun 25, 2024

View reviewed changes

api/core/app/apps/base_app_queue_manager.py Outdated Show resolved Hide resolved

ic-xu added 3 commits June 25, 2024 16:25

feat:Merge blocking and streaming interfaces in TTS, and unify the us…

62b7297

…e of streaming to implement TTS functionality

Merge remote-tracking branch 'main/main' into stream_tts

32292f7

crazywoola requested a review from takatost July 3, 2024 11:31

crazywoola reviewed Jul 3, 2024

View reviewed changes

web/types/app.ts Outdated Show resolved Hide resolved

crazywoola reviewed Jul 3, 2024

View reviewed changes

web/i18n/zh-Hans/app-debug.ts Show resolved Hide resolved

ic-xu added 2 commits July 4, 2024 15:10

Merge remote-tracking branch 'main/main' into stream_tts

77fdf91

feat :tts removes the redis middleware usage and uses sse to push aud…

18eb5c1

…io messages

dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:XL This PR changes 500-999 lines, ignoring generated files. labels Jul 4, 2024

ic-xu added 2 commits July 4, 2024 17:23

feat: Add i18n description ，Use enumerations instead of string constants

4a72190

feat : fixed tts config

a84c75a

crazywoola reviewed Jul 4, 2024

View reviewed changes

web/models/debug.ts Outdated Show resolved Hide resolved

crazywoola reviewed Jul 4, 2024

View reviewed changes

web/app/components/base/features/types.ts Outdated Show resolved Hide resolved

crazywoola reviewed Jul 4, 2024

View reviewed changes

ic-xu added 3 commits July 5, 2024 09:36

feat: use enum TtsAutoPlay replace string 'enable'|disable

12ce7c6

Merge remote-tracking branch 'main/main' into stream_tts

258d4a0

feat : Remove unused code

3e3c67a

crazywoola previously approved these changes Jul 5, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Jul 5, 2024

fix : Fix TTS auto-playback stuttering and audio distortion bug

bd1f463

ic-xu dismissed crazywoola’s stale review via bd1f463 July 5, 2024 08:49

fix : Fix TTS auto-playback stuttering and audio distortion bug

a5642a8

ic-xu added 2 commits July 5, 2024 22:42

fix : Compatible with older versions of the API

0506757

feat: Fix the bug that the last message cannot be played manually.

36b2996

takatost requested changes Jul 6, 2024

View reviewed changes

dosubot bot removed the lgtm This PR has been approved by a maintainer label Jul 6, 2024

ic-xu added 2 commits July 8, 2024 08:56

fix : fixed TTS function is enabled for the first time in the Chatflo…

c3b0e14

…w, clicking the play button will result in an error.

feat:

6bb76bc

Fixed audio playback lag, console prompts json parsing error, automatic playback failure, etc. bugs

crazywoola approved these changes Jul 9, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Jul 9, 2024

takatost approved these changes Jul 9, 2024

View reviewed changes

takatost merged commit 6ef401a into langgenius:main Jul 9, 2024
5 checks passed

takatost mentioned this pull request Jul 15, 2024

bump to 0.6.14 #6294

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat:add tts-streaming config and future #5492

feat:add tts-streaming config and future #5492

ic-xu commented Jun 21, 2024 •

edited by crazywoola

Loading

crazywoola commented Jun 22, 2024

charli117 commented Jun 22, 2024

crazywoola left a comment

takatost commented Jul 5, 2024

takatost commented Jul 5, 2024

ic-xu commented Jul 5, 2024

takatost left a comment

takatost left a comment

feat:add tts-streaming config and future #5492

feat:add tts-streaming config and future #5492

Conversation

ic-xu commented Jun 21, 2024 • edited by crazywoola Loading

Description

Type of Change

How Has This Been Tested?

crazywoola commented Jun 22, 2024

charli117 commented Jun 22, 2024

crazywoola left a comment

Choose a reason for hiding this comment

takatost commented Jul 5, 2024

takatost commented Jul 5, 2024

ic-xu commented Jul 5, 2024

takatost left a comment

Choose a reason for hiding this comment

takatost left a comment

Choose a reason for hiding this comment

ic-xu commented Jun 21, 2024 •

edited by crazywoola

Loading