Skip to content

Latest commit

 

History

History
317 lines (257 loc) · 4.73 KB

llm_tts_en.md

File metadata and controls

317 lines (257 loc) · 4.73 KB

llm-tts

Text-to-Speech unit, used to provide text-to-speech services, with options for Chinese and English models to provide text-to-speech services in both languages.

setup

Configure the unit.

Send JSON:

{
  "request_id": "2",
  "work_id": "tts",
  "action": "setup",
  "object": "tts.setup",
  "data": {
    "model": "single_speaker_fast",
    "response_format": "sys.pcm",
    "input": "tts.utf-8",
    "enoutput": false
  }
}
  • request_id: Refer to the basic data explanation.
  • work_id: For configuring the unit, it is tts.
  • action: The method to call is setup.
  • object: The type of data being transmitted is tts.setup.
  • model: The model used is the single_speaker_fast Chinese model.
  • response_format: The returned result is sys.pcm, system audio data, which is directly sent to the llm-audio module for playback.
  • input: Input is tts.utf-8, representing user input.
  • enoutput: Whether to enable user result output.

Response JSON:

{
  "created": 1731488402,
  "data": "None",
  "error": {
    "code": 0,
    "message": ""
  },
  "object": "None",
  "request_id": "2",
  "work_id": "llm.1003"
}
  • created: Message creation time, in Unix time.
  • work_id: The successfully created work_id unit.

link

Link the output of the upper unit.

Send JSON:

{
  "request_id": "3",
  "work_id": "tts.1003",
  "action": "link",
  "object": "work_id",
  "data": "kws.1000"
}

Response JSON:

{
  "created": 1731488402,
  "data": "None",
  "error": {
    "code": 0,
    "message": ""
  },
  "object": "None",
  "request_id": "3",
  "work_id": "tts.1003"
}

error::code 0 indicates successful execution.

Link the llm and tts units, so that when kws wakes up, the tts unit stops the previous unfinished inference, used for repeat wake-up functionality.

When linking, ensure that kws is already configured and in working state. Linking can also be done during the setup phase.

Example:

{
  "request_id": "2",
  "work_id": "tts",
  "action": "setup",
  "object": "tts.setup",
  "data": {
    "model": "single_speaker_fast",
    "response_format": "sys.pcm",
    "input": [
      "tts.utf-8",
      "llm.1002",
      "kws.1000"
    ],
    "enoutput": false
  }
}

unlink

Unlink.

Send JSON:

{
  "request_id": "4",
  "work_id": "tts.1003",
  "action": "unlink",
  "object": "work_id",
  "data": "kws.1000"
}

Response JSON:

{
  "created": 1731488402,
  "data": "None",
  "error": {
    "code": 0,
    "message": ""
  },
  "object": "None",
  "request_id": "4",
  "work_id": "tts.1003"
}

error::code 0 indicates successful execution.

pause

Pause unit work.

Send JSON:

{
  "request_id": "5",
  "work_id": "llm.1003",
  "action": "pause"
}

Response JSON:

{
  "created": 1731488402,
  "data": "None",
  "error": {
    "code": 0,
    "message": ""
  },
  "object": "None",
  "request_id": "5",
  "work_id": "llm.1003"
}

error::code 0 indicates successful execution.

work

Resume unit work.

Send JSON:

{
  "request_id": "6",
  "work_id": "llm.1003",
  "action": "work"
}

Response JSON:

{
  "created": 1731488402,
  "data": "None",
  "error": {
    "code": 0,
    "message": ""
  },
  "object": "None",
  "request_id": "6",
  "work_id": "llm.1003"
}

error::code 0 indicates successful execution.

exit

Exit the unit.

Send JSON:

{
  "request_id": "7",
  "work_id": "llm.1003",
  "action": "exit"
}

Response JSON:

{
  "created": 1731488402,
  "data": "None",
  "error": {
    "code": 0,
    "message": ""
  },
  "object": "None",
  "request_id": "7",
  "work_id": "llm.1003"
}

error::code 0 indicates successful execution.

Task Information

Get task list.

Send JSON:

{
  "request_id": "2",
  "work_id": "tts",
  "action": "taskinfo"
}

Response JSON:

{
  "created": 1731652311,
  "data": [
    "tts.1003"
  ],
  "error": {
    "code": 0,
    "message": ""
  },
  "object": "tts.tasklist",
  "request_id": "2",
  "work_id": "tts"
}

Get task runtime parameters.

Send JSON:

{
  "request_id": "2",
  "work_id": "tts.1003",
  "action": "taskinfo"
}

Response JSON:

{
  "created": 1731652344,
  "data": {
    "enoutput": false,
    "inputs_": [
      "tts.utf-8"
    ],
    "model": "single_speaker_fast",
    "response_format": "sys.pcm"
  },
  "error": {
    "code": 0,
    "message": ""
  },
  "object": "tts.taskinfo",
  "request_id": "2",
  "work_id": "tts.1003"
}

**Note: work_id increases according to the order of unit initialization registration and is not a fixed index value. **
The same type of unit cannot have multiple units working simultaneously, as it may cause unknown errors. For example, tts and melo tts cannot be enabled to work at the same time.