Skip to content
@DigitalPhonetics

Speech and Language Technology (SaLT) at the University of Stuttgart

Research institute in the field of speech, natural language processing and machine learning

Pinned

  1. IMS-Toucan IMS-Toucan Public

    Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

    Python 449 80

  2. VoicePAT VoicePAT Public

    VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.

    Shell 41 4

  3. bloomzmms bloomzmms Public

    Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"

    Python

  4. conversational-tree-search conversational-tree-search Public

    Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.

    Python 5

Repositories

Showing 10 of 17 repositories
  • IMS-Toucan Public

    Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

    Python 449 Apache-2.0 80 30 0 Updated Apr 23, 2024
  • bloomzmms Public

    Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"

    Python 0 0 0 0 Updated Apr 16, 2024
  • VoicePAT Public

    VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.

    Shell 41 Apache-2.0 4 2 0 Updated Apr 10, 2024
  • conversational-tree-search Public

    Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.

    Python 5 0 0 0 Updated Mar 22, 2024
  • speaker-anonymization Public

    Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.

    Python 41 GPL-3.0 4 2 0 Updated Mar 14, 2024
  • Python 4 0 0 0 Updated Feb 23, 2024
  • diagraph Public

    DIAGRAPH: An open-source graphic interface for dialog flow design

    Python 2 GPL-3.0 0 0 0 Updated Oct 23, 2023
  • multilingual-seq2seq-slu Public

    Materials for the publication "Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding"

    Python 2 0 0 0 Updated Oct 11, 2023
  • adviser Public

    ADvISER is a flexible framework to encourage task-oriented dialog system research & development

    Python 55 GPL-3.0 32 3 8 Updated Aug 14, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…