Speech and Language Technology (SaLT) at the University of Stuttgart
Pinned Loading
Repositories
- speaker-anonymization Public
Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.
-
- conversational-tree-search Public
Code and Data for Conversational Tree Search: A new task that bridges the gap between FAQ-style information retrieval and task-oriented dialog.
- Intrinsic-Subgraph-Generation-for-VQA Public
Predicting a subgraph alongside the answer in a graph based VQA model
- hard-negative-captions Public
- bloomzmms Public
Materials for the publication "Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training"
- multilingual-seq2seq-slu Public
Materials for the publication "Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding"
- VoicePAT Public
VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Most used topics
Loading…