Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Updated Aug 3, 2024
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities"
text-to-audio-latent-diffusion
Code to train a custom time-domain autoencoder for audio dereverberation
A deep learning-based Speech Emotion Recognition (SER) model trained primarily on Indian languages. Designed for applications in call centers, sentiment analysis, and accessibility tools.
Guide to deploying neural networks in VST plugins, with a specific focus on embedded devices using the Elk Audio OS
🗣️ Audio AI: Your Audio & Video Transcription Powerhouse!
Whether it’s text or a link, it can be turned into a podcast!