-
Technical University of Munich
- Germany
Highlights
- Pro
Stars
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
Toolkit for linearizing PDFs for LLM datasets/training
Integrate the DeepSeek API into popular softwares
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Enjoy the magic of Diffusion models!
An Open-source Streaming High-fidelity Neural Audio Codec
spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
High-quality PNGs for logos I made for fun
Minimal Implementation of a D3PM in pytorch
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
[ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP
🛠️ ❤️ Want to know NixOS & Flakes in detail? Looking for a beginner-friendly tutorial? Then you've come to the right place! 想要学习使用 NixOS 与 Flakes 吗?在寻找一份新手友好的教程?那你可来对地方了!
The speaker-wise f0 search ranges of the LibriTTS-R corpus.
A curated list of awesome voice conversion, projects and communities.
Foundational model for human-like, expressive TTS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
You like pytorch? You like micrograd? You love tinygrad! ❤️
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701