KeiKinn
🤯
Focusing
  • Technical University of Munich
  • Germany

Highlights

  • Pro


PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/Docker/Zotero

Python 19,450 1,612 Updated Mar 24, 2025

Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'

Python 111 4 Updated Mar 24, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,577 708 Updated Mar 27, 2025

Integrate the DeepSeek API into popular software

30,497 3,306 Updated Mar 27, 2025

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,323 179 Updated Feb 14, 2025

Automated iOS Backup Robot

Swift 2,480 175 Updated Mar 7, 2025

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 337 21 Updated Sep 3, 2024
Python 122 13 Updated Aug 19, 2024

[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

Python 216 10 Updated Jun 17, 2024

Automatically Update Text-to-speech (TTS) Papers Daily using GitHub Actions (Update Every 12 Hours)

Python 390 23 Updated Mar 28, 2025

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,097 80 Updated Feb 19, 2025

Enjoy the magic of Diffusion models!

Python 8,139 728 Updated Mar 26, 2025

An Open-source Streaming High-fidelity Neural Audio Codec

Python 461 22 Updated Mar 4, 2025

spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io

Python 49 7 Updated Jul 8, 2023

Mamba SSM architecture

Python 14,407 1,259 Updated Jan 18, 2025

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,760 205 Updated Mar 8, 2024

High-quality PNGs for logos I made for fun

CSS 5,790 296 Updated Jun 3, 2024

Minimal Implementation of a D3PM in pytorch

Jupyter Notebook 206 15 Updated Apr 22, 2024

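Xiaohongshu notes | comment crawler, Douyin videos | comment crawler, Kuaishou videos | comment crawler, Bilibili videos | comment crawler, Weibo posts | comment crawler, Baidu Tieba posts | Baidu Tieba comment-reply crawler | Zhihu Q&A articles | comment crawler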

Python 21,203 6,143 Updated Mar 23, 2025

[ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP

Python 37 3 Updated Jul 11, 2022

🛠️ ❤️ Want to know NixOS & Flakes in detail? Looking for a beginner-friendly tutorial? Then you've come to the right place!

Nix 2,355 114 Updated Mar 18, 2025

The speaker-wise f0 search ranges of the LibriTTS-R corpus.

2 Updated Dec 26, 2023

A curated list of awesome voice conversion projects and communities.

227 13 Updated Jan 13, 2025

Foundational model for human-like, expressive TTS

Python 4,076 682 Updated Jul 30, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 43,257 4,816 Updated Mar 26, 2025

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Python 382 32 Updated Jul 9, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,863 692 Updated Mar 3, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 28,450 3,264 Updated Mar 28, 2025

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 909 40 Updated Sep 27, 2024
Python 1,057 329 Updated Feb 27, 2025