- Russia, Saint-Petersburg
- edmy.ru
Starred repositories
List of Telegram groups, channels & bots // List of interesting Telegram groups, channels, and bots // List of chats for programmers
A collection of resources and papers on Diffusion Models
Pretrained language model with 100B parameters
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a ve…
Official baseline solutions to Yandex Cup ML challenge
Lingtrain Alignment Studio is an ML-based app for aligning texts in different languages. It can produce parallel corpora and parallel books.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Authors' implementation of DeepSpeech Distances.
A PyTorch implementation of StarGAN-VC2
spring-media / ForwardTacotron
Forked from fatchord/WaveRNN
⏩ Generating speech in a single forward pass without any attention!
"Deep Learning" course (Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University)
A PyTorch implementation of EATS: End-to-End Adversarial Text-to-Speech
Awesome list of TTS papers with audio samples
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
This library provides common speech features for ASR including MFCCs and filterbank energies.
(re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition
🤖 Interactive Machine Learning experiments: 🏋️models training + 🎨models demo
📚 "Machine Learning and Data Analysis" specialization
[Russian] This script splits an audio file on silence, transcribes it with Google speech recognition, and saves the result in the LJSpeech-1.1 dataset format.
Open Machine Learning course