ghlee3401/PaperList

My Paper

Jamming Prediction for Radar Signals Using Machine Learning Methods https://www.hindawi.com/journals/scn/2020/2151570/

PaperList

TTS

| Date | Link | Name | Blog |
| --- | --- | --- | --- |
| 2020.01.11 | Arxiv | FastPitch: Parallel Text-to-speech with Pitch Prediction | |
| 2020.01.08 | Arxiv | FastSpeech 2: Fast and High-Quality End-to-End Text to Speech | |
| 2019.05.22 | Arxiv | FastSpeech: Fast, Robust and Controllable Text to Speech | |
| 2020.03.05 | Arxiv | AlignTTS: Efficient Feed-Forward Text-to-Speech System without Explicit Alignment | |

Vocoder

| Date | Link | Name | Blog |
| --- | --- | --- | --- |
| 2019.12.05 | Arxiv | Towards Robust Neural Vocoding for Speech Generation: A Survey | |
| 2019.12.03 | Arxiv | WaveFlow: A Compact Flow-based Model for Raw Audio | |
| 2019.10.31 | Arxiv | WaveGlow: A Flow-based Generative Network for Speech Synthesis | |
| 2019.10.25 | Arxiv | Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram | |
| 2019.10.08 | Arxiv | MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis | |
| 2019.04.09 | Arxiv | Probability density distillation with generative adversarial networks for high-quality parallel waveform generation | |
| 2018.11.09 | Arxiv | ExcitNet vocoder: A neural excitation model for parametric speech synthesis systems | |

GAN

| Date | Link | Name | Blog |
| --- | --- | --- | --- |
| 2019.12.03 | Arxiv | Analyzing and Improving the Image Quality of StyleGAN | |
| 2019.08.06 | Arxiv | Adversarially Trained End-to-end Korean Singing Voice Synthesis System | |
| 2019.04.09 | Arxiv | A New GAN-based End-to-End TTS Training Algorithm | |

Bi-Lingual, Multi-Lingual, Cross-Lingual

| Date | Link | Name | Blog |
| --- | --- | --- | --- |
| 2020.06.26 | Arxiv | Multilingual Jointly Trained Acoustic and Written Word Embeddings | |
| 2020.06.26 | Arxiv | Unsupervised Cross-lingual Representation Learning for Speech Recognition | |
| 2020.01.29 | Arxiv | Learning Robust and Multilingual Speech Representations | |
| 2019.11.26 | Arxiv | Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers | Link |
| 2019.07.09 | Arxiv | Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning | |
| 2018.12.04 | Arxiv | Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain | |

Representations

| Date | Link | Name | Blog |
| --- | --- | --- | --- |
| 2020.06.25 | Arxiv | wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations | |
| 2019.12.03 | Arxiv | Singing Voice Conversion with Disentangled Representations of Singer and Vocal Technique Using Variational Autoencoders | |
| 2019.11.28 | Arxiv | Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech | |
| 2019.11.27 | Arxiv | Powerful Speaker Embedding Training Framework by Adversarially Disentangled Identity Representation | |
| 2019.04.04 | Arxiv | Multi-reference Tacotron by Intercross Training for Style Disentangling, Transfer and Control in Speech Synthesis | |

Enhancement (+ Super Resolution)

| Date | Link | Name | Blog |
| --- | --- | --- | --- |
| 2020.06.25 | Arxiv | Real Time Speech Enhancement in the Waveform Domain | |
| 2019.12.03 | Arxiv | High-quality Speech Synthesis Using Super-resolution Mel-Spectrogram | |

Normalizing Flow

| Date | Link | Name | Blog |
| --- | --- | --- | --- |
| 2019.01.30 | Arxiv | Emerging Convolutions for Generative Normalizing Flows | |

Optimization

| Date | Link | Name | Blog |
| --- | --- | --- | --- |
| 2019.11.01 | Arxiv | Does Adam optimizer keep close to the optimal point? | |

About

The list of papers in speech synthesis
