WangHelin1997

Follow

🎯

Focusing

Helin Wang WangHelin1997

🎯

Focusing

Follow

A PhD candidate at Johns Hopkins University, interested in AI for Audio & Speech Processing.

216 followers · 62 following

THU & PKU & JHU
Baltimore, US
https://wanghelin1997.github.io/helinwang
in/helin-wang-2a74671b3
https://scholar.google.com/citations?user=I_V0zBMAAAAJ
https://huggingface.co/OpenSound

Achievements

Achievements

Highlights

Pro

Pinned Loading

CapSpeech CapSpeech Public

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

Jupyter Notebook 347 42
SoloSpeech SoloSpeech Public

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Python 234 27
SSR-Speech SSR-Speech Public

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Python 135 15
SoloAudio SoloAudio Public

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.

Python 95 10
SpeechTasks SpeechTasks Public

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.

77 7
MaskSpec MaskSpec Public

The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training

Python 42 8