liusongxiang/README.md

Hi there 👋

My research interests span speech and language intelligence, including speech foundation models, large language models (LLMs), text-to-speech synthesis (TTS), voice conversion (VC), singing synthesis, cross-modal representation learning, and audio adversarial attacks and defenses.

My homepage

Google scholar profile

Pinned

  1. StarGAN-Voice-Conversion (Public)

    A PyTorch implementation of the paper "StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks"

    Python, 498 stars, 95 forks

  2. ppg-vc (Public)

    PPG-Based Voice Conversion

    Python, 313 stars, 72 forks

  3. efficient_tts (Public)

    PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"

    Python, 114 stars, 22 forks

  4. BNE-Seq2SeqMoL-VC (Public)

    Demo for "Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling"

    6 stars, 3 forks

  5. diffsvc (Public)

    DiffSVC demo page

    77 stars, 66 forks

  6. Large-Audio-Models (Public)

    Tracks large models in the audio domain, including speech, singing, and music.

    391 stars, 23 forks