Skip to content
View zhenye234's full-sized avatar
๐Ÿ‰
๐Ÿ‰
  • Hong Kong University of Science and Technology
  • Hong Kong

Highlights

  • Pro

Block or report zhenye234

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
zhenye234/README.md
  • ๐Ÿ‘‹ Hi, Iโ€™m Ye Zhen, a PhD student at HKUST.
  • ๐Ÿ‘€ Iโ€™m interested in audio generation, speech synthesis and speech LLM.

Popular repositories Loading

  1. CoMoSpeech CoMoSpeech Public

    ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

    Python 195 20

  2. LLaSA_training LLaSA_training Public

    LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis

    Python 147 14

  3. xcodec xcodec Public

    AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

    Python 136 6

  4. FlashSpeech FlashSpeech Public

    ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis

    Python 120 7

  5. X-Codec-2.0 X-Codec-2.0 Public

    Codec for paper: LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis

    Python 109 10

  6. LLaSA_inference LLaSA_inference Public

    9