Skip to content
View guxm2021's full-sized avatar
๐ŸŽ†
Focusing
๐ŸŽ†
Focusing
Block or Report

Block or report guxm2021

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
guxm2021/README.md

Hi there ๐Ÿ‘‹

I am Xiangming Gu. You can also call me Brian. I am currently a third-year Ph.D. candidate from NUS Sound and Music Computing Lab, where I am supervised by Prof. Ye Wang. I am affilated to Integrative Sciences and Engineering Programme and School of Computing at National University of Singapore. Before that, I obtained my B.E. degree of Electronic Engineering and B.S. degree of Finance at Tsinghua University.

My research interests include two directions: (i) fundamental research for generative models and (multimodal) large language models; (ii) application of machine learning, e.g. multimodal learning, multi-distribution learning (domain adaptation), and trustworthy machine learning (fairness, memorization), to singing/speech techniques.

Visit my personal website.

Xiangming's GitHub stats

Pinned

  1. sail-sg/Agent-Smith sail-sg/Agent-Smith Public

    [ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

    Python 60 7

  2. sail-sg/DiffMemorize sail-sg/DiffMemorize Public

    On Memorization in Diffusion Models

    Python 20 2

  3. ALT_SpeechBrain ALT_SpeechBrain Public

    [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription

    Python 38 6

  4. SVT_SpeechBrain SVT_SpeechBrain Public

    [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing

    Python 16 3

  5. MM_ALT MM_ALT Public

    [MM 2022 Oral] MM-ALT: A Multimodal Automatic Lyric Transcription System

    Python 13

  6. ECBNN ECBNN Public

    [NeurIPS 2022] Extrapolative Continuous-time Bayesian Neural Network for Fast Training-free Test-time Adaptation

    Python 8 1