Skip to content
View ygyuan's full-sized avatar
Block or Report

Block or report ygyuan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ygyuan/README.md

Hi everyone, This is Yougen Yuan, Welcome to my personal homepage. I’m interested in

  • speech keyword retrival / spotting / search
  • speech recognition / speaker recognition / audio scene classication / speech language identification
  • audio / visual / text similarity
  • audio-visual multi-modal
  • image-text / video-text multi-modal
  • large language models

If you are interested in my works, please feel free to reach me by 📫 [email protected]

Popular repositories Loading

  1. Speech-keyword-verification Speech-keyword-verification Public

    Verifying Deep Keyword Spotting Detection with Acoustic Word Embeddings

    Python 1 1

  2. wenet wenet Public

    Forked from wenet-e2e/wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

    C++ 1

  3. kaldi-baseline kaldi-baseline Public

    Forked from VKW2021/kaldi-baseline

    kaldi cnn-tdnnf baseline

    Shell 1

  4. ygyuan ygyuan Public

    Config files for my GitHub profile.

  5. k2 k2 Public

    Forked from k2-fsa/k2

    FSA/FST algorithms, differentiable, with PyTorch compatibility.

    Cuda

  6. FunASR FunASR Public

    Forked from modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit

    Python