hbwu-ntu

Follow

hbwu-ntu

Follow

🏠 Ph.D. student at NTU working on speech processing and machine learning. 💻 Contributor of S3PRL.

122 followers · 119 following

National Taiwan University
Seattle, WA, US
https://hbwu-ntu.github.io/
in/haibin-wu-479a39252
https://scholar.google.com/citations?user=-bB-WHEAAAAJ&hl=zh-TW

Achievements

Achievements

Highlights

Pro

Block or Report

Block or report hbwu-ntu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Stars

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 197 7 Updated Jul 28, 2024

ChenLiu-1996 / CitationMap

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 45 6 Updated Jul 30, 2024

lucidrains / autoregressive-diffusion-pytorch

Implementation of Autoregressive Diffusion in Pytorch

Python 190 1 Updated Jul 30, 2024

roger-tseng / CodecDetect

Fake speech detection with the CodecFake dataset

Python 3 Updated Jul 27, 2024

binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 62,729 7,796 Updated Jul 24, 2024

KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

Python 996 88 Updated Feb 13, 2024

OFA-Sys / AIR-Bench

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Python 11 Updated Jul 17, 2024

hu-po / docs

documentation for content creation

99 10 Updated Jul 26, 2024

leo19941227 / speech_metrics

A lightweight package for some common metrics used in speech

Python 3 Updated Jul 27, 2024

meta-llama / llama-models

Utilities intended for use with Llama models.

Python 2,821 378 Updated Jul 30, 2024

AudioLLMs / AudioBench

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 47 Updated Jul 29, 2024

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,462 281 Updated Jul 29, 2024

microsoft / LongRoPE

LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.

Python 43 Updated Jul 25, 2024

lucidrains / rectified-flow-pytorch

Implementation of rectified flow and some of its followup research / improvements in Pytorch

Python 111 2 Updated Jul 24, 2024

speechcraft2024 / speechcraft2024

Ruby 5 1 Updated Jun 18, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 29,944 3,448 Updated Jul 29, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

491 9 Updated Jul 22, 2024

y-ren16 / TiCodec

Python 37 2 Updated Dec 19, 2023

vasistalodagala / whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Python 201 47 Updated May 23, 2023

mini-sora / minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Python 1,135 146 Updated Jun 1, 2024

facebookresearch / MobileLLM

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 868 42 Updated Jul 19, 2024

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,286 184 Updated Jul 20, 2024

RoyChao19477 / SEMamba

This is the official implementation of the SEMamba paper.

Python 98 8 Updated Jul 20, 2024

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 179 14 Updated Jul 27, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 1,730 164 Updated Jul 29, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 2,903 277 Updated Jul 29, 2024

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 6,022 536 Updated Jul 29, 2024

NUS-HPC-AI-Lab / OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Python 1,373 89 Updated Jul 26, 2024

lifeiteng / naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 134 8 Updated Apr 20, 2024

speechbrain / benchmarks

This repository contains the SpeechBrain Benchmarks

Python 77 33 Updated Jul 25, 2024