hbwu-ntu

hbwu-ntu

🏠 Ph.D. student at NTU working on speech processing and machine learning. 💻 Contributor of S3PRL.

137 followers · 135 following

National Taiwan University
Seattle, WA, US
https://hbwu-ntu.github.io/
in/haibin-wu-479a39252
https://scholar.google.com/citations?user=-bB-WHEAAAAJ&hl=zh-TW

Achievements

Highlights

Stars

374 stars written in Python

Clear filter

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 167,305 44,201 Updated Oct 8, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 69,054 8,125 Updated Sep 30, 2024

binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 64,598 7,980 Updated Oct 7, 2024

meta-llama / llama

Inference code for Llama models

Python 55,899 9,517 Updated Aug 18, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,603 4,519 Updated Oct 6, 2024

XingangPan / DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,656 3,448 Updated May 18, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,009 4,060 Updated Oct 8, 2024

chenfei-wu / TaskMatrix

Python 34,519 3,315 Updated Jan 6, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 34,431 4,165 Updated Aug 16, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,730 3,866 Updated Oct 2, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 32,016 3,925 Updated Oct 8, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 31,285 3,388 Updated Sep 21, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,281 6,387 Updated Oct 3, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.

Python 29,047 2,837 Updated Aug 21, 2024

acheong08 / ChatGPT

Reverse engineered ChatGPT API

Python 28,009 4,477 Updated Aug 2, 2023

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 26,529 2,999 Updated Aug 12, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,335 2,912 Updated Sep 2, 2024

microsoft / JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,584 1,964 Updated Sep 26, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 21,795 2,115 Updated Aug 9, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,713 2,111 Updated Jul 18, 2024

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 19,962 2,994 Updated Oct 4, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,636 2,161 Updated Aug 12, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,033 1,573 Updated Oct 8, 2024

THUDM / ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,375 1,553 Updated Jul 10, 2024

fishaudio / fish-speech

Brand new TTS solution

Python 13,022 970 Updated Oct 8, 2024

state-spaces / mamba

Mamba SSM architecture

Python 12,777 1,077 Updated Oct 7, 2024

BlinkDL / RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,489 849 Updated Sep 23, 2024

OpenMOSS / MOSS

An open-source tool-augmented conversational language model from Fudan University

Python 11,926 1,145 Updated Jul 13, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,809 1,241 Updated Aug 21, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 11,755 977 Updated Aug 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hbwu-ntu

Achievements

Achievements

Highlights

Block or report hbwu-ntu

Stars

Significant-Gravitas / AutoGPT

openai / whisper

binary-husky / gpt_academic

meta-llama / llama

lm-sys / FastChat

XingangPan / DragGAN

microsoft / DeepSpeed

chenfei-wu / TaskMatrix

coqui-ai / TTS

RVC-Boss / GPT-SoVITS

hiyouga / LLaMA-Factory

2noise / ChatTTS

facebookresearch / fairseq

myshell-ai / OpenVoice

acheong08 / ChatGPT

meta-llama / llama3

Vision-CAIR / MiniGPT-4

microsoft / JARVIS

hpcaitech / Open-Sora

facebookresearch / audiocraft

lucidrains / vit-pytorch

haotian-liu / LLaVA

huggingface / peft

THUDM / ChatGLM3

fishaudio / fish-speech

state-spaces / mamba

BlinkDL / RWKV-LM

OpenMOSS / MOSS

m-bain / whisperX

SYSTRAN / faster-whisper