HenryZhou7

Henry Zhou HenryZhou7

Computer Engineering Student at University of Toronto <henryzhou> @cs.toronto.edu

31 followers · 79 following

San Francisco, California
https://henryzhou7.github.io

Achievements

Stars

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 5,902 754 Updated Sep 11, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,122 101 Updated Jul 11, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,600 2,092 Updated Jul 18, 2024

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 82,435 6,426 Updated Sep 14, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 4,184 411 Updated Aug 19, 2024

rowanz / hellaswag

HellaSwag: Can a Machine _Really_ Finish Your Sentence?

Python 177 22 Updated May 28, 2020

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,374 255 Updated Jan 27, 2024

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,313 274 Updated Sep 5, 2024

ewan-xu / pyaec

simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link a…

Python 316 97 Updated Nov 29, 2021

Wramberg / adaptfilt

Adaptive filtering module for Python

Python 101 35 Updated Jul 14, 2024

mk-fg / python-pulse-control

Python high-level interface and ctypes-based bindings for PulseAudio (libpulse)

Python 170 36 Updated Aug 27, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

28,259 1,540 Updated Aug 1, 2024

mattingalls / Soundflower

Forked from RogueAmoeba/Soundflower-Original

MacOS system extension that allows applications to pass audio to other applications. Soundflower works on macOS Catalina.

Objective-C 8,858 610 Updated Feb 1, 2021

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,766 217 Updated Sep 1, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 11,370 950 Updated Aug 21, 2024

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,440 196 Updated Aug 1, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 32,512 3,746 Updated Sep 13, 2024

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C 34,406 3,498 Updated Sep 15, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 10,906 1,827 Updated Sep 5, 2024

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 845 45 Updated Sep 13, 2024

mistralai / mistral-finetune

Python 2,652 210 Updated Sep 13, 2024

0nutation / SpeechGPT

SpeechGPT Series: Speech Large Language Models

Python 1,218 81 Updated Jul 22, 2024

mit-han-lab / llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,330 177 Updated Jul 16, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 11,871 836 Updated Sep 13, 2024

kenjihiranabe / The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 17,630 2,149 Updated Feb 4, 2024

PlainTextTools / plain-text-table

JavaScript 116 24 Updated Apr 27, 2023

ml-explore / mlx-examples

Examples in the MLX framework

Python 5,833 828 Updated Sep 14, 2024

geekan / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 43,597 5,194 Updated Aug 21, 2024

microsoft / JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,547 1,959 Updated Apr 24, 2024

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 53,858 5,564 Updated Aug 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Henry Zhou HenryZhou7

Achievements

Achievements

Block or report HenryZhou7

Stars

pyannote / pyannote-audio

descriptinc / descript-audio-codec

facebookresearch / audiocraft

yt-dlp / yt-dlp

huggingface / parler-tts

rowanz / hellaswag

lucidrains / audiolm-pytorch

MahmoudAshraf97 / whisper-diarization

ewan-xu / pyaec

Wramberg / adaptfilt

mk-fg / python-pulse-control

karpathy / LLM101n

mattingalls / Soundflower

ufal / whisper_streaming

SYSTRAN / faster-whisper

Camb-ai / MARS5-TTS

RVC-Boss / GPT-SoVITS

ggerganov / whisper.cpp

PaddlePaddle / PaddleSpeech

fixie-ai / ultravox

mistralai / mistral-finetune

0nutation / SpeechGPT

mit-han-lab / llm-awq

OpenBMB / MiniCPM-V

kenjihiranabe / The-Art-of-Linear-Algebra

PlainTextTools / plain-text-table

ml-explore / mlx-examples

geekan / MetaGPT

microsoft / JARVIS

labmlai / annotated_deep_learning_paper_implementations