皆虚 jerryyxu

🐵

Everything is nothing

10 followers · 7 following

Tech
China

Stars

k4yt3x / video2x

A machine learning-based lossless video super resolution framework. Est. Hack the Valley II, 2018.

C++ 10,627 997 Updated Oct 31, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,598 447 Updated Jul 30, 2024

hankcs / HanLP

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Python 33,810 10,113 Updated Oct 8, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,038 1,084 Updated Oct 14, 2024

tyiannak / pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,867 1,196 Updated Mar 31, 2024

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 31,208 3,629 Updated Oct 30, 2024

fishaudio / fish-speech

Brand new TTS solution

Python 13,693 1,026 Updated Oct 30, 2024

scikit-image / scikit-image

Image processing in Python

Python 6,073 2,226 Updated Oct 23, 2024

babysor / MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Python 35,232 5,212 Updated Oct 22, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,651 3,291 Updated Jul 23, 2024

geekan / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 44,701 5,326 Updated Oct 31, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,012 2,200 Updated Aug 12, 2024

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,087 141 Updated Sep 3, 2024

pixpark / gpupixel

Real-time image and video processing library similar to GPUImage, with built-in beauty filters, achieving commercial-grade beauty effects. Written in C++11 and based on OpenGL/ES.

C++ 1,364 178 Updated Oct 10, 2024

CSAILVision / places365

The Places365-CNNs for Scene Classification

Python 1,920 536 Updated Jul 16, 2020

chaofengc / IQA-PyTorch

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 1,938 170 Updated Oct 29, 2024

AdamSpannbauer / python_video_stab

A Python package to stabilize videos using OpenCV

Python 695 120 Updated Jul 19, 2023

ItzCrazyKns / Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 14,352 1,382 Updated Oct 31, 2024

Netflix / vmaf

Perceptual video quality assessment based on multi-method fusion.

Python 4,574 748 Updated Oct 14, 2024

Vision-CAIR / MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python 548 60 Updated Oct 4, 2024

DAMO-NLP-SG / VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 831 56 Updated Oct 29, 2024

andrewyng / translation-agent

Python 4,735 542 Updated Aug 4, 2024

Textualize / rich

Rich is a Python library for rich text and beautiful formatting in the terminal.

Python 49,407 1,719 Updated Oct 31, 2024

python-pillow / Pillow

Python Imaging Library (Fork)

Python 12,233 2,226 Updated Oct 29, 2024

mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 25,303 3,961 Updated Sep 3, 2024

FFmpeg / FFmpeg

Mirror of https://git.ffmpeg.org/ffmpeg.git

C 45,741 12,136 Updated Oct 31, 2024

gl-transitions / gl-transitions

The open collection of GL Transitions

GLSL 1,867 301 Updated Jul 4, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 31,954 3,480 Updated Oct 21, 2024

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—foundation models

Python 18,325 1,407 Updated Oct 31, 2024

harry0703 / MoneyPrinterTurbo

Generate high-definition short videos with one click using large AI models.

Python 16,733 2,668 Updated Jul 26, 2024
