Skip to content
View jerryyxu's full-sized avatar
🐵
🐵
  • Tech
  • China

Block or report jerryyxu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A machine learning-based lossless video super resolution framework. Est. Hack the Valley II, 2018.

C++ 10,627 997 Updated Oct 31, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,598 447 Updated Jul 30, 2024

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

Python 33,810 10,113 Updated Oct 8, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,038 1,084 Updated Oct 14, 2024

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,867 1,196 Updated Mar 31, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 31,208 3,629 Updated Oct 30, 2024

Brand new TTS solution

Python 13,693 1,026 Updated Oct 30, 2024

Image processing in Python

Python 6,073 2,226 Updated Oct 23, 2024

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,232 5,212 Updated Oct 22, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,651 3,291 Updated Jul 23, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 44,701 5,326 Updated Oct 31, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,012 2,200 Updated Aug 12, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,087 141 Updated Sep 3, 2024

Real-time image and video processing library similar to GPUImage, with built-in beauty filters, achieving commercial-grade beauty effects. Written in C++11 and based on OpenGL/ES.

C++ 1,364 178 Updated Oct 10, 2024

The Places365-CNNs for Scene Classification

Python 1,920 536 Updated Jul 16, 2020

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 1,938 170 Updated Oct 29, 2024

A Python package to stabilize videos using OpenCV

Python 695 120 Updated Jul 19, 2023

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 14,352 1,382 Updated Oct 31, 2024

Perceptual video quality assessment based on multi-method fusion.

Python 4,574 748 Updated Oct 14, 2024

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python 548 60 Updated Oct 4, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 831 56 Updated Oct 29, 2024

Rich is a Python library for rich text and beautiful formatting in the terminal.

Python 49,407 1,719 Updated Oct 31, 2024

Python Imaging Library (Fork)

Python 12,233 2,226 Updated Oct 29, 2024

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 25,303 3,961 Updated Sep 3, 2024

Mirror of https://git.ffmpeg.org/ffmpeg.git

C 45,741 12,136 Updated Oct 31, 2024

The open collection of GL Transitions

GLSL 1,867 301 Updated Jul 4, 2024

A generative speech model for daily dialogue.

Python 31,954 3,480 Updated Oct 21, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 18,325 1,407 Updated Oct 31, 2024

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 16,733 2,668 Updated Jul 26, 2024
Next