Stars
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
A lightweight library for portable low-level GPU computation using WebGPU.
uBlock Origin filter list to hide YouTube Shorts
Official inference repo for FLUX.1 models
High-resolution models for human tasks.
Build and share delightful machine learning apps, all in Python. ๐ Star to support our work!
[Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers
ComfyUI nodes to use segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use thโฆ
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Noise supression using deep filtering
FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs ๐ ๐ ๐
[ECCV2024] An official pytorch implement of the paper "MambaIR: A simple baseline for image restoration with state-space model".
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Automate the tedious development tasks with AI
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
World's Best AI Aimbot - CS2, Valorant, Fortnite, APEX, every game
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.
Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Whisper realtime streaming for long speech-to-text transcription and translation