Stars
Corruption and Perturbation Robustness (ICLR 2019)
Image to prompt with BLIP and CLIP
Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desired target dataset.
Unofficial Python client library for Semantic Scholar APIs.
A curated list of Composable AI methods: Building AI systems by composing modules.
LAVIS - A One-stop Library for Language-Vision Intelligence
Official implementation of "Segment Any Anomaly without Training via Hybrid Prompt Regularization (SAA+)".
Segment-anything related awesome extensions/projects/repos.
Language Models Can See: Plugging Visual Controls in Text Generation
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
PyTorch implementation of ARKitTrack for CVPR'2023 paper "ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data", by Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu. Code will be…
Painter & SegGPT Series: Vision Foundation Models from BAAI
EVA Series: Visual Representation Fantasies from BAAI
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
[PR] Complementary Pseudo Multimodal Feature for Point Cloud Anomaly Detection
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Tactile Sensing and Simulation; Visual Tactile Manipulation; Open Source.
Optimized Stable Diffusion modified to run on lower GPU VRAM
High-Resolution Image Synthesis with Latent Diffusion Models
Hackable and optimized Transformers building blocks, supporting a composable construction.
OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023
PyTorch implementation of a collection of scalable Video Transformer Benchmarks.
🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real time