Lists (1)
Sort Name ascending (A-Z)
Stars
[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
[CVPR'23] Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
This repo includes ChatGPT prompt curation to use ChatGPT better.
ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利
The official gpt4free repository | various collection of powerful language models
Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
Official Code for DragGAN (SIGGRAPH 2023)
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
A community supported Windows build for jax.
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Google, Naver multiprocess image web crawler (Selenium)
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
A curated list of prompt-based paper in computer vision and vision-language learning.