Lists (3)
Sort Name ascending (A-Z)
Starred repositories
A suite of image and video neural tokenizers
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
[NeurIPS'2024]: DiffGS: Functional Gaussian Splatting Diffusion
[ECCV'2024] Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...
DepthSplat: Connecting Gaussian Splatting and Depth
Text4Seg: Reimagining Image Segmentation as Text Generation
Code release for ConvNeXt V2 model
Official Implemention for "CDNeXt: Remote Sensing Image Change Detection Based on Temporospatial Interactive Attention Module"
[CVPR 2024] Code release for TransNeXt model
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"
The codes for TCFormer in paper: Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
UnrealCV: Connecting Computer Vision to Unreal Engine
CVPR2023: Vector Quantization with Self-Attention for Quality-Independent Representation Learning.
[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption