Stars
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
An official codebase of Two-Stream Transformer for Multi-Label Image Classification, ACMMM 2022.
Multi-label Classification Using a Variation of VGGNet
This repo is a collection of AWESOME things about 🌟Medical Image Registration🌟, including useful materials, papers, code.
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding. PixelLM is accepted by CVPR 2024.
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
Code release for GECCO: Geometrically-Conditioned Point Diffusion Models
[NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding
Code to access and generate ProciGen dataset, CVPR'24.
Official implementation for Hierarachical Diffusion Model in CVPR24 Template free reconstruction of human object interaction
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
[ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Character Animation (AnimateAnyone, Face Reenactment)
[ECCV 2024] Tokenize Anything via Prompting
Segment Anything in 3D with NeRFs (NeurIPS 2023)
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
The official gpt4free repository | various collection of powerful language models
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
official code of “OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding”
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"