Skip to content
View needsee's full-sized avatar
Block or Report

Block or report needsee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 41 7 Updated Jul 2, 2023

This is the official implemntation for "Multi-scale spatial temporal graph convolutional network for skeleton-based action recognition" AAAI-2021

Python 27 7 Updated May 25, 2022

official implementation for Language Supervised Training for Skeleton-based Action Recognition

Python 89 10 Updated Sep 6, 2023

Task Residual for Tuning Vision-Language Models (CVPR 2023)

Python 62 6 Updated May 27, 2023
Python 159 14 Updated May 10, 2023

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,016 418 Updated Nov 29, 2023

Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)

Jupyter Notebook 37 7 Updated May 16, 2022

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,490 599 Updated May 20, 2024

GLoRIA: A Multimodal Global-Local Representation Learning Framework forLabel-efficient Medical Image Recognition

Python 157 27 Updated Feb 6, 2023
Python 506 41 Updated Nov 28, 2023
Python 430 19 Updated Jul 19, 2022

A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]

Python 284 15 Updated Jul 11, 2024

[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Python 505 36 Updated Sep 15, 2023

The efficient tuning method for VLMs

Python 66 1 Updated Mar 10, 2024

Grounded Language-Image Pre-training

Python 1 Updated May 8, 2023

Grounded Language-Image Pre-training

Python 2,084 186 Updated Jan 24, 2024

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

Python 201 15 Updated May 30, 2024

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Python 482 58 Updated Dec 6, 2023

ROLO is short for Recurrent YOLO, aimed at simultaneous object detection and tracking

Python 879 278 Updated Nov 1, 2016

CVPR 2024 论文和开源项目合集

17,273 2,547 Updated Jul 4, 2024

LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking

Python 720 142 Updated May 7, 2020

A curated list of action recognition and related area resources

3,744 722 Updated May 13, 2023

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"

Python 1,274 168 Updated Nov 3, 2023

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Cuda 4,294 908 Updated Dec 14, 2022
Python 29 2 Updated Jun 28, 2023

Jupyter notebook tutorials for mmpose

Jupyter Notebook 303 53 Updated Jun 7, 2023

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,133 104 Updated Jul 19, 2024

A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximu…

Python 1,885 343 Updated Jul 21, 2023
Python 2 1 Updated Jul 21, 2023

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 5,375 1,180 Updated Jul 17, 2024
Next