-
Beijing University Of Posts and Telecommunications
- China Beijing
-
07:42
(UTC -12:00) - [email protected]
- https://www.bupt.edu.cn/
Block or Report
Block or report needsee
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
This is the official implemntation for "Multi-scale spatial temporal graph convolutional network for skeleton-based action recognition" AAAI-2021
official implementation for Language Supervised Training for Skeleton-based Action Recognition
Task Residual for Tuning Vision-Language Models (CVPR 2023)
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
GLoRIA: A Multimodal Global-Local Representation Learning Framework forLabel-efficient Medical Image Recognition
A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]
[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
hhaAndroid / GLIP
Forked from microsoft/GLIPGrounded Language-Image Pre-training
【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
ROLO is short for Recurrent YOLO, aimed at simultaneous object detection and tracking
LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking
A curated list of action recognition and related area resources
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
Jupyter notebook tutorials for mmpose
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximu…
OpenMMLab Pose Estimation Toolbox and Benchmark.