-
Institute of Automation, Chinese Academy of Sciences
- Beijing
- https://scholar.google.com.hk/citations?user=4pHKj8kAAAAJ&hl=zh-CN
Block or Report
Block or report TXH-mercury
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
A progressive, highly extensible and developer-friendly framework for building deep learning projects based on PyTorch.
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Official repository of the paper "Are current long-term video understanding datasets long-term?", published in CVEU 2023.
Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAI'2020).
Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner"
ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.
Tracking and collecting papers/projects/others related to Segment Anything.
✨✨Latest Advances on Multimodal Large Language Models
[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models
Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
Moved to https://github.com/nodejs/node
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
[ECCV 2020] PyTorch Implementation of some RGBD Semantic Segmentation models.
Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019
RainHxj / DANet
Forked from junfu1115/DANetDual Attention Network for Scene Segmentation (CVPR2019)