Stars
Code implementing some ideas for improving object detection scripts; see readme.md for details.
Y-HuiMing-Y / CLIP
Forked from BYMUST/CTP
[ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
The official GitHub page for the survey paper "A Survey of Large Language Models".
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab