Stars
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Factify is a Multi-Modal Fact Verification dataset released for a shared task as part of the De-Factify workshop in AAAI-21.
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
南开大学硕士毕业论文/博士论文模板 (Latex Template for Nankai University)
A curated list of awesome papers on dataset distillation and related applications.
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
Open Source Neural Architecture Search Toolbox for Device-aware Image Dense Prediction & Official implementation of ICCV2021 "iNAS: Integral NAS for Device-Aware Salient Object Detection"
Neural Machine Translation with universal Visual Representation (ICLR 2020)
A PyTorch reimplementation of FCSN in paper "Video Summarization Using Fully Convolutional Sequence Networks"
A PyTorch implementation of SimCLR based on ICML 2020 paper "A Simple Framework for Contrastive Learning of Visual Representations"
SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
gwy-nk / visual7w-toolkit
Forked from yukezhu/visual7w-toolkitToolkit for Visual7W visual question answering dataset
Convolutional Neural Network for Text Classification in Tensorflow
gwy-nk / tensorflow
Forked from tensorflow/tensorflowComputation using data flow graphs for scalable machine learning
A Tensorflow implementation of CapsNet(Capsules Net) in Hinton's paper Dynamic Routing Between Capsules
Pytorch 3.6 implementation VQA2017 cvpr winner
gwy-nk / iQAN
Forked from yikang-li/iQANVisaul Question Generation as Dual Task of Visual Question Answering (PyTorch Version)
gwy-nk / ARN
Forked from GingL/ARNAdaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding
gwy-nk / mcan-vqa
Forked from MILVLG/mcan-vqaDeep Modular Co-Attention Networks for Visual Question Answering