Skip to content
View long-wa's full-sized avatar

Block or report long-wa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)

Jupyter Notebook 616 74 Updated Dec 8, 2022

Official code of CVPR 2023 Highlight paper CVT-SLR

Python 68 4 Updated Dec 23, 2023

This is the official code implementation for 'What, How, and When Should Object Detectors Update in Continually Changing Test Domains?' presented at CVPR 2024.

Python 11 Updated Jul 19, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,877 188 Updated Sep 19, 2024

[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.

Cuda 1,194 138 Updated Jul 31, 2024

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Python 2,045 296 Updated Oct 4, 2023

Official PyTorch implementation of ICCV 2019 paper "DPOD: 6D Pose Object Detector and Refiner"

Python 56 14 Updated Jan 1, 2021

Pytorch implementation of Generated Image Quality Assessment

Python 215 32 Updated Nov 20, 2021

Code for ALBEF: a new vision-language pre-training method

Python 1,532 195 Updated Sep 20, 2022

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 4,747 779 Updated Oct 2, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,711 625 Updated Aug 5, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,742 954 Updated Aug 23, 2024

PyTorch extensions for high performance and large scale training.

Python 3,168 279 Updated Aug 30, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,151 801 Updated Aug 20, 2024

PyTorch DDPM implementation

Python 641 106 Updated May 23, 2022
Python 230 33 Updated Aug 22, 2024

Simple Implementation of Pix2Seq model for object detection in PyTorch

Python 117 16 Updated Sep 2, 2023

Image-to-image translation with conditional adversarial nets

Lua 10,118 1,706 Updated Jun 6, 2021

the pytorch version of pix2pix

Python 30 6 Updated Nov 5, 2019

A technical report on convolution arithmetic in the context of deep learning

TeX 13,995 2,283 Updated Jun 8, 2023
Python 32 6 Updated Apr 10, 2024

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Python 3,526 591 Updated Sep 19, 2023

This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".

Python 31 3 Updated Jun 24, 2024

A PyTorch-based library for semi-supervised learning (NeurIPS'21)

Python 1,291 186 Updated Aug 28, 2023

[CVPR'22 Oral] TTVSR: Learning Trajectory-Aware Transformer for Video Super-Resolution

Python 199 13 Updated Jul 24, 2022

[CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"

Python 485 39 Updated May 22, 2023
Python 124 5 Updated Jun 25, 2024

CVPR 2024 论文和开源项目合集

17,901 2,574 Updated Jul 4, 2024

Explainability for Vision Transformers

Python 833 96 Updated Mar 12, 2022

[ECCV'22] The official PyTorch implementation of our ECCV 2022 paper: "AiATrack: Attention in Attention for Transformer Visual Tracking".

Python 105 10 Updated Dec 30, 2023
Next