Skip to content
View KunpengLi1994's full-sized avatar

Block or report KunpengLi1994

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Kandinsky 2 — multilingual text2image latent diffusion model

Jupyter Notebook 2,757 308 Updated May 1, 2024

[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio

Python 385 42 Updated May 29, 2023

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Python 1,314 82 Updated Aug 10, 2023

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Python 730 36 Updated Nov 16, 2023

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Jupyter Notebook 991 58 Updated Sep 21, 2023

Unified Controllable Visual Generation Model

Python 615 35 Updated Apr 22, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,344 199 Updated Oct 26, 2024

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Jupyter Notebook 1,107 106 Updated Aug 14, 2023

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Python 4,239 385 Updated Oct 25, 2023

WebUI extension for ControlNet

Python 16,992 1,957 Updated Aug 12, 2024

A Close Look at Spatial Modeling: From Attention to Convolution

Python 91 5 Updated Dec 27, 2022

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Jupyter Notebook 688 61 Updated Oct 17, 2023

This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.

Python 64 5 Updated Dec 20, 2021

PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"

Python 292 47 Updated Jan 14, 2020

[NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation

Python 39 4 Updated Mar 24, 2023

📚 A collection of Deep Learning based Image Colorization and Video Colorization papers.

993 105 Updated Oct 12, 2024

A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch

Python 1,981 256 Updated Jul 17, 2023

[TPAMI 2023] Generative Multi-Label Zero-Shot Learning

Python 51 14 Updated Jul 12, 2023

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)

Python 156 16 Updated Feb 20, 2023

Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)

Python 29 3 Updated Aug 3, 2022

PyTorch code for the CVPR'2020 paper "Screencast Tutorial Video Understanding"

Jupyter Notebook 4 1 Updated Sep 18, 2020

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Jupyter Notebook 1,784 241 Updated Jan 24, 2024

Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)

Python 73 12 Updated Dec 6, 2023

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

3,374 396 Updated May 24, 2023

This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.

Python 207 50 Updated Nov 16, 2022

The official implementation of CFBI(+): Collaborative Video Object Segmentation by (Multi-scale) Foreground-Background Integration.

Python 322 43 Updated Jan 18, 2023

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,593 3,284 Updated Jul 23, 2024

A series of basic algorithms that are useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS) and so on.

Python 832 176 Updated Aug 3, 2023

Global Reasoning module for visual recognition

Python 206 52 Updated Oct 12, 2021

PyTorch code for ICLR 2021 paper Unbiased Teacher for Semi-Supervised Object Detection

Python 414 83 Updated Sep 10, 2021
Next