Skip to content
View WenOOI's full-sized avatar
  • Tsinghua University

Block or report WenOOI

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.

Python 33 3 Updated Sep 24, 2024

Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Jupyter Notebook 303 17 Updated Oct 6, 2024

Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars

Python 78 8 Updated Nov 4, 2024

A latent text-to-image diffusion model

Jupyter Notebook 68,342 10,160 Updated Jun 18, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,173 5,394 Updated Nov 13, 2024

DeepFake Face Datasets. Code accompanying the paper "Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models".

Python 40 2 Updated Sep 8, 2023

[VISAPP2024] Towards the Detection of Diffusion Model Deepfakes

Python 85 10 Updated Apr 19, 2024

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

Jupyter Notebook 2,124 297 Updated Nov 13, 2024

Image-to-Image Translation in PyTorch

Python 23,078 6,313 Updated May 14, 2024

Non-local Neural Networks for Video Classification

Python 1,975 323 Updated Sep 15, 2021

Convolutional neural network model for video classification trained on the Kinetics dataset.

Python 1,741 461 Updated Sep 12, 2019

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101

Jupyter Notebook 939 215 Updated Dec 7, 2020

I3D Nonlocal ResNets in Pytorch

Python 246 39 Updated Mar 26, 2022
Python 980 250 Updated Jun 28, 2020

Video classification tools using 3D ResNet

Python 1,102 260 Updated Nov 23, 2018

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,880 3,310 Updated Jul 23, 2024

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Python 1,554 212 Updated Apr 9, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,221 2,174 Updated Aug 9, 2024

Unofficial implementation of FSD50k baselines for Sound Event Recognition

Python 24 6 Updated Apr 27, 2024

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,622 1,214 Updated Aug 13, 2024

Sequence modeling benchmarks and temporal convolutional networks

Python 4,169 877 Updated Mar 28, 2022

Point cloud diffusion for 3D model synthesis

Python 6,533 760 Updated Jul 4, 2024

[CVPR 2024] Text-to-3D using Gaussian Splatting

Python 787 48 Updated Jan 7, 2024

Retinaface get 80.99% in widerface hard val using mobilenet0.25.

Python 2,628 772 Updated Jun 28, 2023

🧠 A PyTorch implementation of 'Deep CORAL: Correlation Alignment for Deep Domain Adaptation.', ECCV 2016

Python 226 42 Updated Apr 22, 2021

This is the implementation for the NeurIPS 2022 paper: ZIN: When and How to Learn Invariance Without Environment Partition?

Python 22 5 Updated Dec 3, 2022

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,559 1,558 Updated Oct 19, 2024

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Python 356 27 Updated Apr 23, 2024

[ICCV 2021] Released code for Causal Attention for Unbiased Visual Recognition

Python 76 9 Updated Dec 1, 2023
Next