WenOOI

1 follower · 0 following

Tsinghua University

Stars

bzluan / TextCoT

The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.

Python 33 3 Updated Sep 24, 2024

neeek2303 / EMOPortraits

Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Jupyter Notebook 303 17 Updated Oct 6, 2024

johndpope / MegaPortrait-hack

Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars

Python 78 8 Updated Nov 4, 2024

Ekko-zn / AIGCDetectBenchmark

Python 229 23 Updated Mar 10, 2024

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 68,342 10,160 Updated Jun 18, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,173 5,394 Updated Nov 13, 2024

OpenRL-Lab / DeepFakeFace

DeepFake Face Datasets. Code accompanying the paper "Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models".

Python 40 2 Updated Sep 8, 2023

jonasricker / diffusion-model-deepfake-detection

[VISAPP2024] Towards the Detection of Diffusion Model Deepfakes

Python 85 10 Updated Apr 19, 2024

FurkanGozukara / Stable-Diffusion

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

Jupyter Notebook 2,124 297 Updated Nov 13, 2024

junyanz / pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Python 23,078 6,313 Updated May 14, 2024

facebookresearch / video-nonlocal-net

Non-local Neural Networks for Video Classification

Python 1,975 323 Updated Sep 15, 2021

google-deepmind / kinetics-i3d

Convolutional neural network model for video classification trained on the Kinetics dataset.

Python 1,741 461 Updated Sep 12, 2019

HHTseng / video-classification

Tutorial for video classification/action recognition using 3D CNN and CNN+RNN on UCF101

Jupyter Notebook 939 215 Updated Dec 7, 2020

Tushar-N / pytorch-resnet3d

I3D Nonlocal ResNets in Pytorch

Python 246 39 Updated Mar 26, 2022

piergiaj / pytorch-i3d

Python 980 250 Updated Jun 28, 2020

kenshohara / video-classification-3d-cnn-pytorch

Video classification tools using 3D ResNet

Python 1,102 260 Updated Nov 23, 2018

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 25,880 3,310 Updated Jul 23, 2024

facebookresearch / TimeSformer

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Python 1,554 212 Updated Apr 9, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 22,221 2,174 Updated Aug 9, 2024

SarthakYadav / fsd50k-pytorch

Unofficial implementation of FSD50k baselines for Sound Event Recognition

Python 24 6 Updated Apr 27, 2024

facebookresearch / SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,622 1,214 Updated Aug 13, 2024

locuslab / TCN

Sequence modeling benchmarks and temporal convolutional networks

Python 4,169 877 Updated Mar 28, 2022

openai / point-e

Point cloud diffusion for 3D model synthesis

Python 6,533 760 Updated Jul 4, 2024

gsgen3d / gsgen

[CVPR 2024] Text-to-3D using Gaussian Splatting

Python 787 48 Updated Jan 7, 2024

biubug6 / Pytorch_Retinaface

RetinaFace reaches 80.99% on the WIDER FACE hard validation set using MobileNet0.25.

Python 2,628 772 Updated Jun 28, 2023

SSARCandy / DeepCORAL

🧠 A PyTorch implementation of 'Deep CORAL: Correlation Alignment for Deep Domain Adaptation.', ECCV 2016

Python 226 42 Updated Apr 22, 2021

linyongver / ZIN_official

This is the implementation for the NeurIPS 2022 paper: ZIN: When and How to Learn Invariance Without Environment Partition?

Python 22 5 Updated Dec 3, 2022

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 10,559 1,558 Updated Oct 19, 2024

rshaojimmy / MultiModal-DeepFake

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Python 356 27 Updated Apr 23, 2024

Wangt-CN / CaaM

[ICCV 2021] Released code for Causal Attention for Unbiased Visual Recognition

Python 76 9 Updated Dec 1, 2023
