Skip to content
View jacobswan1's full-sized avatar
👋
👋
  • Amazon Alexa AI.
  • San Jose
Block or Report

Block or report jacobswan1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository for the paper PLLaVA

Python 529 35 Updated Jul 28, 2024

a comfyui custom node for I2V-Adapter

Python 20 2 Updated Jul 2, 2024

I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models

Python 151 6 Updated Jun 18, 2024

[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts

Python 240 8 Updated Jun 9, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 47,264 4,985 Updated Aug 19, 2024

Experiment on combining CLIP with SAM to do open-vocabulary image segmentation.

Jupyter Notebook 326 28 Updated Apr 5, 2023

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)

Python 196 14 Updated Jul 1, 2024

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation

Jupyter Notebook 174 15 Updated Jul 11, 2024

Create Magic Story!

Jupyter Notebook 5,678 549 Updated Jul 24, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 4,827 313 Updated Jun 28, 2024

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 10,721 783 Updated Jul 18, 2024

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 1,034 61 Updated May 23, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 2,631 184 Updated Aug 15, 2024

Mora: More like Sora for Generalist Video Generation

Python 1,463 91 Updated Jun 21, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,281 2,035 Updated Aug 9, 2024

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 474 18 Updated Jun 26, 2024

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

Python 213 17 Updated Jul 21, 2023

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,785 199 Updated Jul 27, 2024
Python 550 27 Updated Feb 15, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,246 78 Updated Aug 18, 2024

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,238 70 Updated Jul 29, 2024

Official implementation of ICCV2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

Python 246 25 Updated Sep 20, 2023

[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"

Python 1,200 86 Updated Mar 20, 2024

Nightly release of ControlNet 1.1

Python 4,597 369 Updated Aug 8, 2024
Python 12 Updated Jul 25, 2023

Official codebase for the Paper “Retrieval-Augmented Diffusion Models”

Jupyter Notebook 111 7 Updated Apr 5, 2023

Official implementation of DreaMoving

1,788 99 Updated Jan 9, 2024
Next