Skip to content
View tsujuifu's full-sized avatar
⚔️
RS @ Apple
⚔️
RS @ Apple

Highlights

  • Pro
Block or Report

Block or report tsujuifu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Stable-Hair: Real-World Hair Transfer via Diffusion Model

215 12 Updated Jul 22, 2024

Agent driven automation starting with the web. Discord: https://discord.gg/wgNfmFuqJF

Python 550 68 Updated Jul 30, 2024

Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

Python 13 1 Updated Jul 3, 2024
Python 15 Updated Jun 22, 2024

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Python 1,376 91 Updated Jul 30, 2024

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python 713 38 Updated Jul 29, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 8,672 810 Updated Jul 30, 2024

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Python 17 Updated Jul 10, 2024

[ICCV 2023] A latent space for stochastic diffusion models

Python 543 34 Updated Dec 31, 2023

CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing

Python 42 Updated May 18, 2024

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Python 790 69 Updated Jul 30, 2024

Fast Diffusion Models with Transformers

Python 633 87 Updated Oct 7, 2023

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 154 10 Updated Jul 22, 2024

An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch

Python 275 35 Updated May 23, 2023

Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)

Python 31 Updated Apr 15, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

Python 323 20 Updated Jul 26, 2024
HTML 13 Updated Jul 10, 2024

Bring portraits to life!

Python 8,897 845 Updated Jul 30, 2024

Modern Stable Diffusion models family - Fluently

Python 24 2 Updated Jun 6, 2024

Long Context Transfer from Language to Vision

Python 256 13 Updated Jul 28, 2024

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 300 8 Updated Jul 3, 2024

Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).

Python 124 2 Updated Jul 19, 2024

Enjoy the magic of Diffusion models!

Python 6,030 538 Updated Jul 30, 2024

4M: Massively Multimodal Masked Modeling

Python 1,454 84 Updated Jul 17, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,612 99 Updated Jul 26, 2024
Python 75 6 Updated Jun 28, 2024
22 Updated Jun 20, 2024

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 2,742 203 Updated Jul 29, 2024

The project page for "LOGIC-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning"

C 215 34 Updated Jun 13, 2024

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 966 70 Updated Jun 15, 2024
Next