
[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals and generates natural, harmonious results from multiple controls.

Python 48 Updated Jul 3, 2024

Code release for "Segment Anything without Supervision"

Jupyter Notebook 170 8 Updated Jul 1, 2024

AuraSR: GAN-based Super-Resolution for real-world

Python 244 14 Updated Jun 26, 2024

Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Python 69 Updated Jul 2, 2024

Enjoy the magic of Diffusion models!

Python 5,602 510 Updated Jul 5, 2024

[CVPR 2024 Highlight] VGGSfM Visual Geometry Grounded Deep Structure From Motion

Python 454 29 Updated Jul 4, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,418 89 Updated Jul 4, 2024

Long Context Transfer from Language to Vision

Python 170 10 Updated Jul 3, 2024
Python 56 3 Updated May 24, 2024

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Python 800 45 Updated Jul 2, 2024

4M: Massively Multimodal Masked Modeling

Python 1,242 60 Updated Jul 3, 2024

Tiny AutoEncoder for Stable Diffusion

Python 481 27 Updated Jun 16, 2024

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Python 341 22 Updated Jul 3, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,417 88 Updated Jun 21, 2024

Official implementation of the CVPR 2024 highlight paper: Matching Anything by Segmenting Anything

Python 820 44 Updated Jun 28, 2024

A diffusers pipeline for zero shot stylised portrait creation

Python 388 15 Updated Jun 22, 2024

[ECCV 2024] Official inference code for the papers "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

379 11 Updated Jun 28, 2024

From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Python 1,488 57 Updated Jul 2, 2024

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 2,233 154 Updated Jul 1, 2024

Code for "Real3D: Scaling Up Large Reconstruction Models with Real-World Images"

Python 106 Updated Jun 13, 2024

Official implementation of Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion

Python 117 5 Updated Jun 12, 2024

Official code for "Neural Gaffer: Relighting Any Object via Diffusion"

154 2 Updated Jun 12, 2024

This repository contains the code for SF-V: Single Forward Video Generation Model.

74 3 Updated Jun 7, 2024

Official code for the CVPR 2024 paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language

Jupyter Notebook 43 8 Updated Jun 12, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 473 28 Updated Jul 4, 2024

Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).

Python 45 4 Updated Jun 11, 2024

Official implementation of the paper: Zero-shot Image Editing with Reference Imitation

Python 770 59 Updated Jun 15, 2024

MARS5 speech model (TTS) from CAMB.AI

Python 2,090 164 Updated Jul 2, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 958 33 Updated Jun 29, 2024

VideoTetris: Towards Compositional Text-To-Video Generation

Python 146 2 Updated Jun 30, 2024