
[ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Python 34 1 Updated Jul 25, 2024

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Python 694 53 Updated Jul 25, 2024
Python 19 1 Updated Jul 19, 2024

[arXiv 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"

Python 160 9 Updated Jul 25, 2024

Grounding Image Matching in 3D with MASt3R

Python 547 18 Updated Jul 25, 2024

Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Python 336 11 Updated Jul 20, 2024

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python 31 Updated Jul 23, 2024

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 134 9 Updated Jul 22, 2024
Python 122 6 Updated Jul 23, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 586 43 Updated Jul 24, 2024

Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 513 27 Updated Jul 24, 2024

Code for FreeTraj, a tuning-free method for trajectory-controllable video generation

Python 74 2 Updated Jul 24, 2024

Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis

Python 57 1 Updated Jul 18, 2024

Understand Human Behavior to Align True Needs

Python 2,985 253 Updated Jul 20, 2024

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

Python 139 4 Updated Jul 25, 2024

Bring portraits to life!

Python 8,314 765 Updated Jul 25, 2024

Kolors Team

Python 2,702 153 Updated Jul 19, 2024

Speeds up long-context LLM inference by computing attention with approximate, dynamic sparsity, which reduces pre-filling latency by up to 10x on an A100 while maintaining accuracy.

Python 581 20 Updated Jul 24, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,055 79 Updated Jul 7, 2024

[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals and generates natural, harmonious results from multiple controls.

Python 80 1 Updated Jul 5, 2024

Code release for "Segment Anything without Supervision"

Jupyter Notebook 247 17 Updated Jul 9, 2024

AuraSR: GAN-based Super-Resolution for real-world

Python 305 17 Updated Jul 23, 2024

Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Python 73 Updated Jul 2, 2024

Enjoy the magic of Diffusion models!

Python 5,982 532 Updated Jul 12, 2024

[CVPR 2024 Highlight] VGGSfM Visual Geometry Grounded Deep Structure From Motion

Python 669 36 Updated Jul 25, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,595 97 Updated Jul 6, 2024

Long Context Transfer from Language to Vision

Python 249 12 Updated Jul 12, 2024
Python 58 3 Updated May 24, 2024

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Python 883 53 Updated Jul 9, 2024

4M: Massively Multimodal Masked Modeling

Python 1,435 81 Updated Jul 17, 2024