Skip to content
View Yifehuang97's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro
Block or Report

Block or report Yifehuang97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

nanoGPT style version of Llama 3.1

Python 1,033 37 Updated Aug 8, 2024

Implements VAR+CLIP for image generation

Python 53 1 Updated Aug 5, 2024

Official inference repo for FLUX.1 models

Python 8,977 570 Updated Aug 16, 2024

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement

Python 801 60 Updated Aug 14, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 9,668 688 Updated Aug 18, 2024

RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

Jupyter Notebook 94 4 Updated Jun 12, 2024

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

Python 188 16 Updated Aug 9, 2024

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 1,734 159 Updated Aug 19, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,141 277 Updated May 4, 2024

Stable-Hair: Real-World Hair Transfer via Diffusion Model

305 20 Updated Jul 22, 2024
26 Updated Jul 26, 2024

The official PyTorch implementation of "The 18th European Conference on Computer Vision" (ECCV 2024) paper Length-Aware Motion Synthesis via Latent Diffusion.

Python 9 1 Updated Jul 17, 2024

[MICCAI 2024] TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs

9 Updated Jun 19, 2024

Official implementation for the paper "InsertDiffusion: Identity Preserving Visualization of Objects through a Training-Free Diffusion Architecture".

Jupyter Notebook 8 Updated Aug 12, 2024

Official Code Release for [SIGGRAPH 2024] DilightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

Python 84 4 Updated Aug 6, 2024

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Jupyter Notebook 965 57 Updated Sep 21, 2023

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 30,643 7,824 Updated Aug 3, 2024

CraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner

Python 371 17 Updated Jul 24, 2024

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Python 682 30 Updated Aug 14, 2024

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 172 14 Updated Aug 19, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 654 50 Updated Jul 29, 2024

Code Repository for MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos (ECCV 2024)

Python 53 Updated Aug 10, 2024

Official Implement of the work "Coherent and Multi-modality Image Inpainting via Latent Space Optimization"

Python 33 2 Updated Jul 30, 2024

Official Implement of ECCV 2024 paper "Multi-modal Crowd Counting via a Broker Modality"

8 Updated Jul 3, 2024

Understand Human Behavior to Align True Needs

Python 3,184 279 Updated Jul 20, 2024

Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling"

Jupyter Notebook 94 2 Updated Jul 28, 2024

distributed trainer for LLMs

Python 513 74 Updated May 20, 2024
Jupyter Notebook 97 3 Updated Aug 13, 2024

Official Implementation of ECCV2024 paper: Chat Edit 3D: Interactive 3D Scene Editing via Text Prompts

Python 37 Updated Aug 10, 2024

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 327 9 Updated Jul 16, 2024
Next