Skip to content
View songuyenerza's full-sized avatar

Block or report songuyenerza

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 4,955 473 Updated Sep 4, 2024
Python 76 3 Updated Jun 15, 2024
Python 6 4 Updated Sep 4, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,920 3,786 Updated Sep 4, 2024

(ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''

Python 32 Updated Jul 23, 2024
Python 58 3 Updated Apr 27, 2024

Multi-view Diffusion for 3D Generation

Python 766 56 Updated Oct 7, 2023

Single Image Reflection Removal with Edge Guidance, Reflection Classifier, and Recurrent Decomposition, WACV2021

Python 12 2 Updated Mar 16, 2022

Text to 3D generation

Python 52 Updated Aug 12, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python 4,341 541 Updated Aug 9, 2024

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 6,636 1,164 Updated Jul 21, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,587 76 Updated Aug 5, 2024

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,550 99 Updated Aug 20, 2024

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Python 1,657 115 Updated Feb 23, 2024

⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Python 1,120 36 Updated Jun 7, 2024

[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation

Python 3,854 341 Updated Jan 2, 2024

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Python 8,462 1,454 Updated Jun 26, 2024
Python 153 10 Updated Aug 16, 2024

[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis

Python 231 12 Updated Feb 21, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 4,953 542 Updated Aug 10, 2024

Code release for CVPR'24 submission 'OmniGlue'

Python 510 42 Updated Aug 12, 2024

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Jupyter Notebook 877 84 Updated Aug 17, 2024

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python 1,201 138 Updated Aug 28, 2024

Code for Master research internship report : https://arxiv.org/abs/1909.13579

Python 104 16 Updated Dec 8, 2022

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 786 56 Updated Jul 10, 2024

[ICLR 2023] Unicom: Universal and Compact Representation Learning for Image Retrieval

Python 216 16 Updated Jul 24, 2024

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 630 37 Updated Jul 30, 2024

[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Python 292 28 Updated Jul 22, 2024
Next