[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

Jupyter Notebook 414 14 Updated Jul 13, 2024

muslll / neosr

neosr is a framework for training real-world single-image super-resolution networks.

Python 116 27 Updated Jul 20, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 1,924 80 Updated Jul 23, 2024

kijai / ComfyUI-Florence2

Inference Microsoft Florence2 VLM

Python 363 25 Updated Jul 25, 2024

RockeyCoss / SPO

Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

Python 114 2 Updated Jul 10, 2024

UnblockNeteaseMusic / server

Revive unavailable songs for Netease Cloud Music (Refactored & Enhanced version)

JavaScript 6,015 611 Updated Jul 22, 2024

sdfxai / sdfx

The ultimate no-code platform to build and share AI apps with beautiful UI.

TypeScript 358 20 Updated Jun 30, 2024

langflow-ai / langflow

⛓️ Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, model and vector store agnostic.

JavaScript 22,414 3,191 Updated Jul 25, 2024

G-U-N / Phased-Consistency-Model

Boosting the performance of consistency models with PCM!

Python 311 10 Updated Jul 16, 2024

MyNiuuu / MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 500 26 Updated Jul 14, 2024

lllyasviel / Omost

Your image is almost there!

Python 6,955 405 Updated Jul 14, 2024

YvanYin / Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 1,085 75 Updated Jul 23, 2024

mulanai / MuLan

MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)

Python 111 3 Updated May 27, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Python 4,392 338 Updated Jul 25, 2024

THUDM / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,701 391 Updated May 29, 2024

QL-boy

Block or report QL-boy

Lists (10)

AI

CG ART

Develop

Generative AI

NetWorking

Postprocessing AI

Recognition AI

Software

Study

Summary List

Stars