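A collection of AI painting resources (including platforms usable in China and abroad, usage tutorials, parameter tutorials, deployment tutorials, industry news, and more): Stable Diffusion, AnimateDiff, Stable Cascade, Stable SDXL Turbo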
Media Downloader is a Qt/C++ front end to yt-dlp, youtube-dl, gallery-dl, lux, you-get, svtplay-dl, aria2c, wget and safari books.
ControlNet++: All-in-one ControlNet for image generations and editing!
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Understand Human Behavior to Align True Needs
Open-Sora: Democratizing Efficient Video Production for All
Open source real-time translation app for Android that runs locally
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…
neosr is a framework for training real-world single-image super-resolution networks.
Lumina-T2X is a unified framework for Text to Any Modality Generation
Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
Revive unavailable songs for Netease Cloud Music (Refactored & Enhanced version)
The ultimate no-code platform to build and share AI apps with beautiful UI.
⛓️ Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, model and vector store agnostic.
Boosting the performance of consistency models with PCM!
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training)
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal dialogue model approaching GPT-4o performance
A state-of-the-art-level open visual language model | multimodal pretrained model