Skip to content
View Owen718's full-sized avatar
🤠
I may be slow to respond.
🤠
I may be slow to respond.

Block or report Owen718

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

「大模型」3小时从0训练27M参数的视觉多模态VLM,个人显卡即可推理训练!

Python 154 15 Updated Oct 10, 2024

Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation

Python 31 2 Updated Aug 14, 2024

Practice Code for text to image trainer

Python 64 3 Updated Oct 8, 2024

[train + eval + deploy] Aurora Series: A more efficient multimodal large language model series for video.

Python 28 1 Updated Oct 10, 2024

Next-Token Prediction is All You Need

Python 929 26 Updated Oct 8, 2024

[ECCV‘24] Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint

Python 20 Updated Sep 25, 2024

JAX port of FLUX.1 models using flax.nnx

Python 19 Updated Sep 28, 2024

🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org

Python 5,438 670 Updated Oct 11, 2024

「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!

Python 2,254 267 Updated Oct 11, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 410 32 Updated Oct 8, 2024

[ICLR 2024] Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models

10 Updated Apr 2, 2024

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

177 11 Updated Sep 19, 2024

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 377 12 Updated May 24, 2024
Python 51 11 Updated Jun 24, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,257 66 Updated Oct 1, 2024

Segmind Distilled diffusion

Python 562 36 Updated Oct 18, 2023

⛏💎 STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment

28 1 Updated Dec 27, 2023

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 863 52 Updated Jul 20, 2024

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 411 24 Updated Oct 7, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,040 86 Updated Aug 6, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,059 304 Updated Oct 6, 2024

Repo is required for the code of our research paper on micro-budget training of large scale diffusion model.

151 1 Updated Jul 22, 2024

Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.

Python 157 11 Updated Jul 22, 2024

Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

81 2 Updated Jul 16, 2024

"SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu

Python 34 Updated Sep 28, 2024

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)

Jupyter Notebook 423 26 Updated May 29, 2024

Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Python 542 20 Updated Sep 27, 2024

This repo provides a YOLOv8 model, finely trained for detecting human heads in complex crowd scenes, with the CrowdHuman dataset serving as training data. To boost accessibility and compatibility, …

Python 12 Updated Jul 16, 2024

Kolors Team

Python 3,696 245 Updated Sep 4, 2024
Next