  • NanJing

Starred repositories

Official inference repo for FLUX.1 models

Python 7,385 458 Updated Aug 14, 2024

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 1,009 72 Updated Jun 15, 2024

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

Python 93 1 Updated Jul 31, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 651 49 Updated Jul 29, 2024

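Hello Algo (《Hello 算法》): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart code. The Simplified and Traditional Chinese editions are updated in sync; English version ongoing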

Java 92,269 11,623 Updated Aug 11, 2024

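Free downloads of English-language magazines, including The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic, in epub, mobi, and pdf formats; updated weekly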

CSS 20,474 1,553 Updated Aug 9, 2024

Kolors Team

Python 3,046 179 Updated Aug 6, 2024

VideoTetris: Towards Compositional Text-To-Video Generation

Python 193 6 Updated Aug 1, 2024

[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals and generates natural, harmonious results from multiple controls.

Python 91 1 Updated Jul 5, 2024

Bring portraits to life!

Python 9,993 967 Updated Aug 14, 2024

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Go 84,899 6,527 Updated Aug 14, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model approaching GPT-4o performance

Python 5,006 388 Updated Aug 14, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,133 41 Updated Jul 14, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,651 104 Updated Aug 3, 2024

Enjoy the magic of Diffusion models!

Python 6,130 550 Updated Aug 14, 2024

STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

103 1 Updated Jun 18, 2024

Python 1,836 111 Updated Aug 14, 2024

RS5M: a large-scale vision language dataset for remote sensing

Python 187 7 Updated Jul 31, 2024

LLM101n: Let's build a Storyteller

26,941 1,461 Updated Aug 1, 2024

Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"

Python 157 3 Updated Jun 20, 2024

A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding

Python 45 2 Updated Aug 3, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,062 1,067 Updated Jul 4, 2024

A Survey on Vision-Language Geo-Foundation Models (VLGFMs)

92 5 Updated Jul 19, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

210 4 Updated Jun 16, 2024

A general fine-tuning kit geared toward diffusion models.

Python 1,108 84 Updated Aug 14, 2024

This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"

109 1 Updated Jun 13, 2024

A curated list of awesome research papers, projects, code, datasets, workshops, etc. related to virtual try-on.

2,042 261 Updated Jul 27, 2024

Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".

Python 638 44 Updated Jul 23, 2024

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

Python 681 40 Updated Jul 1, 2024