Organizations

@TencentARC @XPixelGroup

Kolors Team

Python 2,647 149 Updated Jul 19, 2024

Bring portraits to life!

Python 7,741 694 Updated Jul 23, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 338 10 Updated Jul 10, 2024

A native PyTorch Library for large model training

Python 1,367 117 Updated Jul 23, 2024

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 294 8 Updated Jul 16, 2024
Python 316 9 Updated Jul 16, 2024

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Python 2,777 272 Updated Jun 20, 2024

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 6,591 318 Updated Jul 22, 2024

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,222 105 Updated Jul 17, 2024

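Xiaohongshu note and comment crawler, Douyin video and comment crawler, Kuaishou video and comment crawler, Bilibili video and comment crawler, Weibo post and comment crawler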

Python 15,355 4,985 Updated Jul 20, 2024

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 456 15 Updated Jun 26, 2024

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Jupyter Notebook 842 69 Updated Nov 7, 2023

A simple HTML visualization tool for computer vision research 🛠️

Python 228 14 Updated Aug 16, 2021

Transparent Image Layer Diffusion using Latent Transparency

1,914 22 Updated Jun 16, 2024

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,187 170 Updated Jul 19, 2024

ICLR 2024 (Spotlight)

Python 691 21 Updated Mar 2, 2024

PhotoMaker [CVPR 2024]

Jupyter Notebook 8,827 695 Updated Jul 24, 2024

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,186 66 Updated Jul 19, 2024

Official code of SmartEdit [CVPR-2024 Highlight]

Python 203 3 Updated Jun 21, 2024

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

325 13 Updated Mar 29, 2024

Easily create large video dataset from video urls

Python 512 59 Updated Jul 18, 2024

A lightweight tool for camera pose visualization

Python 80 5 Updated Oct 19, 2023

(CVPR 2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Python 283 8 Updated Jul 11, 2024

Official implementation of SEED-LLaMA (ICLR 2024).

Python 530 30 Updated Apr 11, 2024

Official codes for DeSRA (ICML 2023)

Python 120 Updated Feb 2, 2024

Implementation of “DreamDiffusion: Generating High-Quality Images from Brain EEG Signals”

Python 438 50 Updated Jan 30, 2024

NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

Python 381 18 Updated May 14, 2024

A unified framework for 3D content generation.

Python 5,974 463 Updated May 27, 2024

General video interaction platform based on LLMs, including Video ChatGPT

Python 247 17 Updated Jul 26, 2023

GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.

Python 747 55 Updated Dec 19, 2023