yuanli2333
💭
I may be slow to respond.

Organizations

@PKU-YuanGroup

ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

Python 162 13 Updated Jul 31, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,617 101 Updated Jul 29, 2024

LLMBind: A Unified Modality-Task Integration Framework

Python 14 2 Updated Jun 16, 2024

An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Python 1,186 37 Updated Jul 8, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,322 144 Updated Jul 31, 2024

Your image is almost there!

Python 7,002 411 Updated Jul 26, 2024

Deep Contextual Video Compression

Python 360 57 Updated Feb 28, 2024
Python 279 7 Updated Jun 27, 2024

Experiencing lightning fast (~1s) and accurate drag-based image editing

242 10 Updated Jul 17, 2024

Unified Multi-modal IAA Baseline and Benchmark

68 5 Updated Apr 16, 2024

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Python 1,237 119 Updated Jul 29, 2024

Envision3D: One Image to 3D with Anchor Views Interpolation

Python 99 8 Updated May 16, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,067 989 Updated Aug 1, 2024

The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"

Python 44 6 Updated Mar 4, 2024

A Protein Large Language Model for Multi-Task Protein Language Processing

Python 115 13 Updated Jul 17, 2024

The official code for "Deep peak property learning for efficient chiral molecules ECD spectra prediction"

Python 28 Updated Jun 14, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,864 115 Updated May 15, 2024

Official implementation of Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting (ECCV 2024)

261 6 Updated Jul 26, 2024

An MBTI Exploration of Large Language Models

Python 438 20 Updated Feb 2, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,588 2,426 Updated Apr 28, 2024

A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

817 72 Updated Aug 1, 2024

A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!

Python 110 2 Updated Dec 31, 2023

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,754 197 Updated Jul 27, 2024

GPT-4V(ision) as A Social Media Analysis Engine

30 2 Updated Nov 16, 2023

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Python 736 39 Updated Jul 21, 2024

Official implementation of "Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts" [ICLR 2024]

Python 99 3 Updated Jun 27, 2024

[NeurIPS 2023] Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs

Python 102 6 Updated Nov 15, 2023

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Python 644 48 Updated Mar 25, 2024