Block or Report
Block or report yuanli2333
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
LLMBind: A Unified Modality-Task Integration Framework
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Experiencing lightning fast (~1s) and accurate drag-based image editing
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Envision3D: One Image to 3D with Anchor Views Interpolation
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"
A Protein Large Language Model for Multi-Task Protein Language Processing
The official code for "Deep peak property learning for efficient chiral molecules ECD spectra prediction"
Mixture-of-Experts for Large Vision-Language Models
Official implementation of Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting (ECCV 2024)
An MBTI Exploration of Large Language Models
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
GPT-4V(ision) as A Social Media Analysis Engine
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Official implementation of "Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts" [ICLR 2024]
[NeurIPS 2023] Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment