jpthu17

🎯

Focusing

Peng Jin jpthu17

🎯

Focusing

Good morning, good afternoon, good evening, and good night!

124 followers · 53 following

Peking University
09:54 (UTC +08:00)
https://jpthu17.github.io/

Achievements

Organizations

Stars

baaivision / Emu3

Next-Token Prediction is All You Need

Python 406 6 Updated Sep 28, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,878 213 Updated Sep 27, 2024

AlonzoLeeeooo / awesome-video-generation

A collection of awesome video generation studies.

TeX 279 7 Updated Sep 28, 2024

Drexubery / ViewCrafter

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 741 24 Updated Sep 23, 2024

showlab / Awesome-Unified-Multimodal-Models

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

146 1 Updated Sep 9, 2024

showlab / Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 866 39 Updated Sep 26, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,354 131 Updated Sep 24, 2024

wyhuai / SkillMimic

Official code release for the paper "SkillMimic: Learning Reusable Basketball Skills from Demonstrations"

Python 152 9 Updated Sep 17, 2024

wdndev / mllm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师多模态相关知识

HTML 66 1 Updated May 12, 2024

hrtang22 / MUSE

Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval"

Python 10 Updated Sep 8, 2024

Alpha-VLLM / Lumina-mGPT

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 471 19 Updated Aug 16, 2024

VITA-MLLM / VITA

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Python 793 40 Updated Sep 22, 2024

DaiShiResearch / TransNeXt

[CVPR 2024] Code release for TransNeXt model

Python 382 15 Updated Jun 13, 2024

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,775 108 Updated Jul 29, 2024

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,078 940 Updated Aug 21, 2024

PKU-YuanGroup / Cycle3D

Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

173 7 Updated Aug 10, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 13,570 1,244 Updated Sep 28, 2024

KwaiVGI / LivePortrait

Bring portraits to life!

Python 12,012 1,259 Updated Sep 6, 2024

Kwai-Kolors / Kolors

Kolors Team

Python 3,634 237 Updated Sep 4, 2024

yxymessi / yxymessi.github.io

CSS 3 Updated Sep 28, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,695 112 Updated Sep 19, 2024

PKU-YuanGroup / ChronoMagic-Bench

[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

Python 168 14 Updated Sep 28, 2024

SUSTechBruce / LOOK-M

Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"

Python 65 3 Updated Sep 19, 2024

Kipok / NeMo-Skills

A pipeline to improve skills of large language models

Python 151 33 Updated Sep 29, 2024

FoundationVision / OmniTokenizer

OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 231 5 Updated Jul 9, 2024

PKU-YuanGroup / LLMBind

LLMBind: A Unified Modality-Task Integration Framework

Python 14 2 Updated Jun 16, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,214 48 Updated Aug 15, 2024

ChocoWu / SeTok

32 Updated Jun 19, 2024

kvablack / ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 405 41 Updated Mar 22, 2024

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,275 202 Updated Aug 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Peng Jin jpthu17

Achievements

Achievements

Organizations

Block or report jpthu17

Stars

baaivision / Emu3

hijkzzz / Awesome-LLM-Strawberry

AlonzoLeeeooo / awesome-video-generation

Drexubery / ViewCrafter

showlab / Awesome-Unified-Multimodal-Models

showlab / Show-o

QwenLM / Qwen2-VL

wyhuai / SkillMimic

wdndev / mllm_interview_note

hrtang22 / MUSE

Alpha-VLLM / Lumina-mGPT

VITA-MLLM / VITA

DaiShiResearch / TransNeXt

facebookresearch / chameleon

facebookresearch / segment-anything-2

PKU-YuanGroup / Cycle3D

Dao-AILab / flash-attention

KwaiVGI / LivePortrait

Kwai-Kolors / Kolors

yxymessi / yxymessi.github.io

cambrian-mllm / cambrian

PKU-YuanGroup / ChronoMagic-Bench

SUSTechBruce / LOOK-M

Kipok / NeMo-Skills

FoundationVision / OmniTokenizer

PKU-YuanGroup / LLMBind

FoundationVision / LlamaGen

ChocoWu / SeTok

kvablack / ddpo-pytorch

opendilab / awesome-RLHF