lifeGWT

lifeGWT

6 followers · 48 following

Stars

och-mac / TraDiffusion

TraDiffusion: Trajectory-Based Training-Free Image Generation

Python 37 2 Updated Aug 23, 2024

ZCMax / ScanReason

[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities

43 1 Updated Jul 2, 2024

WeihuangLin / INF-LLaVA

INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model

Python 36 Updated Aug 4, 2024

mrwu-mac / ControlMLLM

Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'

Python 43 2 Updated Aug 26, 2024

Lordog / dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

3,209 275 Updated Jul 3, 2024

mrwu-mac / DHD

PyTorch implementation of the paper `Toward Open-set Human Object Interaction Detection' (AAAI2024)

Python 1 Updated Jun 5, 2024

heshuting555 / SegPoint

23 Updated Jul 19, 2024

YouHuang67 / mamba-code-explained

Cuda 16 1 Updated Jul 17, 2024

fmu2 / snag_release

Official Implementation of SnAG (CVPR 2024)

Python 32 3 Updated Apr 22, 2024

huangb23 / VTimeLLM

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 204 11 Updated Jun 13, 2024

HebeiFast / EventLowLightVOS

7 Updated Jun 5, 2024

UX-Decoder / Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,255 107 Updated Jul 19, 2024

YouHuang67 / High-Resolution-Segment-Anything

Python 18 3 Updated Jul 4, 2024

52CV / ECCV-2024-Papers

43 1 Updated Aug 29, 2024

MarkMoHR / Awesome-Referring-Image-Segmentation

📚 A collection of papers about Referring Image Segmentation.

596 56 Updated Aug 30, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,482 425 Updated Sep 10, 2024

nini0919 / SemiRES

[ICML2024]The official implementation of SemiRES in PyTorch.

Python 18 Updated Jun 20, 2024

rsdler / Remote-Sensing-in-CVPR2024

Papers related to remote sensing in CVPR 2024

116 7 Updated Jun 24, 2024

lx709 / VRSBench

Python 23 1 Updated Sep 7, 2024

wusize / F-LMM

Code Release of F-LMM: Grounding Frozen Large Multimodal Models

Python 35 Updated Aug 5, 2024

jy0205 / LaVIT

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 496 25 Updated Jul 1, 2024

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,259 68 Updated Aug 21, 2024

LinZhekai / X-Oscar

About Official repository for "X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation"

Python 50 2 Updated Jun 25, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 26,099 2,925 Updated Aug 12, 2024

embodied-generalist / embodied-generalist

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 329 30 Updated Jul 30, 2024

ActiveVisionLab / Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

973 65 Updated Sep 12, 2024

PKU-YuanGroup / MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Python 1,270 124 Updated Jul 29, 2024

PKU-EPIC / MaskClustering

[CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation

Python 71 2 Updated Apr 25, 2024

Yuliang-Liu / Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,766 122 Updated Sep 5, 2024

amusi / CVPR2024-Papers-with-Code

CVPR 2024 论文和开源项目合集

17,698 2,559 Updated Jul 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly