lizhaoliu-Lec

🎯

Focusing

lizhaoliu lizhaoliu-Lec

Persistence and Concentration.

24 followers · 11 following

South China University of Technology
Guangzhou/China

Stars

DepthAnything / Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python · 3,141 stars · 245 forks · Updated Aug 14, 2024

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook · 10,238 stars · 776 forks · Updated Aug 21, 2024

xzhih / one-key-hidpi

Enable macOS HiDPI and have a native setting.

Shell · 8,598 stars · 985 forks · Updated Jul 3, 2024

XinyuSun / PSL-InstanceNav

Official implementation of the ECCV 2024 paper "Prioritized Semantic Learning for Zero-shot Instance Navigation"

Python · 5 stars · 1 fork · Updated Jul 15, 2024

Li-ChangHao / CoNav

5 stars · Updated Jul 16, 2024

yuweihao / MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Python · 1,946 stars · 31 forks · Updated Jun 6, 2024

ZSHsh98 / MMD-MP

This is the source code for Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy (ICLR2024).

Python · 38 stars · 3 forks · Updated Aug 12, 2024

ActiveVisionLab / Awesome-LLM-3D

Awesome-LLM-3D: a curated list of resources on multi-modal large language models in the 3D world

943 stars · 62 forks · Updated Jul 4, 2024

xai-org / grok-1

Grok open release

Python · 49,394 stars · 8,332 forks · Updated Aug 30, 2024

alaamaalouf / FollowAnything

Jupyter Notebook · 351 stars · 45 forks · Updated Dec 5, 2023

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python · 7,550 stars · 441 forks · Updated May 3, 2024

GAP-LAB-CUHK-SZ / SAMPro3D

SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation

Python · 91 stars · 7 forks · Updated Jan 12, 2024

dvlab-research / LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python · 678 stars · 43 forks · Updated Jul 29, 2024

3d-vista / 3D-VisTA

Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"

Python · 176 stars · 9 forks · Updated Sep 7, 2023

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python · 735 stars · 37 forks · Updated Jun 2, 2024

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python · 10,133 stars · 1,528 forks · Updated Aug 29, 2024