
Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,742 124 Updated Jul 2, 2024

YOLO-World + EfficientViT SAM

Python 66 7 Updated Feb 18, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 898 27 Updated Jul 31, 2024

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,308 459 Updated Aug 19, 2024

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Python 495 25 Updated Jun 11, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Python 38,340 4,950 Updated Aug 9, 2024

Code for some improvement ideas for object detection scripts; see readme.md for details.

Python 5,051 460 Updated Sep 8, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,809 1,363 Updated Aug 9, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,628 170 Updated Sep 9, 2024

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Python 1,491 100 Updated Jul 22, 2024
Python 86 4 Updated Nov 17, 2023

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,299 418 Updated Jul 30, 2024

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 4,658 479 Updated Jan 29, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,739 517 Updated Jul 17, 2024

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,149 102 Updated Dec 20, 2023

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,243 107 Updated Jul 19, 2024
Python 206 10 Updated Jun 28, 2024
Python 320 46 Updated Mar 8, 2024

[CVPR 2024] Deformable Convolution v4

Python 467 27 Updated May 17, 2024

Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios

Python 750 54 Updated Aug 5, 2023

"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)

Python 2,153 141 Updated Dec 12, 2023

[ICCV 2023] DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds

Python 313 32 Updated Aug 7, 2024

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

Python 6,535 1,691 Updated Sep 9, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 42 8 Updated Apr 8, 2023

Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection

Python 268 39 Updated Jul 6, 2023

Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer

Python 221 18 Updated Feb 16, 2023

TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)

Python 91 10 Updated Sep 8, 2022

yolo+deepsort (original)

Python 32 4 Updated May 8, 2022

[ECCV'22 Oral] Towards Grand Unification of Object Tracking

Python 953 87 Updated Oct 17, 2022