The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".

Python 217 8 Updated Feb 5, 2024

System for AI Education Resource.

Python 3,595 443 Updated Oct 25, 2024

🤠 Agent-as-a-Judge and DevAI dataset

Python 184 17 Updated Nov 1, 2024

Public repository for the poster abstract "Realistic Multiuser, Multimodal (IMU, Acoustic) HAR Data Generation through Single User Data Augmentation", accepted at ACM/IEEE IPSN 2022.

Python 1 Updated Aug 12, 2023
Python 5 5 Updated Feb 28, 2022

3-layer-CNN and ResNet with OPPORTUNITY dataset, PAMAP2 dataset, UCI-HAR dataset, UniMiB-SHAR dataset, USC-HAD dataset, and WISDM dataset.

Python 51 7 Updated May 21, 2022

GTP engine and self-play learning in Go

C++ 3,566 565 Updated Oct 8, 2024
Jupyter Notebook 572 53 Updated Nov 8, 2024

Meta-Transformer for Unified Multimodal Learning

Python 1,520 113 Updated Dec 5, 2023

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 584 32 Updated Oct 22, 2024

Official repository for NuScenes-MQA. The paper was accepted at the LLVA-AD Workshop at WACV 2024.

23 Updated Dec 21, 2023

high Building Throw Det

Python 11 Updated Feb 26, 2023

[ICCV 2023] GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding

Python 45 Updated Aug 28, 2023

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World

Python 457 14 Updated Aug 9, 2024

The official Talk2Car dataset repo

Python 68 6 Updated Jul 25, 2024

MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning

Python 128 6 Updated Jul 2, 2023

[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.

Python 155 1 Updated Nov 1, 2024

Official PyTorch implementation of FocalFormer3D [ICCV 2023]

Python 168 19 Updated Feb 20, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,850 129 Updated Jul 2, 2024

[ICCV 2023] Code for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

Python 285 18 Updated Sep 14, 2023

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 30,480 7,478 Updated Nov 7, 2024

[ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

Python 180 13 Updated Aug 24, 2023

Official code base of the BEVDet series.

Python 1,434 266 Updated Jul 4, 2024

The official implementation of the ICLR 2021 paper "Improve Object Detection with Feature-based Knowledge Distillation: Towards Accurate and Efficient Detectors".

57 6 Updated Jun 16, 2021

Official code for BEVDepth.

Python 725 100 Updated Jan 18, 2023

Code for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.

Python 269 25 Updated Oct 13, 2023

Segment Any RGBD

Python 788 45 Updated May 24, 2023

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 2,551 385 Updated Jul 29, 2024

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python 2,522 234 Updated Aug 1, 2024

This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 4,803 501 Updated Jan 29, 2024