Skip to content
View isLinXu's full-sized avatar
🎯
Focusing coding and think
🎯
Focusing coding and think

Organizations

@StraitRobot

Block or report isLinXu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Unofficial PyTorch Reimplementation of RandAugment.

Python 626 98 Updated Mar 14, 2023

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 468 25 Updated Sep 5, 2024

LLaVA-Interactive-Demo

Python 344 25 Updated Jul 25, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 1,158 94 Updated Sep 4, 2024

Efficient Multimodal Large Language Models: A Survey

226 8 Updated Aug 16, 2024

Weakly Supervised Learning On Images

Python 597 63 Updated Oct 14, 2021
Python 153 24 Updated Jul 24, 2024

Task Residual for Tuning Vision-Language Models (CVPR 2023)

Python 65 7 Updated May 27, 2023

[CVPR 2024] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP).

Python 49 3 Updated Jun 1, 2024

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Python 761 70 Updated Apr 17, 2024

Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed

Jupyter Notebook 46 3 Updated Mar 27, 2024

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Python 1,208 134 Updated Mar 18, 2024

[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"

Python 69 5 Updated Aug 21, 2024

[ICCV 2021] Instances as Queries

Python 401 56 Updated Oct 20, 2023

Official implementation of WACV2024 paper: Object-centric Video Representation for Long-term Action Anticipation

Python 6 Updated Dec 31, 2023

[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"

Python 382 30 Updated Nov 10, 2023

USB: Universal-Scale Object Detection Benchmark (BMVC 2022)

Python 423 55 Updated Jul 8, 2023

Accompanying notebook and sources to "A Guide to Pseudolabelling: How to get a Kaggle medal with only one model" (Dec. 2020 PyData Boston-Cambridge Keynote)

Jupyter Notebook 27 7 Updated Jul 5, 2022

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,490 168 Updated Oct 31, 2023

Code release for ConvNeXt V2 model

Python 1,455 115 Updated Aug 14, 2024

VMamba: Visual State Space Models,code is based on mamba

Python 2,004 113 Updated Aug 4, 2024

Quick exploration into fine tuning florence 2

Jupyter Notebook 245 22 Updated Jul 31, 2024

Bridging Vision and Language Model

Python 279 31 Updated Mar 27, 2023

Mixture-of-Experts for Large Vision-Language Models

Python 1,896 121 Updated May 15, 2024

tiny vision language model

Jupyter Notebook 4,851 431 Updated Aug 27, 2024

A benchmark for cross-domain few-shot object detection (ECCV24 paper: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector)

Python 26 1 Updated Jul 25, 2024

Code release for "Language-conditioned Detection Transformer"

Python 82 4 Updated Jun 17, 2024

Segment Any RGBD

Python 770 45 Updated May 24, 2023

[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation

Python 348 16 Updated Sep 19, 2023
Python 123 15 Updated Jan 11, 2024
Next