isLinXu

🎯

Focusing coding and think

Hertz isLinXu

🎯

Focusing coding and think

world is large model.

175 followers · 682 following

@Tencent
China
21:32 (UTC +08:00)
https://islinxu.github.io/

Achievements

x2 x2

Achievements

x2 x2

Organizations

Lists (9)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

ildoonet / pytorch-randaugment

Unofficial PyTorch Reimplementation of RandAugment.

Python 626 98 Updated Mar 14, 2023

jishengpeng / WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 468 25 Updated Sep 5, 2024

LLaVA-VL / LLaVA-Interactive-Demo

LLaVA-Interactive-Demo

Python 344 25 Updated Jul 25, 2024

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 1,158 94 Updated Sep 4, 2024

swordlidev / Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

226 8 Updated Aug 16, 2024

facebookresearch / WSL-Images

Weakly Supervised Learning On Images

Python 597 63 Updated Oct 14, 2021

LilianHollard / LeYOLO

Python 153 24 Updated Jul 24, 2024

geekyutao / TaskRes

Task Residual for Tuning Vision-Language Models (CVPR 2023)

Python 65 7 Updated May 27, 2023

jusiro / CLAP

[CVPR 2024] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP).

Python 49 3 Updated Jun 1, 2024

LeapLabTHU / DAT

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Python 761 70 Updated Apr 17, 2024

OpenGVLab / InternVL-MMDetSeg

Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed

Jupyter Notebook 46 3 Updated Mar 27, 2024

czczup / ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Python 1,208 134 Updated Mar 18, 2024

xiuqhou / Relation-DETR

[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"

Python 69 5 Updated Aug 21, 2024

hustvl / QueryInst

[ICCV 2021] Instances as Queries

Python 401 56 Updated Oct 20, 2023

brown-palm / ObjectPrompt

Official implementation of WACV2024 paper: Object-centric Video Representation for Long-term Action Anticipation

Python 6 Updated Dec 31, 2023

microsoft / UniCL

[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"

Python 382 30 Updated Nov 10, 2023

shinya7y / UniverseNet

USB: Universal-Scale Object Detection Benchmark (BMVC 2022)

Python 423 55 Updated Jul 8, 2023

stanleyjzheng / PyData-Pseudolabelling-Keynote

Accompanying notebook and sources to "A Guide to Pseudolabelling: How to get a Kaggle medal with only one model" (Dec. 2020 PyData Boston-Cambridge Keynote)

Jupyter Notebook 27 7 Updated Jul 5, 2022