Skip to content
View mmmmimic's full-sized avatar
🧸
Baby doll Tongtong
🧸
Baby doll Tongtong
  • Danmarks Tekniske Universitet
  • Kongens Lyngby, Danmark
  • 21:39 (UTC +03:00)
  • X @holidaypiggy233

Highlights

  • Pro

Block or report mmmmimic

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".

Python 183 9 Updated Sep 16, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,696 155 Updated Oct 4, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 8,982 558 Updated Oct 15, 2024

[ECCV 2024] Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention

Python 58 10 Updated Sep 24, 2024

Outlier detection challenge 2024 - a DTU Compute summer school challenge

Python 5 5 Updated Aug 12, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,775 1,039 Updated Oct 14, 2024

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 1 Updated May 15, 2024

LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案

1,157 266 Updated Dec 14, 2023

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python 2,507 232 Updated Aug 1, 2024

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Python 1,237 138 Updated Mar 18, 2024

将Mask2Former的backbone替换成DINOv2训练好的ViT模型

Python 26 Updated May 12, 2023

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 2,518 384 Updated Jul 29, 2024

Clone of COCO API - Dataset @ https://cocodataset.org/ - with changes to support Windows build and python3

Jupyter Notebook 1,131 466 Updated Dec 29, 2022

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 752 22 Updated Aug 9, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 376 17 Updated Apr 8, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,304 111 Updated Jul 19, 2024

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,173 105 Updated Dec 20, 2023
Python 1,455 254 Updated Apr 19, 2024
Python 2 Updated Sep 21, 2023

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,953 407 Updated May 29, 2024

Official implementation of "Controllable Prompt Tuning For Balancing Group Distributional Robustness" (ICML 2024), coming soon.

4 Updated May 3, 2024

Official Implementation of Avoiding spurious correlations via logit correction

Python 17 1 Updated May 6, 2023

MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts (ICLR 2022)

Jupyter Notebook 108 4 Updated Aug 29, 2022

Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)

Python 17 3 Updated Dec 15, 2023
Python 4 1 Updated Mar 28, 2024

[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Python 299 13 Updated Oct 7, 2024

Official PyTorch implementation of ChAda-ViT [CVPR 2024]

Python 28 1 Updated May 14, 2024

[MICCAI'2024] EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera

Python 31 3 Updated May 15, 2024

[IPCAI'2024 (IJCARS special issue)] Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery

Python 47 2 Updated May 22, 2024
Next