View roymiles's full-sized avatar

When do we not need larger vision models?

Python 259 7 Updated Jul 4, 2024

[ECCV 2024] Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching.

Python 73 9 Updated Jul 2, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,512 89 Updated Jul 6, 2024

Efficient computing methods developed by Huawei Noah's Ark Lab

Jupyter Notebook 1,145 200 Updated Jul 6, 2024
CSS 2 Updated Jun 25, 2024
Python 84 3 Updated Jul 6, 2024

An open-source implementation of LLaVA-NeXT.

Python 128 4 Updated Jun 12, 2024

[CVPR 2024] VkD: Improving Knowledge Distillation using Orthogonal Projections

Python 30 2 Updated Apr 29, 2024

[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences

Python 376 19 Updated Jul 3, 2024

LLM training in simple, raw C/CUDA

Cuda 21,617 2,350 Updated Jul 12, 2024

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Train…

Python 1,322 128 Updated May 28, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,587 162 Updated Jun 25, 2024

[ICCV 2023] Binary Adapters, [AAAI 2023] FacT, [Tech report] Convpass

Python 162 7 Updated Aug 1, 2023

[AAAI 2024] Understanding the Role of the Projector in Knowledge Distillation

Jupyter Notebook 11 Updated Feb 13, 2024

Mamba SSM architecture

Python 11,633 952 Updated Jul 3, 2024

Implementation of the paper "Learning to Prompt CLIP for Monocular Depth Estimation: Exploring the Limits of Human Language", ICCV Workshop on Open Vocabulary Scene Understanding (OpenSUN3D) 2023

Jupyter Notebook 6 Updated Nov 8, 2023
Python 1,703 52 Updated Jun 28, 2024
Python 9 3 Updated Oct 11, 2023

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Python 2,021 159 Updated Apr 5, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,010 1,432 Updated Jul 11, 2024

Fine-tuning Vision Transformers on various classification datasets

Python 73 9 Updated May 5, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,449 338 Updated Mar 20, 2024

A playbook for systematically maximizing the performance of deep learning models.

25,787 2,157 Updated Jun 18, 2024

Official DeiT repository

Python 3,949 548 Updated Mar 15, 2024

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,756 336 Updated May 8, 2024
Python 16 4 Updated Jan 10, 2024

Official implementation of "Monocular Depth Estimation Using Cues Inspired by Biological Vision Systems", D. Auty and K. Mikolajczyk, International Conference on Pattern Recognition (ICPR) 2022

Jupyter Notebook 6 1 Updated Jan 27, 2023

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 45,538 5,379 Updated Jun 24, 2024

[BMVC 2022] Information Theoretic Representation Distillation

Python 14 1 Updated Oct 6, 2023