Highlights
- Pro
Block or Report
Block or report roymiles
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language: Python
Sort by: Most stars
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Dual Attention Network for Scene Segmentation (CVPR2019)
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Train…
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
When do we not need larger vision models?
Implementation of Posenet in TensorFlow
PyTorch implementation of [1412.6553] and [1511.06530] tensor decomposition methods for convolutional layers.
RetinaFace (Single-stage Dense Face Localisation in the Wild, 2019) implemented (ResNet50, MobileNetV2 trained on single GPU) in Tensorflow 2.0+. This is an unofficial implementation. With Colab.
An open-source implementation for training LLaVA-NeXT.
[ICCV 2019] Key.Net: Keypoint Detection by Handcrafted and Learned CNN Filters
[ICCV 2023] Binary Adapters, [AAAI 2023] FacT, [Tech report] Convpass
A Simulation Framework for Memristive Deep Learning Systems
[ECCV 2024] Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching.
Fine-tuning Vision Transformers on various classification datasets
[ACM MM22] Towards Robust Video Object Segmentation with Adaptive Object Calibration, ACM Multimedia 2022
[CVPR 2022] ScaleNet: A Shallow Architecture for Scale Estimation