Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 4,244 595 Updated Feb 14, 2024

Picsart-AI-Research / MI-GAN

[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices

Python 406 37 Updated Jan 23, 2024

DepthAnything / Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 2,195 152 Updated Jul 1, 2024

THU-MIG / RepViT

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything

Jupyter Notebook 668 54 Updated Jun 14, 2024

voxel51 / fiftyone

The open-source tool for building high-quality datasets and computer vision models

Python 7,865 518 Updated Jul 4, 2024

apple / ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 467 25 Updated Jun 21, 2024

apple / ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,772 98 Updated Nov 30, 2023

shoutOutYangJie / MobileOne

An Improved One millisecond Mobile Backbone

Python 140 31 Updated Aug 20, 2022

apple / ml-mobileone

This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone".

Swift 700 58 Updated Jul 25, 2022

d-li14 / mobilenetv3.pytorch

74.3% MobileNetV3-Large and 67.2% MobileNetV3-Small model on ImageNet

Python 512 124 Updated Mar 8, 2023

moein-shariatnia / OpenAI-CLIP

Simple implementation of OpenAI CLIP model in PyTorch.

Jupyter Notebook 570 84 Updated Apr 17, 2024

facebookresearch / ConvNeXt

Code release for ConvNeXt model

Python 5,626 684 Updated Jan 8, 2023

rohit901 / cooperative-foundational-models

Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

Python 45 2 Updated Feb 19, 2024

cake-lab / Mobile-AR-Depth-Estimation

The official repository for Mobile AR Depth Estimation: Challenges & Prospects (HotMobile24)

Jupyter Notebook 5 1 Updated Mar 15, 2024

sayakpaul / FunMatch-Distillation

TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.

Jupyter Notebook 84 8 Updated Sep 24, 2021

IDEA-Research / Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 590 20 Updated Jun 13, 2024

tonylins / pytorch-mobilenet-v2

A PyTorch implementation of MobileNet V2 architecture and pretrained model.

Python 1,361 329 Updated Oct 20, 2019

jaiwei98 / MobileNetV4-pytorch

An unofficial implementation of MobileNetV4 in Pytorch

Python 101 8 Updated May 11, 2024

baaivision / Uni3D

[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI

Python 426 25 Updated Jan 17, 2024

jiaowoguanren0615 / MobileNetV4

This is a warehouse for MobileNetV4-Pytorch-model, can be used to train your image-datasets for vision tasks.

Python 34 4 Updated Jul 2, 2024

hbb1 / 2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Python 1,575 77 Updated Jul 3, 2024

CSAILVision / semantic-segmentation-pytorch

Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset

Python 4,888 1,092 Updated Jan 15, 2024

YvanYin / Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 977 70 Updated Jun 27, 2024

1hao-Liu / SM4Depth

Official Pytorch code for SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model

33 Updated Mar 14, 2024

naver / dust3r

DUSt3R: Geometric 3D Vision Made Easy

Python 4,652 515 Updated Jun 26, 2024

nihui / ncnn-android-nanodet

C++ 349 72 Updated Apr 11, 2024

FeiGeChuanShu / ncnn-android-depth_anything

a Android demo of depth_anything_v1 and depth_anything_v2

C++ 48 3 Updated Jun 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Haiyan Wang Haiyan-Chris-Wang

Block or report Haiyan-Chris-Wang

Stars

OpenGVLab / VisionLLM

ChaoningZhang / MobileSAM

DAMO-NLP-SG / VideoLLaMA2

isl-org / MiDaS