Skip to content
View Haiyan-Chris-Wang's full-sized avatar
Block or Report

Block or report Haiyan-Chris-Wang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VisionLLM Series

Python 697 12 Updated Jul 2, 2024

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 4,480 469 Updated Jan 29, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 467 28 Updated Jul 3, 2024

Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 4,244 595 Updated Feb 14, 2024

[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices

Python 406 37 Updated Jan 23, 2024

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 2,195 152 Updated Jul 1, 2024

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything

Jupyter Notebook 668 54 Updated Jun 14, 2024

The open-source tool for building high-quality datasets and computer vision models

Python 7,865 518 Updated Jul 4, 2024

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 467 25 Updated Jun 21, 2024

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,772 98 Updated Nov 30, 2023

An Improved One millisecond Mobile Backbone

Python 140 31 Updated Aug 20, 2022

This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone".

Swift 700 58 Updated Jul 25, 2022

74.3% MobileNetV3-Large and 67.2% MobileNetV3-Small model on ImageNet

Python 512 124 Updated Mar 8, 2023

Simple implementation of OpenAI CLIP model in PyTorch.

Jupyter Notebook 570 84 Updated Apr 17, 2024

Code release for ConvNeXt model

Python 5,626 684 Updated Jan 8, 2023

Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

Python 45 2 Updated Feb 19, 2024

The official repository for Mobile AR Depth Estimation: Challenges & Prospects (HotMobile24)

Jupyter Notebook 5 1 Updated Mar 15, 2024

TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.

Jupyter Notebook 84 8 Updated Sep 24, 2021

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 590 20 Updated Jun 13, 2024

A PyTorch implementation of MobileNet V2 architecture and pretrained model.

Python 1,361 329 Updated Oct 20, 2019

An unofficial implementation of MobileNetV4 in Pytorch

Python 101 8 Updated May 11, 2024

[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI

Python 426 25 Updated Jan 17, 2024

This is a warehouse for MobileNetV4-Pytorch-model, can be used to train your image-datasets for vision tasks.

Python 34 4 Updated Jul 2, 2024

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Python 1,575 77 Updated Jul 3, 2024

Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset

Python 4,888 1,092 Updated Jan 15, 2024

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 977 70 Updated Jun 27, 2024

Official Pytorch code for SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model

33 Updated Mar 14, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 4,652 515 Updated Jun 26, 2024

a Android demo of depth_anything_v1 and depth_anything_v2

C++ 48 3 Updated Jun 18, 2024
Next