A Robust and Versatile Monocular Visual-Inertial State Estimator

C++ 5,004 2,098 Updated Aug 14, 2024

An OpenCV based implementation of Monocular Visual Odometry

C++ 778 294 Updated Apr 18, 2017

A reimplementation of LiteFlowNet in PyTorch that matches the official Caffe version

Python 407 80 Updated Mar 1, 2024

Deep learning for image processing, including classification, object detection, etc.

Python 22,954 7,976 Updated Jul 25, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,598 447 Updated Jul 30, 2024

The first challenge on short-form video quality assessment

Python 60 Updated Aug 28, 2024

A Deep Learning based No-reference Quality Assessment Model for UGC Videos

Python 58 8 Updated Mar 8, 2023

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 681 42 Updated Jul 30, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 2,934 360 Updated Oct 29, 2024

This is a collection of our NAS and Vision Transformer work.

Python 1,675 227 Updated Jul 25, 2024

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 607 36 Updated Oct 14, 2024

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Python 785 62 Updated Jun 2, 2024

The first international standard for image aesthetics assessment metadata.

12 1 Updated Feb 8, 2024

End-to-end learning of deep visual representations for image retrieval

Python 644 101 Updated May 19, 2021

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,672 691 Updated Jul 25, 2024

A PyTorch implementation of our method from "An Integrated System for Spatio-Temporal Summarization of 360-degrees Videos", Proc. MMM 2024

Python 4 2 Updated May 30, 2024

Get hundreds of millions of image+url pairs from the Crawling at Home dataset and preprocess them

Python 204 20 Updated May 26, 2024

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 873 122 Updated Apr 12, 2024

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,058 3,688 Updated Jul 4, 2024

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,398 209 Updated Apr 15, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,482 462 Updated Aug 6, 2024

[NAACL 2022] Mobile Text-to-Image search powered by multimodal semantic representation models (e.g., OpenAI's CLIP)

Swift 120 10 Updated May 11, 2023

Authors' official PyTorch implementation of "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]

Python 205 39 Updated Mar 6, 2024

Official PyTorch Implementation of Correlation Verification for Image Retrieval, CVPR 2022 (Oral Presentation)

Python 176 11 Updated Aug 21, 2023

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 284 25 Updated Jun 6, 2024

[CVPR 2023] DepGraph: Towards Any Structural Pruning

Python 2,688 331 Updated Oct 15, 2024

Disk code release

Python 317 46 Updated Dec 15, 2023

CNN Image Retrieval in PyTorch: Training and evaluating CNNs for Image Retrieval in PyTorch

Python 1,430 323 Updated May 13, 2024

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Python 3,382 328 Updated Jun 20, 2024

Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022

Jupyter Notebook 2,306 361 Updated May 31, 2024