Lists (10)
Sort Name ascending (A-Z)
Stars
This is a warehouse for MobileNetV4-Pytorch-model, can be used to train your image-datasets for vision tasks.
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
There are 50 Visualizations which can you to finish 7 different purposes of data analysis.
A synthetic data generator for text recognition
[TCSVT2023] [LASNet] RGB-T Semantic Segmentation with Location, Activation, and Sharpening
Unaligned RGB-T Semantic Segmentation
Image translation from Nighttime thermal infrared images to Daytime color images.
[ACMMM 23] Official implementation of Object Segmentation by Mining Cross-Modal Semantics (First Uniformed model for SOD and/or COD with pseudo depth and/or misaligned thermal images)
RGB-T Fusion, RGB-T SOD, RGB-T Vehicle Detection, RGB-T Crowd Counting, RGB-T Pedestrian Detection, RGB-T Semantic Segmeantaion, RGB-T Tracking
Registration for power equipment infrared and visible images
A robust registration method for UAV thermal infrared and visible images taken by the camera with two sensors
[IJCAI2022 Oral] Unsupervised Misaligned Infrared and Visible Image Fusion via Cross-Modality Image Generation and Registration
[IEEE TMM 23] Focal Inverse Distance Transform Maps for Crowd Localization
A collection of deep learning based RGB-T-Fusion methods, codes, and datasets. The main directions involved are Multispectral Pedestrian Detection, RGB-T Aerial Object Detection, RGB-T Semantic Seg…
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
The official implementation of the crowd counting model CLIP-EBC.
ImageBind One Embedding Space to Bind Them All
The official codes for the ICCV2021 Oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
我的 ComfyUI 工作流合集 | My ComfyUI workflows collection
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Code for CVPR2024 'Segment Every Out-of-Distribution Object '
Use visible and infrared images to train the network. This method is better to face the dark environment.
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
[ICCV 2023] Deep Active Contours for Real-time 6-DoF Object Tracking
汉字字形/拼音/语义相似度(单字, 可用于数据增强, CSC错别字检测识别任务(构建混淆集)) Chinese character font/pinyin/semantic similarity (single character, can be used for data augmentation, CSC misclassified character detection and rec…
🇨🇳最全最新中国【省、市、区县、乡镇街道】json,csv,sql数据