Stars
This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"
Official repository for TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Official repository for VIGOR : Cross-View Image Geo-localization beyond One-to-one Retrieval
Lending Orientation to Neural Networks for Cross-view Geo-localization
A Survey on Vision-Language Geo-Foundation Models (VLGFMs)
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
MOT using deepsort and yolov3 with pytorch
Generative Models by Stability AI
Official code repository for ICLR 2024 paper "DiffusionSat: A Generative Foundation Model for Satellite Imagery"
The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"
An open source implementation of CLIP.
YOLOv10: Real-Time End-to-End Object Detection
🔥🔥🔥 专注于YOLOv5,YOLOv7、YOLOv8、YOLOv9改进模型,Support to improve backbone, neck, head, loss, IoU, NMS and other modules🚀
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
High-Resolution Image Synthesis with Latent Diffusion Models
Implementation of paper - DEYO: DETR with YOLO for End-to-End Object Detection
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-Time Object Detection
Efficient computing methods developed by Huawei Noah's Ark Lab
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
QLoRA: Efficient Finetuning of Quantized LLMs
[CVPR 2024] 🎬💭 chat with over 10K frames of video!