Stars
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Acceptance rates for the major AI conferences
Human Trajectory Prediction Dataset Benchmark (ACCV 2020)
Code and GMVD Dataset for "Bringing Generalization to Deep Multi-view Pedestrian Detection". Accepted at WACV 2023 Workshop (Real-World Surveillance: Applications and Challenges).
Generalized Multi-View Detection (GMVD) dataset curated using GTA V and Unity. Accepted at WACV 2023 Workshop (Real-World Surveillance: Applications and Challenges).
Official Code for "Lifting Multi-View Detection and Tracking to the Bird’s Eye View"
Official Code for "EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye View"
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Example deep learning projects that use wandb's features.
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Torchmetrics - Machine learning metrics for distributed, scalable PyTorch applications.
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
Spatio-Temporal Action Localization System
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
[ICCV 2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
The AVA dataset densely annotates 80 atomic visual actions in 351k movie clips with actions localized in space and time, resulting in 1.65M action labels with multiple labels per human occurring fr…
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
This repository provides evaluation code for the paper "Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts."
Medical image captioning using OpenAI's CLIP
High-Resolution Image Synthesis with Latent Diffusion Models
A latent text-to-image diffusion model
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
An open source implementation of CLIP.
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
A Unified Toolbox for Object Perception & Application