Block or Report
Block or report amalsu0
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (1)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
Visual tracking library based on PyTorch.
PyTorch implementation of some attentions for Deep Learning Researchers.
Collection of common code that's shared among different research projects in FAIR computer vision team.
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
A best practice for deep learning project template architecture.
DataCV2024: The 2nd DataCV Challenge in conjunction with the CVPR 2024 Visual Dataset Understanding workshop
Social Link Inference via Multiview Matching Network From Spatiotemporal Trajectories
Multiview matching with deep-learning and hand-crafted local features for COLMAP and other SfM software. Supports high-resolution formats and images with rotations. Both CLI and GUI are supported.
A collection of educational notebooks on multi-view geometry and computer vision.
Rectify and stitch images together using multiview geometry.
[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking
Real time one-stage multi-class & multi-object tracking based on anchor-free detection and ReID
Best Practices, code samples, and documentation for Computer Vision.
Simultaneous object detection and tracking using center points.
[CVPR2019] Fast Online Object Tracking and Segmentation: A Unifying Approach
Simple, online, and realtime tracking of multiple objects in a video sequence.
SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Single and multiple view camera calibration tool
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. The dataset is presented with a teachable object …
A Python toolbox for conformal prediction research on deep learning models, using PyTorch.
Interactively explore unstructured datasets from your dataframe.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A curated list of Large Language Model (LLM) Interpretability resources.
✨✨Latest Advances on Multimodal Large Language Models