Block or Report
Block or report trivedisarthak
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
yermandy / ultralytics
Forked from ultralytics/ultralyticsNEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
Continuation of an abandoned project fast-coco-eval
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark