I am ZHANG SHANYUN,a student in SUN YAT-SEN UNIVERSITY, "I am eager to learn on GitHub and contribute my part to this community."
-
SUN YAT-SEN UNIVERSITY
- China(GuangDong)
-
10:33
(UTC -12:00)
Highlights
- Pro
Stars
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs
A benchmark and analysis for fine-grained visual comprehension (FGVC) tasks in large vision language models (LVLMs).
[ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection
SamSamhuns / yolov5_adversarial
Forked from ultralytics/yolov5Generate adversarial patches against YOLOv5 🚀
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-Time Object Detection
一些关于目标检测的脚本的改进思路代码,详细请看readme.md