Stars
✨✨Latest Advances on Multimodal Large Language Models
Implementation of Nougat: Neural Optical Understanding for Academic Documents
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Segment-Anything + 3D. Let's lift anything to 3D.
SIGGRAPH Asia 2022: Code for "Efficient Neural Radiance Fields for Interactive Free-viewpoint Video"
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Official PyTorch implementation of "Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose", ECCV 2020
🚀 General data management framework; objects are pages