Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)【安全加固,暂停交互,请耐心等待】
-
Updated
Aug 7, 2024 - Python
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)【安全加固,暂停交互,请耐心等待】
Automate browser-based workflows with LLMs and Computer Vision
Open source hardware and software platform to build a small scale self driving car.
The Open Source Framework for Machine Vision
TEN Agent is a world-class multimodal AI agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG.
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
High-performance Vision library in Python. Scale your research, not boilerplate.
Implementation of Bottleneck Transformer in Pytorch
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"
Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.
Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)
[ NeurIPS2021] This is an official implementation of our paper "HRFormer: High-Resolution Transformer for Dense Prediction".
Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)
Python optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Add a description, image, and links to the vision topic page so that developers can more easily learn about it.
To associate your repository with the vision topic, visit your repo's landing page and select "manage topics."