Highlights
- Pro
Block or Report
Block or report stefan-ainetter
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Implementation of Segformer, Attention + MLP neural network for segmentation, in Pytorch
Official PyTorch implementation of SegFormer
Few-Shot Panoptic Segmentation With Foundation Models
Annotations for the ScanNet dataset generated using scannotate and HOC-Search.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
[CVPR2019 Oral] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation on Python3, Tensorflow, and Keras
Repository for WACV23 paper "Automatically Annotating Indoor Images with CAD Models via RGB-D Scans"
Implementation of Grondin et al. 2022 "Tree Detection and Diameter Estimation Based on Deep Learning". Also includes datasets and some of the pretrained models.
Code repository for paper Instance Segmentation for Autonomous Log Grasping in Forestry Operations
Inpaint anything using Segment Anything and inpainting models.
Bringing Old Photo Back to Life (CVPR 2020 oral)
Easiest 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, a…
High-Resolution Image Synthesis with Latent Diffusion Models
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A latent text-to-image diffusion model
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
This repository contains the code for the paper "Occupancy Networks - Learning 3D Reconstruction in Function Space"
Repo for visualization of MCSS outputs and its evaluation
Official implementation of the NeurIPS 2021 paper "Panoptic 3D Scene Reconstruction from a Single RGB Image"