Stars
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
MulimgViewer is a multi-image viewer that opens multiple images in one interface, making image comparison and image stitching convenient.
Vision Conformer: Incorporating Convolutions into Vision Transformer Layers
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
Code & data accompanying the NeurIPS 2020 paper "Iterative Deep Graph Learning for Graph Neural Networks: Better and Robust Node Embeddings".
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Line Extraction in Handwritten Documents via Instance Segmentation
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
FFCV: Fast Forward Computer Vision (and other ML workloads!)