Stars
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Code Release for MViTv2 on Image Recognition.
A deep learning library for video understanding research.
A collection of papers on transformers for vision. Awesome Transformer with Computer Vision (CV)
Efficient 3D Backbone Network for Temporal Modeling
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
A Simple and Versatile Framework for Object Detection and Instance Recognition
📖 A curated list of resources dedicated to Natural Language Processing (NLP)
A curated list of action recognition and related area resources
FBN: Factorized Bilinear Models for Image Recognition (ICCV 2017)
MXNet Code For Demystifying Neural Style Transfer (IJCAI 2017)
FH by Neighbor Embedding Facial Components (ICIP 2015)
Implementation of "Hallucinating Face by Eigentransformation"
Neighborhood Regression for Edge-Preserving Image Super-Resolution (ICASSP 2015)
A curated list of deep learning resources for computer vision
Master the command line, in one page
Quickly download, clean up, and install public datasets into a database management system