Stars
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Efficient Triton Kernels for LLM Training
[CVPR 2024 Highlight✨] Official PyTorch Code for EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Causal depthwise conv1d in CUDA, with a PyTorch interface
Unofficial reimplementation of ViR: Vision Retention Networks by Hatamizadeh et al. (https://arxiv.org/abs/2310.19731)
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionne…
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Making large AI models cheaper, faster and more accessible
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Official PyTorch implementation of Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion (CVPR 2020)
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
TensorFlow 2.X reimplementation of Global Context Vision Transformers, Ali Hatamizadeh, Hongxu (Danny) Yin, Jan Kautz, Pavlo Molchanov.
TensorFlow 2.0 Implementation of GCViT: Global Context Vision Transformer
Keras (TensorFlow v2) reimplementation of Global Context Vision Transformer models
Official Repository for Deep Active Lesion Segmentation (DALS)
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs https://arxiv.org/abs/2112.07804
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
NVIDIA Clara Viz is a platform for visualization of 2D/3D medical imaging data
Segmentation deep learning ALgorithm based on MONai toolbox: single and multi-label segmentation software developed by QIMP team-Vienna.
[ICCV 2021] Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation