Starred repositories
Virtual whiteboard for sketching hand-drawn like diagrams
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
Source code examples from the Parallel Forall Blog
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
4 bits quantization of LLaMA using GPTQ
Reverse engineered API of Microsoft's Bing Chat AI
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
GLIDE: a diffusion-based text-conditional image synthesis model
fast-stable-diffusion + DreamBooth
Fast and memory-efficient exact attention
Simple samples for TensorRT programming
TensorFlow code and pre-trained models for BERT
A toolkit showing GPU's all-round capability in video processing
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Transformer related optimization, including BERT, GPT
How to export PyTorch models with unsupported layers to ONNX and then to Intel OpenVINO
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Tutorial for Using Custom Layers with OpenVINO (Intel Deep Learning Toolkit)
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Group Fisher Pruning for Practical Network Compression(ICML2021)
More practical frame interpolation approach.
[ICCV 2021, Oral 3%] Official repository of XVFI
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation