University of Washington
Seattle
Stars
llama3 implementation one matrix multiplication at a time
Neural Networks: Zero to Hero
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Bidirectional C++/Python converters for ROS messages, using pybind11.
C++ High Performance Second Edition, published by Packt
rpclib is a modern C++ msgpack-RPC server and client library
The first competitive instance segmentation approach that runs on small edge devices at real-time speeds.
In defence of metric learning for speaker recognition
A simple, fully convolutional model for real-time instance segmentation.
Lab files, solutions, and notes for Computer Systems: A Programmer's Perspective, 3rd Edition
Cross-platform, customizable ML solutions for live and streaming media.
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
A virtual world where Autonomous Systems from different Formula Student teams can compete in time-trial challenges
A C++ wrapper for a Hungarian algorithm implementation
This repository is for my YouTube video series about optimizing a TensorFlow deep learning model using TensorRT. We demonstrate optimizing a LeNet-like model and a YOLOv3 model, and get 3.7x and 1.5x faster…
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Model Predictive Contouring Controller (MPCC) for Autonomous Racing
This repository is a collection of scripts/programs I use to set up the software development environment on my Jetson Nano, TX2, and Xavier NX.