🎯
Focusing
Block or Report
Block or report theoajc
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Tensors and Dynamic neural networks in Python with strong GPU acceleration