arawxx / FSDP-Distributed-Training-of-ConvNextV2-on-CIFAR10 Star 6 Code Issues Pull requests A script for training the ConvNextV2 on CIFAR10 dataset using the FSDP technique for a distributed training scheme. deep-learning distributed-computing pytorch distributed distributed-training distributed-learning convnext convnextv2 fsdp fully-sharded-data-parallel Updated Dec 11, 2023 Python
ridwan-salau / transformer-xl Star 0 Code Issues Pull requests Fully Sharded Data Parallel (FSDP) implementation of Transformer XL pytorch transformer fsdp fully-sharded-data-parallel Updated Apr 24, 2023 Python