sudhakarsingh27

Follow

Sudhakar Singh sudhakarsingh27

Follow

LLMs, Parallel Systems, Quantum Computing

8 followers · 17 following

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Block or Report

Block or report sudhakarsingh27

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Pinned

NVIDIA/Megatron-LM NVIDIA/Megatron-LM Public

Ongoing research training transformer models at scale

Python 8.9k 2k
NVIDIA/TransformerEngine NVIDIA/TransformerEngine Public

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1.5k 231
lit-llama lit-llama Public

Forked from Lightning-AI/lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python
huggingface/accelerate huggingface/accelerate Public

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7.2k 838
state-spaces/mamba state-spaces/mamba Public

Python 10.2k 798
NVIDIA/NeMo NVIDIA/NeMo Public

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 10.3k 2.2k