vgoklani

Vishal Goklani vgoklani

Interested in Deep Learning (self-supervised learning & LLMs), Astrophysics (exoplanets), and Cosmology (CMB).... I like to build things

150 followers · 695 following

New York, NY
@vgoklani_ai

Block or Report

Block or report vgoklani

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

meta-llama / llama-models

Utilities intended for use with Llama models.

Python 2,934 408 Updated Jul 31, 2024

HanGuo97 / flute

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

Cuda 103 3 Updated Jul 27, 2024

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 8,501 971 Updated Jul 31, 2024

KylinC / Llama-3-Distill

Distillation version of llama-68m only for MLsys research use.

Python 1 Updated Apr 25, 2024

pytorch / ao

Custom data types and layouts for training and inference

Python 449 57 Updated Jul 31, 2024

lessw2020 / Best-Deep-Learning-Optimizers

Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable

Jupyter Notebook 211 41 Updated Apr 4, 2021

vithursant / nanoGPT_mlx

Port of Andrej Karpathy's nanoGPT to Apple MLX framework.

Python 91 8 Updated Feb 12, 2024

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 19,092 2,895 Updated Jul 25, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

26,183 1,401 Updated Jul 29, 2024

neuralmagic / AutoFP8

Python 107 13 Updated Jul 23, 2024

OpenGVLab / OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 634 49 Updated Jul 24, 2024

melisa-writer / short-transformers

Prune transformer layers

Python 57 9 Updated May 30, 2024

likejazz / llama3.cuda

llama3.cuda is a pure C/CUDA implementation for Llama 3 model.

Cuda 276 18 Updated Jun 4, 2024

pytorch / torchtitan

A native PyTorch Library for large model training

Python 1,380 125 Updated Jul 31, 2024

mgmalek / efficient_cross_entropy

Python 51 5 Updated May 28, 2024

mgmalek / ring-attention

Python 3 Updated Apr 1, 2024

alessiodm / drl-zh

Deep Reinforcement Learning: Zero to Hero!

Jupyter Notebook 1,976 69 Updated Jul 6, 2024

brianhelba / zipfile-deflate64

Extract Deflate64 ZIP archives with Python's `zipfile` API.

Python 16 6 Updated Dec 5, 2023

commaai / controls_challenge

Python 66 76 Updated Jun 20, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 24,990 2,733 Updated Jul 28, 2024

BobMcDear / attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 425 18 Updated Jul 22, 2024

mistralai / mistral-common

Python 474 37 Updated Jul 29, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 22,399 2,484 Updated Jul 30, 2024

lweitkamp / triton_exercises

An introduction to programming in Triton

Python 1 Updated Jan 2, 2024

epfml / llm-baselines

Python 67 17 Updated Jun 18, 2024

pytorch-labs / applied-ai

Applied AI experiments and examples for PyTorch

Python 93 7 Updated Jul 2, 2024

mistralai-sf24 / hackathon

Python 451 38 Updated Apr 1, 2024

pytorch / torchtune

A Native-PyTorch Library for LLM Fine-tuning

Python 3,694 311 Updated Jul 31, 2024

PicoMLX / PicoMLXServer

The easiest way to run the fastest MLX-based LLMs locally

Swift 195 13 Updated Jul 10, 2024

cloneofsimo / fim-llama-deepspeed

Python 31 Updated Jan 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly