Skip to content
View okarthikb's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report okarthikb

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
21 stars written in Python
Clear filter

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 129,622 25,742 Updated Jul 22, 2024

Inference code for Llama models

Python 54,331 9,335 Updated Jul 19, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 29,246 2,678 Updated Jul 22, 2024

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Python 27,569 3,312 Updated Jul 22, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 25,221 2,783 Updated Jul 22, 2024

The official Meta Llama 3 GitHub site

Python 23,445 2,531 Updated Jul 17, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,344 3,320 Updated Jul 22, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,014 5,447 Updated Jun 11, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,499 2,417 Updated Apr 28, 2024

Ongoing research training transformer models at scale

Python 9,474 2,138 Updated Jul 22, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 7,605 970 Updated Jun 27, 2024

structured outputs for llms

Python 6,776 544 Updated Jul 22, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,724 981 Updated Jul 12, 2024

Use commands in English to control Blender with OpenAI's GPT-4

Python 4,313 307 Updated Jun 5, 2024

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,136 403 Updated Apr 24, 2023

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,329 247 Updated Jun 26, 2024

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 1,972 50 Updated Jun 15, 2024

Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/

Python 1,933 131 Updated Jul 22, 2024

Numerical differential equation solvers in JAX. Autodifferentiable and GPU-capable. https://docs.kidger.site/diffrax/

Python 1,329 122 Updated Jul 22, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,230 247 Updated Jul 18, 2024

GPS receiver from a raw antenna 🛰️

Python 280 15 Updated Apr 15, 2024