Skip to content
View sashank06's full-sized avatar

Block or report sashank06

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 31,720 3,769 Updated Nov 8, 2024

Data and tools for generating and inspecting OLMo pre-training data.

Python 979 108 Updated Nov 8, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,610 469 Updated Nov 9, 2024

Examples in the MLX framework

Python 6,177 873 Updated Nov 9, 2024

A curated list of engineering blogs

Ruby 31,677 1,628 Updated Aug 21, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 937 54 Updated Jan 30, 2024

LLM inference in C/C++

C++ 67,508 9,693 Updated Nov 9, 2024

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Python 3,236 641 Updated Feb 7, 2024

A modular RL library to fine-tune language models to human preferences

Python 2,210 191 Updated Mar 1, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,116 2,505 Updated Aug 15, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,613 613 Updated Nov 5, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,211 5,922 Updated Aug 19, 2024

a web ui & api for 🤗 diffusers

Python 584 86 Updated Jun 4, 2023

A prize for finding tasks that cause large language models to show inverse scaling

597 25 Updated Oct 11, 2023

DALL·E Mini - Generate images from a text prompt

Python 14,750 1,208 Updated Nov 9, 2023

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Python 14 3 Updated Dec 22, 2022

ACL 2022

Python 124 21 Updated Dec 7, 2023

Repo for external large-scale work

Python 6,514 725 Updated Apr 27, 2024
Python 48 5 Updated Feb 5, 2023

Succinct Data Structure Library 2.0

C++ 2,213 351 Updated Jun 2, 2023

Search Engines with Autoregressive Language models

Python 277 24 Updated Apr 4, 2023

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 3,889 1,216 Updated Aug 21, 2024

Graph Data Augmentation Library for PyTorch Geometric

Python 128 7 Updated Aug 17, 2022
Python 131 17 Updated Jul 5, 2023

Automatic metrics for GEM tasks

Python 61 20 Updated Oct 25, 2022

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 4,931 381 Updated Mar 17, 2024

RAPH - Reinforcement Agent Playing netHack

Python 3 Updated Mar 21, 2022

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Python 58 15 Updated Jan 3, 2023

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 4,764 842 Updated Mar 5, 2024
Next