sashank06

Sashank Santhanam sashank06

PhD Candidate focused on Dialogue System and Cognitive Architectures

18 followers · 46 following

Charlotte, NC
https://sashank06.github.io

Achievements

Stars

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 31,720 3,769 Updated Nov 8, 2024

allenai / dolma

Data and tools for generating and inspecting OLMo pre-training data.

Python 979 108 Updated Nov 8, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,610 469 Updated Nov 9, 2024

ml-explore / mlx-examples

Examples in the MLX framework

Python 6,177 873 Updated Nov 9, 2024

kilimchoi / engineering-blogs

A curated list of engineering blogs

Ruby 31,677 1,628 Updated Aug 21, 2024

Liuhong99 / Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 937 54 Updated Jan 30, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 67,508 9,693 Updated Nov 9, 2024

facebookresearch / esm

Evolutionary Scale Modeling (esm): Pretrained language models for proteins

Python 3,236 641 Updated Feb 7, 2024

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

Python 2,210 191 Updated Mar 1, 2024

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,116 2,505 Updated Aug 15, 2024

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,613 613 Updated Nov 5, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,211 5,922 Updated Aug 19, 2024

abhishekkrthakur / diffuzers

a web ui & api for 🤗 diffusers

Python 584 86 Updated Jun 4, 2023

amazon-science / alexa-teacher-models

Python 363 27 Updated Apr 9, 2023

inverse-scaling / prize

A prize for finding tasks that cause large language models to show inverse scaling

597 25 Updated Oct 11, 2023

borisdayma / dalle-mini

DALL·E Mini - Generate images from a text prompt

Python 14,750 1,208 Updated Nov 9, 2023

akashkm99 / duelnlg

Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Python 14 3 Updated Dec 22, 2022

albertkx / Berkeley-Crossword-Solver

ACL 2022

Python 124 21 Updated Dec 7, 2023

facebookresearch / metaseq

Repo for external large-scale work

Python 6,514 725 Updated Apr 27, 2024

McGill-NLP / FaithDial

Python 48 5 Updated Feb 5, 2023

simongog / sdsl-lite

Succinct Data Structure Library 2.0

C++ 2,213 351 Updated Jun 2, 2023

facebookresearch / SEAL

Search Engines with Autoregressive Language models

Python 277 24 Updated Apr 4, 2023

nlp-with-transformers / notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 3,889 1,216 Updated Aug 21, 2024

rish-16 / grafog

Graph Data Augmentation Library for PyTorch Geometric

Python 128 7 Updated Aug 17, 2022

salesforce / Converse

Python 131 17 Updated Jul 5, 2023

GEM-benchmark / GEM-metrics

Automatic metrics for GEM tasks

Python 61 20 Updated Oct 25, 2022

salesforce / CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 4,931 381 Updated Mar 17, 2024

dmitrySorokin / raph

RAPH - Reinforcement Agent Playing netHack

Python 3 Updated Mar 21, 2022

maciej-sypetkowski / autoascend

The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge

Python 58 15 Updated Jan 3, 2023

alirezadir / Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 4,764 842 Updated Mar 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sashank Santhanam sashank06

Achievements

Achievements

Block or report sashank06

Stars

rasbt / LLMs-from-scratch

allenai / dolma

allenai / OLMo

ml-explore / mlx-examples

kilimchoi / engineering-blogs

Liuhong99 / Sophia

ggerganov / llama.cpp

facebookresearch / esm

allenai / RL4LMs

karpathy / minGPT

facebookresearch / xformers

karpathy / nanoGPT

abhishekkrthakur / diffuzers

amazon-science / alexa-teacher-models

inverse-scaling / prize

borisdayma / dalle-mini

akashkm99 / duelnlg

albertkx / Berkeley-Crossword-Solver

facebookresearch / metaseq

McGill-NLP / FaithDial

simongog / sdsl-lite

facebookresearch / SEAL

nlp-with-transformers / notebooks

rish-16 / grafog

salesforce / Converse

GEM-benchmark / GEM-metrics

salesforce / CodeGen

dmitrySorokin / raph

maciej-sypetkowski / autoascend

alirezadir / Machine-Learning-Interviews