Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,480 152 Updated Aug 17, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,080 839 Updated Jul 1, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 710 40 Updated Sep 28, 2024

MLX: An array framework for Apple silicon

C++ 16,634 954 Updated Oct 6, 2024

Find all the fundamental UXI guidelines and pattern-based web components to build brand driven, consistent and intuitive designs for digital Porsche products.

TypeScript 477 24 Updated Oct 4, 2024

Python 27 6 Updated Sep 2, 2024

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,574 191 Updated Jul 28, 2024

Inference Llama 2 in one file of pure 🔥

Mojo 2,098 143 Updated May 21, 2024

Python pdb for multiple processes

Python 30 6 Updated Nov 5, 2022

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,711 454 Updated May 3, 2024

Inference code for CodeLlama models

Python 15,929 1,850 Updated Aug 12, 2024

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 191 13 Updated Jun 14, 2023

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 6,786 669 Updated Oct 5, 2024

commaVQ is a dataset of compressed driving video

Jupyter Notebook 289 46 Updated Jul 8, 2024

Tools for building GPU clusters

Shell 1,253 326 Updated Mar 8, 2024

LLMs for your CLI

Python 1,278 75 Updated May 29, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,087 137 Updated Oct 2, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,878 4,112 Updated Oct 6, 2024

The Memory layer for your AI apps

Python 22,165 2,034 Updated Oct 5, 2024

Write scalable load tests in plain Python 🚗💨

Python 24,681 2,960 Updated Oct 3, 2024

CUDA on non-NVIDIA GPUs

Rust 9,125 608 Updated Oct 6, 2024

It's React, but in Python

Python 7,852 315 Updated Jul 18, 2024

Implementation of Flash Attention in Jax

Python 189 23 Updated Mar 1, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 933 52 Updated Jan 30, 2024

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Python 767 60 Updated Jul 1, 2024

Tevatron - A flexible toolkit for neural retrieval research and development.

Python 494 94 Updated Aug 20, 2024

The Mojo Programming Language

Mojo 22,978 2,587 Updated Oct 5, 2024

Python 2,508 155 Updated Sep 24, 2024

Tiny data-over-sound library

C++ 1,943 156 Updated Sep 26, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,535 346 Updated Aug 8, 2024