  • stealth
  • London, UK

LLMPerf is a library for validating and benchmarking LLMs

Python 6 Updated Jul 19, 2024

P2P Docker registry capable of distributing TBs of data in seconds

Go 6,005 412 Updated Jul 23, 2024

Cataloging released Triton kernels.

68 2 Updated May 30, 2024

function calling-based LLM agents

Python 258 23 Updated Jun 24, 2024
Python 432 52 Updated Jul 22, 2024

Using graph network to solve PDEs

Python 317 82 Updated Aug 15, 2023

This guide should help fellow researchers and hobbyists easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.

Shell 803 114 Updated Oct 3, 2022

[ACL 2024] ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis.

Python 37 2 Updated Jul 12, 2024

perfect programming language

10,987 338 Updated Jul 26, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,803 803 Updated Jul 1, 2024
Rust 256 18 Updated Jul 23, 2024

Official implementation of Half-Quadratic Quantization (HQQ)

Python 580 55 Updated Jul 21, 2024
Python 4 11 Updated May 27, 2024

Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.

Python 193 40 Updated Apr 19, 2024

Tile primitives for speedy kernels

Cuda 1,411 51 Updated Jul 26, 2024

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Python 355 12 Updated Jul 19, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 13,918 1,251 Updated Jul 26, 2024

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Python 1,194 77 Updated Jul 26, 2024

Terraform-based configuration for a production-grade ECS on Fargate deployment on AWS.

HCL 20 7 Updated Apr 25, 2023

Module, Model, and Tensor Serialization/Deserialization

Python 155 25 Updated Jul 18, 2024

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,274 289 Updated Jul 14, 2024

Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors, now in oobabooga text generation webui!

Python 38 2 Updated Mar 21, 2024

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Jupyter Notebook 290 31 Updated Jul 26, 2024

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

Python 503 50 Updated Jul 6, 2024

🧊 LLM-Observability for Developers. The open-source platform for logging, monitoring, and debugging.

TypeScript 1,510 160 Updated Jul 26, 2024

vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs

Python 85 5 Updated Jul 26, 2024

Puzzles for learning Triton

Jupyter Notebook 886 51 Updated Jul 17, 2024

Studying the variance in neural net predictions across training time

Python 3 Updated Apr 24, 2024

Resources from the EleutherAI Math Reading Group

Julia 47 1 Updated Jun 23, 2024