Block or Report
Block or report cm2435
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
ori-edge / BeFOri
Forked from ray-project/llmperfLLMPerf is a library for validating and benchmarking LLMs
P2P Docker registry capable of distributing TBs of data in seconds
This guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
[ACL 2024] ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Official implementation of Half-Quadratic Quantization (HQQ)
Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Terraform-based setup for a production-grade ECS on Fargate setup on AWS.
Module, Model, and Tensor Serialization/Deserialization
A list of awesome compiler projects and papers for tensor computation and deep learning.
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors, now in oobabooga text generation webui!
The nnsight package enables interpreting and manipulating the internals of deep learned models.
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
🧊 LLM-Observability for Developers. The open-source platform for logging, monitoring, and debugging.
EmbeddedLLM / vllm-rocm
Forked from vllm-project/vllmvLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
Studying the variance in neural net predictions across training time
Resources from the EleutherAI Math Reading Group