Stars
Entropy Based Sampling and Parallel CoT Decoding
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A high-throughput and memory-efficient inference and serving engine for LLMs
You like pytorch? You like micrograd? You love tinygrad! ❤️
Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
Code for fine-tuning Platypus fam LLMs using LoRA
Accessible large language models via k-bit quantization for PyTorch.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
nannyml: post-deployment data science in python
Implementing RASP transformer programming language https://arxiv.org/pdf/2106.06981.pdf.
An example cohttp server w/ dockerfile for deploying to now.sh
Approximate nearest neighbor search with product quantization on GPU in pytorch and cuda
A list of semi to fully remote-friendly companies (jobs) in tech.
BlackJAX is a Bayesian Inference library designed for ease of use, speed and modularity.
A collection of infrastructure and tools for research in neural network interpretability.
☁️ Build multimodal AI applications with cloud-native stack
Transformer-based models for Natural Language Processing in OCaml
Tiled scrollable window management for Gnome Shell
Research language for array processing in the Haskell/ML family
Full text geoparsing as a Python library
Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).