Block or Report
Block or report kozuch
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Official inference library for Mistral models
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Simulated annealing for neural networks with JAX.
Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1
Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1
Efficient utility of image similarity using deep neural network and deep learning.
SMDK, Scalable Memory Development Kit, is developed for Samsung CXL(Compute Express Link) Memory Expander to enable full-stack Software-Defined Memory system
Open source FPGA-based NIC and platform for in-network compute
Code examples and resources for DBRX, a large language model developed by Databricks
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
A fast inference library for running LLMs locally on modern consumer-class GPUs
A repository dedicated to evaluating the performance of quantizied LLaMA3 using various quantization methods..
anthonix / llm.c
Forked from karpathy/llm.cLLM training in simple, raw C/HIP for AMD GPUs
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Code for the paper "Language Models are Unsupervised Multitask Learners"
AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :)
A large-scale simulation framework for LLM inference
A Gradio web UI for Large Language Models.
Fast and memory-efficient exact attention
PygmalionAI's large-scale inference engine