Official inference library for Mistral models

Jupyter Notebook 9,376 817 Updated Jul 24, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 29,337 2,685 Updated Jul 31, 2024

Simulated annealing for neural networks with JAX.

Python 4 2 Updated Sep 21, 2022

Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1

Lua 292 94 Updated Nov 10, 2021

Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1

Python 1,040 347 Updated Nov 28, 2018

MNIST: data files and images

16 11 Updated Jul 14, 2020

Efficient utility of image similarity using deep neural network and deep learning.

Python 221 43 Updated Aug 4, 2019

SMDK, Scalable Memory Development Kit, is developed for Samsung CXL (Compute Express Link) Memory Expander to enable a full-stack Software-Defined Memory system

C 270 61 Updated Jun 26, 2024

Run generative AI models on Sophgo BM1684X

Python 84 15 Updated Jul 30, 2024
C++ 303 48 Updated Jul 30, 2024
C 269 29 Updated Jul 30, 2024

Open source FPGA-based NIC and platform for in-network compute

Verilog 1,582 395 Updated Jul 5, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,487 231 Updated May 1, 2024

Grok open release

Python 49,224 8,311 Updated May 29, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,251 5,450 Updated Jul 19, 2024
Python 48 7 Updated Jul 30, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,119 2,316 Updated Jul 31, 2024

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 243 24 Updated Jul 31, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,317 247 Updated Jul 30, 2024

A repository dedicated to evaluating the performance of quantized LLaMA3 using various quantization methods.

Python 140 5 Updated May 27, 2024

LLM training in simple, raw C/HIP for AMD GPUs

Cuda 24 2 Updated Jul 30, 2024

Implementation for MatMul-free LM.

Python 2,778 169 Updated Jun 27, 2024

Foundation Architecture for (M)LLMs

Python 2,979 201 Updated Apr 11, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,283 2,458 Updated Jul 15, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,057 5,451 Updated Jun 11, 2024

AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :)

Python 18 5 Updated Jul 18, 2024

A large-scale simulation framework for LLM inference

Python 165 18 Updated Jul 28, 2024

A Gradio web UI for Large Language Models.

Python 38,736 5,101 Updated Jul 29, 2024

Fast and memory-efficient exact attention

Python 12,679 1,133 Updated Jul 30, 2024

PygmalionAI's large-scale inference engine

Python 823 94 Updated Jul 29, 2024