View alexandrnikitin's full-sized avatar

Organizations

@nsubstitute


Blazingly fast LLM inference.

Rust 3,281 238 Updated Aug 28, 2024

📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

2,318 149 Updated Aug 28, 2024

Module, Model, and Tensor Serialization/Deserialization

Python 167 23 Updated Aug 26, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 16,149 1,251 Updated Aug 28, 2024

🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.

Shell 127 5 Updated Jul 25, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 6,864 511 Updated Aug 18, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 29,709 3,652 Updated Aug 27, 2024

Helping developers to use AWS Graviton2, Graviton3, and Graviton4 processors which power the 6th, 7th, and 8th generation of Amazon EC2 instances (C6g[d], M6g[d], R6g[d], T4g, X2gd, C6gn, I4g, Im4g…

Python 857 190 Updated Aug 26, 2024

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,518 1,191 Updated Apr 7, 2024

Solve puzzles. Improve your PyTorch.

Jupyter Notebook 3,006 245 Updated Jul 15, 2024

TextAugment: Text Augmentation Library

Python 388 60 Updated Feb 20, 2024

Reverse Engineering: Decompiling Binary Code with Large Language Models

Python 2,879 209 Updated Aug 16, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,416 2,469 Updated Aug 28, 2024

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Python 573 82 Updated Aug 14, 2024

Official implementation of Half-Quadratic Quantization (HQQ)

Python 648 62 Updated Aug 28, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 5,960 602 Updated Aug 26, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,610 1,502 Updated Aug 26, 2024

FauxPilot - an open-source alternative to GitHub Copilot server

Python 14,510 618 Updated Apr 9, 2024

Run VSCode (codeserver) on Google Colab or Kaggle Notebooks

Python 2,061 274 Updated May 31, 2023

CUDA Core Compute Libraries

C++ 1,086 127 Updated Aug 28, 2024

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,432 447 Updated Aug 28, 2024

🌐 The Internet OS! Free, Open-Source, and Self-Hostable.

JavaScript 24,190 1,557 Updated Aug 28, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 9,495 944 Updated Aug 28, 2024

Inference code for Llama models

Python 55,253 9,413 Updated Aug 18, 2024

Collection of PyTorch Lightning tutorials in the form of rich scripts automatically transformed to IPython notebooks.

Python 279 78 Updated Aug 1, 2024

Deep Learning Fundamentals -- Code material and exercises

Jupyter Notebook 327 156 Updated Feb 28, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 5,937 514 Updated Aug 22, 2024

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Python 27,844 3,335 Updated Aug 22, 2024

A text augmentation tool for named entity recognition.

Python 53 2 Updated Jul 22, 2021

Data augmentation for NLP

Jupyter Notebook 4,394 460 Updated Jun 24, 2024