🙌 OpenHands: Code Less, Make More

Python 30,966 3,568 Updated Sep 4, 2024

A throughput-oriented high-performance serving framework for LLMs

Cuda 371 11 Updated Sep 2, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 375 26 Updated Sep 5, 2024

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,095 44 Updated Aug 9, 2024

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

Rust 867 38 Updated Aug 27, 2024

QQQ is an innovative and hardware-optimized W4A8 quantization solution.

Python 54 4 Updated Aug 26, 2024

Corpus of Te Reo derived from the New Zealand Hansard

Python 6 Updated Sep 22, 2021

The main repository for building Pascal-compatible versions of ML applications and libraries.

4 Updated Sep 3, 2024

RES-Q: Evaluating the Code-Editing Capability of Large Language Model Systems at the Repository Scale

Python 23 1 Updated Jun 28, 2024

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,035 141 Updated Aug 1, 2024

2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.

Jupyter Notebook 1,760 123 Updated Jun 27, 2024

A self-generalizing gradient boosting machine which doesn't need hyperparameter optimization

Rust 124 6 Updated Aug 4, 2024

Open-source project for data preparation, aimed at LLM application builders

Python 106 102 Updated Sep 4, 2024

Parallel S3 and local filesystem execution tool.

Go 2,559 223 Updated Jul 31, 2024

An easy-to-use LLM quantization and inference toolkit based on the GPTQ algorithm (weight-only quantization).

Python 81 15 Updated Aug 28, 2024

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

Python 542 42 Updated Sep 4, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 248 9 Updated Sep 4, 2024

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world GitHub Issues?

Python 1,715 288 Updated Sep 3, 2024

An amazing UI for OpenAI's ChatGPT (Website + Windows + MacOS + Linux)

TypeScript 7,977 2,695 Updated Aug 14, 2024

🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.

Shell 127 5 Updated Jul 25, 2024

Boosting 4-bit inference kernels with 2:4 Sparsity

Cuda 43 2 Updated Sep 4, 2024

Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent

Python 5 1 Updated Jul 27, 2024

Label, clean and enrich text datasets with LLMs.

Python 1,998 137 Updated Sep 2, 2024

A Python package for LLM dynamic routing through the Unify REST API.

Python 166 21 Updated Sep 4, 2024

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 739 46 Updated Sep 2, 2024