Skip to content
View hvaara's full-sized avatar
:shipit:
:shipit:

Organizations

@InteractiveBrokers @EpicGames @oslojs @plutusfund @stableinfra
Block or Report

Block or report hvaara

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fast and memory-efficient exact attention

Python 12,644 1,129 Updated Jul 27, 2024

Transformer related optimization, including BERT, GPT

C++ 5,690 878 Updated Mar 27, 2024

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capa…

Python 6,622 707 Updated Jul 28, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

283 12 Updated Jul 24, 2024

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,137 457 Updated Jul 29, 2024

Tensor library for machine learning

C++ 10,447 968 Updated Jul 28, 2024

Agentic components of the Llama Stack APIs

Python 2,424 233 Updated Jul 29, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,240 1,464 Updated Jul 29, 2024

Portable Text is a JSON based rich text specification for modern content editing platforms.

1,248 21 Updated Jul 25, 2024

Utilities intended for use with Llama models.

Python 2,757 353 Updated Jul 29, 2024

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 130 21 Updated Jul 28, 2024

A collective list of free APIs

Python 307,212 32,886 Updated Jul 24, 2024

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 4,926 226 Updated Jul 28, 2024

Turn expensive prompts into cheap fine-tuned models

TypeScript 2,445 124 Updated May 25, 2024

Python Algorithms for Randomized Linear Algebra

Python 41 5 Updated May 3, 2023

Matlab Algorithms for Randomized Linear Algebra

MATLAB 14 3 Updated Jun 30, 2023

A massively parallel, high-level programming language

Rust 16,927 413 Updated Jul 27, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 2,416 165 Updated Jul 25, 2024

A massively parallel, optimal functional runtime in Rust

Cuda 10,334 389 Updated Jul 25, 2024

Temporary repository for Kind2's refactor based on HVM2

Rust 252 24 Updated Jul 23, 2024

LLM101n: Let's build a Storyteller

25,899 1,374 Updated Jul 29, 2024

Inference code for Llama models

Python 54,699 9,375 Updated Jul 25, 2024

The official Meta Llama 3 GitHub site

Python 24,835 2,709 Updated Jul 28, 2024

RuLES: a benchmark for evaluating rule-following in language models

Python 202 15 Updated Jun 21, 2024

Implementation for MatMul-free LM.

Python 2,770 169 Updated Jun 27, 2024

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Python 6,329 438 Updated Jul 29, 2024

Self-hosted AI coding assistant

Rust 20,139 918 Updated Jul 28, 2024

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 33,263 3,693 Updated Jul 29, 2024

gpt-2 from scratch in mlx

Python 334 22 Updated Jun 12, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 720 39 Updated Jul 11, 2024
Next