Python 109 9 Updated Jun 18, 2024

AICI: Prompts as (Wasm) Programs

Rust 1,841 74 Updated Jun 29, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,521 282 Updated Jun 29, 2024

Blazingly 🔥 fast 🚀 memory vulnerabilities, written in 100% safe Rust. 🦀

Rust 3,690 94 Updated Apr 18, 2024

NVIDIA Linux open GPU with P2P support

C 770 56 Updated Jun 7, 2024

Lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,709 480 Updated Jun 30, 2024

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 12,075 781 Updated Jun 30, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 10,978 744 Updated Jun 21, 2024

Extremely simple implementation of path patching (aka causal scrubbing) in PyTorch.

Jupyter Notebook 4 Updated Oct 5, 2023

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions

Python 538 45 Updated Jun 17, 2024

If tinygrad wasn't small enough for you...

Python 613 85 Updated Mar 9, 2024

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 698 32 Updated Nov 4, 2023

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

HTML 167 76 Updated Feb 7, 2024

Python 6 3 Updated Nov 15, 2022

Named tensors with first-class dimensions for PyTorch

Jupyter Notebook 321 12 Updated Jun 14, 2023

Must-read Papers on Textual Adversarial Attack and Defense

Python 1,469 193 Updated Apr 12, 2024

🦜🔗 Build context-aware reasoning applications

Python 88,260 13,836 Updated Jun 30, 2024

A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick

Python 282 25 Updated Nov 25, 2023

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 7,960 563 Updated Jun 27, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,147 238 Updated Jun 30, 2024

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )

Cuda 202 33 Updated May 12, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 11,944 823 Updated Jun 11, 2024

Supporting code for diagnostic seals paper

Python 2 Updated Dec 3, 2020

The Operating System

Assembly 224 1 Updated Nov 7, 2023

An engine + analysis interface for duck chess

Rust 17 Updated Jun 4, 2024

Implementation of Invariant Point Attention, used for coordinate refinement in the structure module of AlphaFold2, as a standalone PyTorch module

Python 143 9 Updated Nov 25, 2022

Duelyst is a digital collectible card game and turn-based strategy hybrid, developed by Counterplay Games.

JavaScript 3,592 554 Updated May 8, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,385 466 Updated Jan 8, 2024

Cinder is Meta's internal performance-oriented production version of CPython.

Python 3,411 122 Updated Jun 28, 2024