Skip to content
View pszemraj's full-sized avatar
Block or Report

Block or report pszemraj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Agents 👭

a list for things related to LLM agents
4 repositories

ai with docs 💁📓🗄

GPT with your documents type stuff
11 repositories

alternate attention

LLMs and other models with linear/alternative takes to traditional attention
1 repository

audio 🔈

things related to generating audio & music production
28 repositories

basics 🚸

basics and tutorials
10 repositories

chatbot and dialogue 👥

chatbot and dialogue based apps
50 repositories

computer-vision 👀

general computer vision things that don't fit in my other lists
56 repositories

🗃️ datasets

datasets for NLP et all
43 repositories
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results
Assembly 21 1 Updated Mar 22, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 11,197 899 Updated Jul 16, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,444 174 Updated Mar 8, 2024

Gemma 2 optimized for your local machine.

Python 259 17 Updated Jul 10, 2024

Pretraining Samba model

Python 1 Updated Jun 17, 2024

A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL.

Python 386 25 Updated Jul 15, 2024

LLM vulnerability scanner

Python 1,069 126 Updated Jul 16, 2024
Python 352 30 Updated Jul 16, 2024

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Python 46 8 Updated Jul 1, 2024

Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.

TypeScript 201 10 Updated Jul 5, 2024

Implementation for MatMul-free LM.

Python 2,674 159 Updated Jun 27, 2024

Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.

Rust 211 14 Updated Jul 16, 2024

GGUF implementation in C as a library and a tools CLI program

C 228 12 Updated Jul 3, 2024

Stable Diffusion in pure C/C++

C++ 2,923 237 Updated Jul 14, 2024

[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

Python 58 5 Updated May 24, 2024

The lazier way to manage everything docker

Go 35,353 1,149 Updated Jul 15, 2024

LOMO: LOw-Memory Optimization

Python 956 69 Updated Jul 2, 2024

pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.

Python 5,986 379 Updated Jul 6, 2024

Truly flash T5 realization!

Python 35 3 Updated May 20, 2024

A fast implementation of T5/UL2 in PyTorch using Flash Attention

Python 54 6 Updated Jun 26, 2024

Reference implementation of Megalodon 7B model

Cuda 495 51 Updated Apr 18, 2024

Crawl a site to generate knowledge files to create your own custom GPT from a URL

TypeScript 18,219 1,908 Updated Jul 6, 2024

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

Python 156 27 Updated Sep 21, 2022

Convert PDF to markdown quickly with high accuracy

Python 14,391 742 Updated Jul 12, 2024

A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixes to the original codebase.

Jupyter Notebook 9 Updated Dec 11, 2023

Codebase for fine-tuning / evaluating nougat-based image2latex generation models

Python 103 11 Updated Feb 25, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 13,788 1,231 Updated Jul 15, 2024

Extract structured text from pdfs quickly

Python 247 18 Updated May 27, 2024

A native PyTorch Library for large model training

Python 1,346 118 Updated Jul 16, 2024

CoreNet: A library for training deep neural networks

Python 6,747 519 Updated May 28, 2024
Next