- Atlanta, GA
- 12:36 (UTC -04:00)
- pszemraj.carrd.co/
- https://gitlab.ethz.ch/pszemraj
- https://hf.co/pszemraj
- https://wandb.ai/pszemraj
Lists (32)
- Agents 👭: a list for things related to LLM agents
- ai with docs 💁📓🗄: GPT with your documents type stuff
- alternate attention: LLMs and other models with linear/alternative takes to traditional attention
- audio 🔈: things related to generating audio & music production
- basics 🚸: basics and tutorials
- chatbot and dialogue 👥: chatbot and dialogue based apps
- computer-vision 👀: general computer vision things that don't fit in my other lists
- 🗃️ datasets: datasets for NLP et al.
- deep learning 🧠: general deep learning list
- 🌏 domain adaptation: repos for domain adapting neural nets
- ebm ⚡: things for energy based models
- 🪆embedding: text embeddings and potentially other modalities
- Framework 🖼️: Various frameworks of the machine learning variety, as they say
- front end 💻: front end resources for making interfaces and other busy work tasks
- 🔨 eval: evaluation
- information-extraction 📅
- LLM 🔢: large language models in general
- mobile 📲: mobile models/computing/frameworks
- nlp 📚: a very general list of high-level NLP repos
- OG machine learning 🧮: scikit-learn et al.
- projects 🕴️: things I am working on
- Quantization 👝: Make big model small
- reinforcement learning 🤖
- remote sensing 🌍
- render media 🖼️
- summ-retrieval-project: laion work
- summarization ⬇️: neural summarization
- Survey/📄: Repos more of us on papers/theory
- thesis: things for msc thesis
- training 🏋♂
- utils 🔧
- video 🎦: Deep learning and related utils for video. Might overlap a tad with the computer vision list
Starred repositories
A modular graph-based Retrieval-Augmented Generation (RAG) system
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Gemma 2 optimized for your local machine.
A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL.
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
GGUF implementation in C as a library and a tools CLI program
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
The lazier way to manage everything docker
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.
A fast implementation of T5/UL2 in PyTorch using Flash Attention
Reference implementation of Megalodon 7B model
Crawl a site to generate knowledge files to create your own custom GPT from a URL
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
Convert PDF to markdown quickly with high accuracy
A full codebase for replicating the results of Nougat from downloading arXiv dataset to the final evaluation. It also contains a few fixes to the original codebase.
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
A native PyTorch Library for large model training
CoreNet: A library for training deep neural networks