Skip to content
View aplz's full-sized avatar
🍉
🍉
Block or Report

Block or report aplz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 21,526 2,182 Updated Jul 5, 2024

An LLM playground you can run on your laptop

TypeScript 6,154 478 Updated Jun 18, 2024

A library that generates an interactive radar, inspired by https://thoughtworks.com/radar/.

CSS 2,115 1,006 Updated Jun 25, 2024

🌹 Cookiecutter template featuring the modern and extensible Python project manager hatch

Python 56 2 Updated May 31, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,695 790 Updated Jul 1, 2024

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 2,460 174 Updated Jun 26, 2024

Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/course/search-fundamentals?utm_source=daniel

Python 37 270 Updated Oct 11, 2023

Repository for the NaLLM project

TypeScript 1,106 217 Updated Jun 28, 2024

🚀 The ultimate Python client for Notion!

Python 72 2 Updated Jul 5, 2024

A repository for my PyCon talk: "Building a personal assistant with Haystack and GPT: How to feed facts to large language models and reduce hallucinations"

Jupyter Notebook 19 1 Updated Apr 16, 2023

A minimal ChatGPT-like UI built with Streamlit

Python 141 70 Updated May 7, 2024
Python 7 Updated Nov 4, 2022

Compound splitter for German

Python 101 24 Updated Apr 5, 2020

utt is the universal text transformer

Java 450 7 Updated Jun 28, 2024

🧮 Extended Latent Dirichlet Allocation for Collaborative Filtering in Recommender Systems.

Jupyter Notebook 40 6 Updated May 16, 2022

Stemmer for German

C 43 11 Updated Apr 29, 2022

Deep Learning for Natural Language Processing - Lectures 2023

TeX 158 28 Updated Jul 5, 2024

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 1,992 203 Updated Jan 9, 2024

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,428 350 Updated Jul 3, 2024

Git auf deutsch

Shell 1,110 91 Updated Apr 24, 2024

⭐ Use repo badges (build passing, coverage, etc) in your readme/markdown file to signal code quality in a project.

HTML 2,885 1,232 Updated Jan 11, 2023
Python 3,144 632 Updated Nov 16, 2021

Pure Python Spell Checking http:https://pyspellchecker.readthedocs.io/en/latest/

Python 686 101 Updated Mar 9, 2024

An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the tokens that are part of the named-entity

Python 208 49 Updated Jul 2, 2024

A toolkit for making real world machine learning and data analysis applications in C++

C++ 13,196 3,335 Updated Jun 26, 2024

Transforms ClaML classifications into FHIR code systems.

Java 18 6 Updated Apr 13, 2024

Computational Linguistics 1, Fall 2019, University of Maryland

Jupyter Notebook 48 6 Updated Dec 12, 2019

Scikit-Learn, NLTK, Spacy, Gensim, Textblob and more

Jupyter Notebook 2,692 2,009 Updated Mar 28, 2024

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

Rust 45,067 1,695 Updated Jul 4, 2024

MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases

Jupyter Notebook 2,406 1,496 Updated Jun 19, 2024
Next