Stars
Unsupervised text tokenizer for Neural Network-based text generation.
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Llama 3 implementation, one matrix multiplication at a time
Documentation and source code powering Twitter's Community Notes
Sample code for the Twitter API v2 endpoints
A cross-platform command-line tool to convert images into ASCII art and print them on the console. Now supports braille art!
Official Code for Stable Cascade
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
MiniCPM-2B: An end-side LLM outperforming Llama2-13B.
A simple and efficient Mamba implementation in pure PyTorch and MLX.
Foundational Models for State-of-the-Art Speech and Text Translation
AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African Languages: https://afrisenti-semeval.github.io/
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
Machine Learning Engineering Open Book
An innovative superfamily of fonts for code
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
A unified framework for machine learning with time series
Java implementation of algorithms from Russell and Norvig's "Artificial Intelligence: A Modern Approach"