Skip to content
View gerardnoth's full-sized avatar
Block or Report

Block or report gerardnoth

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 9,827 1,135 Updated Jul 10, 2024

LLM101n: Let's build a Storyteller

15,335 732 Updated Jun 28, 2024

A cat(1) clone with wings.

Rust 47,746 1,187 Updated Jul 5, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,770 109 Updated Jul 10, 2024

Jlama is a modern LLM inference engine for Java

Java 423 39 Updated Jun 29, 2024

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 972 85 Updated Jul 10, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 34,048 3,555 Updated Jun 11, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 11,128 837 Updated May 23, 2024

Documentation and source code powering Twitter's Community Notes

Python 1,375 186 Updated Jul 9, 2024

Sample code for the Twitter API v2 endpoints

JavaScript 2,592 966 Updated Jul 10, 2024

Scalable toolkit for data curation

Python 338 35 Updated Jul 10, 2024

A simple, performant and scalable Jax LLM!

Python 1,367 242 Updated Jul 11, 2024

A cross-platform command-line tool to convert images into ascii art and print them on the console. Now supports braille art!

Go 2,024 121 Updated Apr 14, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,445 519 Updated Mar 12, 2024

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.

Python 984 71 Updated Jul 10, 2024

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,413 320 Updated Jul 8, 2024

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 782 65 Updated Jul 10, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,537 1,018 Updated Jun 26, 2024

AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/

Jupyter Notebook 43 38 Updated Jan 10, 2024

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Python 403 30 Updated Jul 1, 2024

Machine Learning Engineering Open Book

Python 10,198 606 Updated Jul 8, 2024

Intel One Mono font repository

9,234 314 Updated Nov 16, 2023

An innovative superfamily of fonts for code

TypeScript 13,202 220 Updated Jun 23, 2024

Java bindings for TensorFlow

Java 785 193 Updated Jun 21, 2024

Semantically Structured Sentence Embeddings

Python 64 4 Updated Nov 9, 2023

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,324 427 Updated May 3, 2024

A unified framework for machine learning with time series

Python 7,587 1,294 Updated Jul 10, 2024

Java implementation of algorithms from Russell And Norvig's "Artificial Intelligence - A Modern Approach"

Java 1,536 791 Updated Dec 20, 2023