Skip to content
View Murhaf's full-sized avatar
Block or Report

Block or report Murhaf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Open-source scientific and technical publishing system built on Pandoc.

JavaScript 3,612 295 Updated Jul 25, 2024

Minimalist NMT for educational purposes

Python 668 211 Updated Jan 29, 2024

Pympress is a simple yet powerful PDF reader designed for dual-screen presentations

Python 1,118 88 Updated Jul 23, 2024

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,566 450 Updated Jul 11, 2024

Utility for behavioral and representational analyses of Language Models

Python 111 27 Updated Jul 25, 2024

Agentless🐱: an agentless approach to automatically solve software development problems

Python 500 45 Updated Jul 23, 2024

ReFT: Representation Finetuning for Language Models

Python 979 84 Updated Jul 25, 2024
Python 6 Updated Jul 2, 2024

The most streamlined road map to learn ML fundamentals for free.

233 27 Updated Jul 23, 2024

AI Observability & Evaluation

Jupyter Notebook 3,130 233 Updated Jul 25, 2024

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

Python 5,505 167 Updated Jul 25, 2024

Reconquer the canvas: beautiful Tikz figures without clunky Tikz code

Python 371 33 Updated Nov 18, 2020

LLM101n: Let's build a Storyteller

25,473 1,352 Updated Jul 21, 2024

Python module (C extension and plain python) implementing Aho-Corasick algorithm

C 918 122 Updated Mar 21, 2024

Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in single-threaded setting)

Python 667 21 Updated Jul 21, 2024

AI + Data, online. https://vespa.ai

Java 5,493 587 Updated Jul 25, 2024

The ultimate Vim configuration (vimrc)

Vim Script 30,413 7,261 Updated May 27, 2024

Fast & Simple repository for pre-training and fine-tuning T5-style models

Python 944 68 Updated Jun 14, 2024

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Python 1,180 73 Updated Jul 25, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,241 2,452 Updated Jul 15, 2024

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,098 210 Updated Jul 19, 2024

Experiments for efforts to train a new and improved t5

Python 75 5 Updated Apr 15, 2024

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,500 177 Updated Jun 27, 2024

Paper List for Contrastive Learning for Natural Language Processing

516 56 Updated Apr 27, 2023

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,163 712 Updated Jul 22, 2024

Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)

Python 9 Updated Apr 29, 2024

MTEB: Massive Text Embedding Benchmark

Python 1,672 221 Updated Jul 25, 2024

Sparsity-aware deep learning inference runtime for CPUs

Python 2,944 167 Updated Jul 19, 2024

Scale LLM Engine public repository

Python 760 50 Updated Jul 25, 2024

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,171 100 Updated Mar 16, 2024
Next