Murhaf

Follow

Murhaf Murhaf

Follow

14 followers · 36 following

Norway

Achievements

Achievements

Block or Report

Block or report Murhaf

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Lists (2)

Sort

arabic

llms

17 repositories

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

quarto-dev / quarto-cli

Open-source scientific and technical publishing system built on Pandoc.

JavaScript 3,612 295 Updated Jul 25, 2024

joeynmt / joeynmt

Minimalist NMT for educational purposes

Python 668 211 Updated Jan 29, 2024

Cimbali / pympress

Pympress is a simple yet powerful PDF reader designed for dual-screen presentations

Python 1,118 88 Updated Jul 23, 2024

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,566 450 Updated Jul 11, 2024

kanishkamisra / minicons

Utility for behavioral and representational analyses of Language Models

Python 111 27 Updated Jul 25, 2024

OpenAutoCoder / Agentless

Agentless🐱: an agentless approach to automatically solve software development problems

Python 500 45 Updated Jul 23, 2024

stanfordnlp / pyreft

ReFT: Representation Finetuning for Language Models

Python 979 84 Updated Jul 25, 2024

for-ai / llm-profiling-toolkit

Python 6 Updated Jul 2, 2024

loganthorneloe / ml-road-map

The most streamlined road map to learn ML fundamentals for free.

233 27 Updated Jul 23, 2024

Arize-ai / phoenix

AI Observability & Evaluation

Jupyter Notebook 3,130 233 Updated Jul 25, 2024

marimo-team / marimo

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

Python 5,505 167 Updated Jul 25, 2024

negrinho / sane_tikz

Reconquer the canvas: beautiful Tikz figures without clunky Tikz code

Python 371 33 Updated Nov 18, 2020

karpathy / LLM101n

LLM101n: Let's build a Storyteller

25,473 1,352 Updated Jul 21, 2024

WojciechMula / pyahocorasick

Python module (C extension and plain python) implementing Aho-Corasick algorithm

C 918 122 Updated Mar 21, 2024

xhluca / bm25s

Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in single-threaded setting)

Python 667 21 Updated Jul 21, 2024

vespa-engine / vespa

AI + Data, online. https://vespa.ai

Java 5,493 587 Updated Jul 25, 2024

amix / vimrc

The ultimate Vim configuration (vimrc)

Vim Script 30,413 7,261 Updated May 27, 2024

PiotrNawrot / nanoT5

Fast & Simple repository for pre-training and fine-tuning T5-style models

Python 944 68 Updated Jun 14, 2024

argilla-io / distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Python 1,180 73 Updated Jul 25, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,241 2,452 Updated Jul 15, 2024

huggingface / setfit

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,098 210 Updated Jul 19, 2024

EleutherAI / improved-t5

Experiments for efforts to train a new and improved t5

Python 75 5 Updated Apr 15, 2024

beir-cellar / beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,500 177 Updated Jun 27, 2024

ryanzhumich / Contrastive-Learning-NLP-Papers

Paper List for Contrastive Learning for Natural Language Processing

516 56 Updated Apr 27, 2023

cleanlab / cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,163 712 Updated Jul 22, 2024

UBC-NLP / octopus

Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)

Python 9 Updated Apr 29, 2024

embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark

Python 1,672 221 Updated Jul 25, 2024

neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Python 2,944 167 Updated Jul 19, 2024

scaleapi / llm-engine

Scale LLM Engine public repository

Python 760 50 Updated Jul 25, 2024

bheinzerling / bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,171 100 Updated Mar 16, 2024

Starred topics

Machine learning

Deep learning

ml

macOS

Python

Natural language processing