View alirezamshi's full-sized avatar

RewardBench: the first evaluation tool for reward models.

Python 307 35 Updated Jul 26, 2024

Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation editor Gradio UI.

Jupyter Notebook 138 17 Updated Jul 21, 2024

Convert Compute And Books Into Instruct-Tuning Datasets (or classifiers)!

Python 707 92 Updated Jul 9, 2024

Go ahead and axolotl questions

Python 7,090 775 Updated Jul 29, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 13,317 880 Updated Jul 28, 2024

Zep: Long-Term Memory for AI Assistants.

Go 2,196 330 Updated Jun 24, 2024

Large Language Model Text Generation Inference

Python 8,491 969 Updated Jul 29, 2024

A bagel, with everything.

Python 303 31 Updated Apr 11, 2024

Scalable Meta-Evaluation of LLMs as Evaluators

Python 36 3 Updated Feb 15, 2024
Python 2,471 155 Updated Jul 23, 2024

An open-sourced LLM judge for evaluating LLM-generated answers.

Python 282 20 Updated Nov 2, 2023

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 355 38 Updated Jul 26, 2024

AI for all: Build the large graph of the language models

Python 203 17 Updated Jun 3, 2024

Minimalistic large language model 3D-parallelism training

Python 1,010 91 Updated Jul 29, 2024
Python 375 34 Updated Jul 17, 2024

Easily embed, cluster and semantically label text datasets

Python 404 27 Updated Mar 28, 2024

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Jupyter Notebook 28,856 4,221 Updated Jul 30, 2024

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate, Groq (100+ LLMs)

Python 10,842 1,237 Updated Jul 30, 2024

Superfast AI decision making and intelligent processing of multi-modal data.

Python 1,744 178 Updated Jul 26, 2024

All available datasets for Instruction Tuning of Large Language Models

230 11 Updated Nov 30, 2023

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 499 60 Updated Jul 29, 2024

🥷 Run AI-agents with an API

TypeScript 5,010 816 Updated Jul 22, 2024

A library for easily merging multiple LLM experts and efficiently training the merged LLM.

Python 376 22 Updated Jun 2, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,811 116 Updated Jul 22, 2024

LLM training code for Databricks foundation models

Python 3,894 510 Updated Jul 29, 2024

Supercharge Your Model Training

Python 5,077 408 Updated Jul 30, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 28,009 3,433 Updated Jul 29, 2024

Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to EMNLP 2022.

Python 20 2 Updated Feb 8, 2023

An open science effort to benchmark legal reasoning in foundation models

Python 306 36 Updated Jul 29, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 9,945 630 Updated May 2, 2024