bzantium

Use contrastive learning to train a large language model (LLM) as a retriever

Python 5 1 Updated Jul 19, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,651 478 Updated Sep 2, 2024

Agentic components of the Llama Stack APIs

Python 3,139 305 Updated Aug 30, 2024

LLM101n: Let's build a Storyteller

27,828 1,518 Updated Aug 1, 2024

Multilingual Sentence & Image Embeddings with BERT

Python 14,762 2,428 Updated Aug 30, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,102 78 Updated Aug 30, 2024

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 1,765 232 Updated Sep 2, 2024

LLM training code for Databricks foundation models

Python 3,950 518 Updated Sep 2, 2024

Supercharge Your Model Training

Python 5,111 412 Updated Sep 2, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,968 192 Updated Sep 1, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 15,013 1,007 Updated Sep 2, 2024
Python 139 15 Updated Sep 3, 2024
Python 412 40 Updated Jul 17, 2024

LLM training in simple, raw C/CUDA

Cuda 23,109 2,567 Updated Aug 26, 2024

An Extensible Deep Learning Library

Python 1,753 225 Updated Aug 30, 2024
C++ 774 109 Updated May 24, 2023

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 486 75 Updated Sep 2, 2024

Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,356 252 Updated Aug 13, 2024

Development repository for the Triton language and compiler

C++ 12,433 1,506 Updated Sep 2, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,774 296 Updated Sep 1, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,899 131 Updated Sep 2, 2024

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 545 67 Updated Sep 2, 2024

The Python micro framework for building web applications.

Python 67,436 16,145 Updated Sep 1, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 36,672 3,842 Updated Jul 28, 2024

Minimalistic large language model 3D-parallelism training

Python 1,080 103 Updated Sep 1, 2024

Making the food-delivery experience easy for busy folks :)

Python 193 49 Updated Feb 14, 2024

Examples and guides for using the OpenAI API

MDX 58,359 9,247 Updated Aug 29, 2024

Easy and Efficient Quantization for Transformers

C++ 171 14 Updated Jul 15, 2024

To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

Python 4,404 240 Updated Aug 22, 2024