Resonance Router Starshipping
Stars
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
DSPy: The framework for programming—not prompting—foundation models
[Preprint] Learning to Filter Context for Retrieval-Augmented Generation
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
Transformer related optimization, including BERT, GPT
Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu, Terra Blevins…
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
A Paper List for Open-Domain Dialogue Generation, and related datasets.
Train transformer language models with reinforcement learning.
A preliminary evaluation of ChatGPT/GPT-4 for machine translation.
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
A professionally curated list of awesome resources (paper, code, data, etc.) on transformers in time series.
Code for our IJCAI 2021 paper: Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization
A paper & resource list for large language models, including courses, papers, demos, and figures