Lists (2)
Sort Name ascending (A-Z)
Starred repositories
2024中国翻墙软件VPN推荐以及科学上网避坑,稳定好用。对比SSR机场、蓝灯、V2ray、老王VPN、VPS搭建梯子等科学上网与翻墙软件,中国最新科学上网翻墙梯子VPN下载推荐,访问Chatgpt。
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting.
Manage scalable open LLM inference endpoints in Slurm clusters
Robust Speech Recognition via Large-Scale Weak Supervision
This is the repository for the Tool Learning survey.
To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
slot filling, intent detection, joint training, ATIS & SNIPS datasets, the Facebook’s multilingual dataset, MIT corpus, E-commerce Shopping Assistant (ECSA) dataset, CoNLL2003 NER, ELMo, BERT, XLNet
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
OpenChat: Advancing Open-source Language Models with Imperfect Data
The official implementation of Self-Play Fine-Tuning (SPIN)
Tools for merging pretrained large language models.
[ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
Robust recipes to align language models with human and AI preferences
Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Official inference library for Mistral models
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.