BowieHsu
I may be slow to respond.
  • Alibaba
  • China Hangzhou

Starred repositories

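2024 recommendations for VPN and firewall-circumvention software in China, with pitfalls to avoid; stable and easy to use. Compares SSR airport services, Lantern, V2ray, LaoWang VPN, self-hosted VPS proxies, and other circumvention tools; the latest recommended VPN downloads for China, with access to ChatGPT.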

HTML 15,674 1,456 Updated Sep 26, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,377 144 Updated Sep 27, 2024

WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting.

Python 27 4 Updated Jul 23, 2024

Manage scalable open LLM inference endpoints in Slurm clusters

Python 222 22 Updated Jul 11, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 68,186 8,060 Updated Sep 27, 2024

This is the repository for the Tool Learning survey.

204 8 Updated Sep 25, 2024

Speeds up long-context LLM inference by computing attention with approximate, dynamic sparsity, which reduces pre-filling latency by up to 10x on an A100 while maintaining accuracy.

Python 712 27 Updated Sep 25, 2024

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,823 772 Updated Aug 24, 2023

slot filling, intent detection, joint training, ATIS & SNIPS datasets, Facebook’s multilingual dataset, MIT corpus, E-commerce Shopping Assistant (ECSA) dataset, CoNLL2003 NER, ELMo, BERT, XLNet

Python 392 106 Updated Feb 4, 2021

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 804 58 Updated Sep 25, 2024

DataComp for Language Models

HTML 1,121 99 Updated Sep 5, 2024

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,366 467 Updated Sep 28, 2024

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,322 285 Updated Aug 15, 2024

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Python 1,978 255 Updated Sep 17, 2024
Python 275 15 Updated Sep 18, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,230 399 Updated Sep 13, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 990 88 Updated May 8, 2024

Tools for merging pretrained large language models.

Python 4,573 406 Updated Sep 16, 2024

[ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

Python 63 8 Updated Mar 21, 2024

Robust recipes to align language models with human and AI preferences

Python 4,525 393 Updated Sep 23, 2024

Go ahead and axolotl questions

Python 7,645 839 Updated Sep 30, 2024

Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.

Python 674 42 Updated Jul 10, 2024

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,111 65 Updated Feb 14, 2024

Daily updated LLM papers. Subscriptions are welcome 👏 If you like it, please leave a star 🌟

935 35 Updated Jul 31, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 976 47 Updated Jan 16, 2024

Official inference library for Mistral models

Jupyter Notebook 9,573 847 Updated Sep 20, 2024

Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning

Python 65 2 Updated Dec 14, 2023

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 4,898 530 Updated Aug 8, 2024

leaked prompts of GPTs

28,399 3,834 Updated Sep 27, 2024

FireAct: Toward Language Agent Fine-tuning

Python 245 19 Updated Oct 22, 2023