Stars
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Awesome list of Korean Large Language Models.
A high-throughput and memory-efficient inference and serving engine for LLMs.
High-level APIs for Amazon Web Services (AWS) in Dart.