Second-year graduate student.
-
Harbin Institute of Technology
- Shenzhen, China
-
05:13
(UTC +08:00) - zchuz.github.io
Stars
2
stars
written in Shell
Clear filter
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".