PhD Student in ML/NLP
-
Tsinghua University
- Beijing, China
- sunyt32.github.io
-
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
-
-
unilm Public
Forked from microsoft/unilmLarge-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License UpdatedAug 26, 2024 -
optimizers Public
Forked from facebookresearch/optimizersFor optimization algorithm research and development.
Python Other UpdatedJun 28, 2024 -
flash-linear-attention Public
Forked from sustcsonglin/flash-linear-attentionEfficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Python MIT License UpdatedMar 14, 2024 -
torchscale Public
Forked from microsoft/torchscaleTransformers at any scale
-
-
-
-