dawson-chen
  • Xi'an Jiaotong University

  • picgo-repo Public

    Updated Sep 25, 2024
  • Python Updated Jul 27, 2024
  • mla-fuse Public

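    MLA is designed to match the quality of conventional MHA while incurring a lower KV-cache cost. However, the official release does not include inference code for the fused matrices, which is a necessary step for reproducing the results reported in the paper. This repository implements MLA parameter fusion, along with PyTorch inference code that runs on the fused parameters.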

    Python 3 1 Updated Jul 17, 2024
  • Lancer Public

    Lancer is a desktop copilot tool built with PyQt6. It is lean and sharp, and it adapts its functions to whichever application is in the foreground.

    Python 5 3 GNU General Public License v3.0 Updated Jul 9, 2024
  • HTML Updated May 9, 2024
  • A curated reading list of research in Mixture-of-Experts(MoE).

    Apache License 2.0 Updated Sep 4, 2023
  • DeepSpeed Public

    Forked from microsoft/DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    Python Apache License 2.0 Updated Aug 9, 2023
  • Example models using DeepSpeed

    Python Apache License 2.0 Updated Apr 13, 2023
  • mlc_project Public

    Python Updated Jun 27, 2022
  • Updated May 21, 2022
  • Research Public

    Forked from PaddlePaddle/Research

    Novel deep learning research works with PaddlePaddle

    Python Apache License 2.0 Updated Mar 29, 2022
  • 🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.

    Python Apache License 2.0 Updated Aug 21, 2021
  • Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

    Python Apache License 2.0 Updated Dec 15, 2020
  • FastBERT Public

    Forked from autoliuweijie/FastBERT

    The source code of FastBERT (ACL 2020)

    Python Updated May 13, 2020
  • A TensorFlow implementation of the paper "A Deep Learning Approach To Universal Image Manipulation Detection Using A New Convolutional Layer"

    Jupyter Notebook 15 Updated Dec 24, 2018
  • Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

    HTML 1 MIT License Updated Dec 18, 2018