Skip to content
View ZhiYuanZeng's full-sized avatar
Block or Report

Block or report ZhiYuanZeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 783 45 Updated Jul 25, 2024

Some preliminary explorations of Mamba's context scaling.

Python 177 9 Updated Feb 8, 2024

Official release of InternLM2.5 7B base and chat models. 1M context support

Python 5,885 423 Updated Jul 23, 2024

Collaborative Training of Large Language Models in an Efficient Way

Python 393 56 Updated Jun 18, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,260 115 Updated Jun 13, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 7,752 749 Updated Mar 15, 2024

An open-source tool-augmented conversational language model from Fudan University

Python 11,895 1,147 Updated Jul 13, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,439 2,203 Updated Feb 23, 2024

Codebase for multilingual neural machine translation

Python 13 2 Updated Nov 24, 2022

Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation

Python 25 1 Updated Sep 30, 2022

A retrieval augmented sequence modeling toolkit implemented based on Fairseq

Python 28 2 Updated Mar 3, 2023

Awesome papers on Language-Model-as-a-Service (LMaaS)

542 31 Updated May 14, 2024

The entmax mapping and its loss, a family of sparse softmax alternatives.

Python 401 43 Updated Jun 22, 2024

Repo for external large-scale work

Python 6,441 722 Updated Apr 27, 2024

Zero -- A neural machine translation system

Python 148 19 Updated May 8, 2023

PDFs and Codelabs for the Efficient Deep Learning book.

Jupyter Notebook 184 25 Updated May 29, 2023

Boosting your Web Services of Deep Learning Applications.

Python 1,222 187 Updated May 13, 2021

A curated reading list of research in Mixture-of-Experts(MoE).

499 38 Updated Sep 4, 2023

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Python 690 85 Updated Jul 25, 2024

2470 Deep Learning

Jupyter Notebook 6 Updated Dec 11, 2020

Making large AI models cheaper, faster and more accessible

Python 38,407 4,316 Updated Jul 26, 2024

Ongoing research training transformer models at scale

Python 9,510 2,145 Updated Jul 27, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 129,821 25,791 Updated Jul 27, 2024

Methods and Implements of Deep Clustering

2,752 410 Updated Jul 7, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 29,893 6,330 Updated Jul 26, 2024

如果可以,你最想穿越到哪部电影,小说里?利用 paddlenlp 中提供的 GPT2 和 wechaty 库展开对话故事续写,与 AI 互动共同创造剧情

Python 34 14 Updated Sep 5, 2021

DataBase: RHD_published_v2

Python 37 6 Updated Mar 4, 2020

A Word Sense Disambiguation system integrating implicit and explicit external knowledge.

Python 66 17 Updated Sep 14, 2021

TextBox 2.0 is a text generation library with pre-trained language models

Python 1,066 117 Updated Jul 27, 2023

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,685 2,244 Updated Jun 27, 2024
Next