-
Tencent AI Lab
- https://linear95.github.io/
- @cheng_pengyu
- in/pengyu-cheng
Highlights
- Pro
Block or Report
Block or report Linear95
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…
BERT-based intent and slots detector for chatbots.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Official implementation of AdvPrompter https//arxiv.org/abs/2404.16873
Collection of papers for scalable automated alignment.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471
Self-playing Adversarial Language Game Enhances LLM Reasoning
Autonomous Agents (LLMs) research papers. Updated Daily.
Large language model code completion for Emacs
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]
Yuanhy1997 / ehrdiff
Forked from sczzz3/EHRDiffEHRDiff: Exploring Realistic EHR Synthesis with Diffusion Models [TMLR]
A framework for few-shot evaluation of language models.
The official implementation of Self-Play Fine-Tuning (SPIN)
Code for paper - On Diversified Preferences of Large Language Model Alignment
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Aligning Large Language Models with Human: A Survey
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集
Code for ACL2024 paper - Adversarial Preference Optimization (APO).