Pengcheng He

Cited by

	All	Since 2019
Citations	11979	11954
h-index	37	37
i10-index	47	47

4400

2200

1100

3300

201920202021202220232024219 723 1316 1872 3447 4314

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Weizhu ChenMicrosoftVerified email at microsoft.com
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Xiaodong LiuMicrosoft Research, RedmondVerified email at microsoft.com
Tuo ZhaoAssociate Professor, Georgia TechVerified email at gatech.edu
Haoming JiangAmazon; Georgia Institute of TechnologyVerified email at gatech.edu
Baolin PengMicrosoft Research, RedmondVerified email at microsoft.com
Jiawei HanAbel Bliss Professor of Computer Science, University of IllinoisVerified email at cs.uiuc.edu
Hao ChengMicrosoft ResearchVerified email at microsoft.com
Liyuan LiuMicrosoft ResearchVerified email at illinois.edu
Hoifung PoonGeneral Manager, Microsoft Health FuturesVerified email at microsoft.com
Adam TrischlerMicrosoft Research, McGill UniversityVerified email at microsoft.com
Tao ShenOracleVerified email at oracle.com
Guodong LongAssociate Professor, Faculty of Engineering and IT, University of Technology SydneyVerified email at uts.edu.au
William DarlingCohereVerified email at cohere.com
Yu WangMicrosoft ResearchVerified email at microsoft.com

Pengcheng He

Microsoft

Verified email at microsoft.com

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Deberta: Decoding-enhanced bert with disentangled attention P He, X Liu, J Gao, W Chen ICLR 2021, 2020	2583	2020
On the variance of the adaptive learning rate and beyond L Liu, H Jiang, P He, W Chen, X Liu, J Gao, J Han ICLR 2019, 2019	2276	2019
Multi-task deep neural networks for natural language understanding X Liu, P He, W Chen, J Gao ACL 2019, 2019	1443	2019
Debertav3: Improving deberta using electra-style pre-training with gradient-disentangled embedding sharing P He, J Gao, W Chen ICLR 2023, 2021	875	2021
Instruction tuning with gpt-4 B Peng, C Li, P He, M Galley, J Gao arXiv preprint arXiv:2304.03277, 2023	706	2023
Smart: Robust and efficient fine-tuning for pre-trained natural language models through principled regularized optimization H Jiang, P He, W Chen, X Liu, J Gao, T Zhao ACL 2020, 2019	484	2019
Check your facts and try again: Improving large language models with external knowledge and automated feedback B Peng, M Galley, P He, H Cheng, Y Xie, Y Hu, Q Huang, L Liden, Z Yu, ... arXiv preprint arXiv:2302.12813, 2023	359	2023
AdaLoRA: Adaptive budget allocation for parameter-efficient fine-tuning Q Zhang, M Chen, A Bukharin, N Karampatziakis, P He, Y Cheng, ... arXiv preprint arXiv:2303.10512, 2023	337	2023
Generation-augmented retrieval for open-domain question answering Y Mao, P He, X Liu, Y Shen, J Gao, J Han, W Chen arXiv preprint arXiv:2009.08553, 2020	218	2020
Improving multi-task deep neural networks via knowledge distillation for natural language understanding X Liu, P He, W Chen, J Gao arXiv preprint arXiv:1904.09482, 2019	215	2019
Diffusion-GAN: Training GANs with Diffusion Z Wang, H Zheng, P He, W Chen, M Zhou ICLR 2023, 2022	205	2022
Adversarial training for large neural language models X Liu, H Cheng, P He, W Chen, Y Wang, H Poon, J Gao arXiv preprint arXiv:2004.08994, 2020	188	2020
Dola: Decoding by contrasting layers improves factuality in large language models YS Chuang, Y Xie, H Luo, Y Kim, J Glass, P He arXiv preprint arXiv:2309.03883, 2023	160	2023
Patch diffusion: Faster and more data-efficient training of diffusion models Z Wang, Y Jiang, H Zheng, P Wang, P He, Z Wang, W Chen, M Zhou Advances in neural information processing systems 36, 2024	135	2024
Query rewriting for retrieval-augmented large language models X Ma, Y Gong, P He, H Zhao, N Duan arXiv preprint arXiv:2305.14283, 2023	128	2023
X-SQL: reinforce schema representation with context P He, Y Mao, K Chakrabarti, W Chen arXiv preprint arXiv:1908.08113, 2019	107	2019
Loftq: Lora-fine-tuning-aware quantization for large language models Y Li, Y Yu, C Liang, P He, N Karampatziakis, W Chen, T Zhao arXiv preprint arXiv:2310.08659, 2023	100	2023
On the variance of the adaptive learning rate and beyond. arXiv 2019 L Liu, H Jiang, P He, W Chen, X Liu, J Gao, J Han arXiv preprint arXiv:1908.03265, 2019	97	2019
Platon: Pruning large transformer models with upper confidence bound of weight importance Q Zhang, S Zuo, C Liang, A Bukharin, P He, W Chen, T Zhao International Conference on Machine Learning, 26809-26823, 2022	74	2022
NeurIPS 2020 EfficientQA competition: Systems, analyses and lessons learned S Min, J Boyd-Graber, C Alberti, D Chen, E Choi, M Collins, K Guu, ... NeurIPS 2020, 2021	73	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors