🔥 Awesome Augmented Language Model

This repo collects research papers on leveraging the capabilities of language models; it can serve as a reference for building upper-layer applications.

Contents

  • Model
  • Prompt
  • Information Seeking
  • Action and Plan
  • Reasoning
  • Long-text Generation
  • Analysis and Boundary

Model

❗ Only generative language models are included here.

Foundation Model

  • Köpf, Andreas and Kilcher, Yannic and von Rütte, Dimitri and Anagnostidis, Sotiris et al., OpenAssistant Conversations -- Democratizing Large Language Model Alignment, 2023 [Paper]

  • Zhao, Wayne Xin and Zhou, Kun and Li, Junyi et al., A Survey of Large Language Models, 2023 [Paper]

  • Xu, Canwen and Guo, Daya and Duan, Nan and McAuley, Julian, Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data, 2023 [Paper]

  • Lin, Baihan and Bouneffouf, Djallel and Cecchi, Guillermo and Varshney, Kush R., Towards Healthy AI: Large Language Models Need Therapists Too, 2023 [Paper]

  • Xiong, Honglin and Wang, Sheng and Zhu, Yitao and Zhao, Zihao and Liu, Yuxiao and Wang, Qian and Shen, Dinggang, DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task, 2023 [Paper]

  • Wu, Shijie and Irsoy, Ozan and Lu, Steven et al., BloombergGPT: A Large Language Model for Finance, 2023 [Paper]

  • Bubeck, Sébastien and Chandrasekaran, Varun and Eldan, Ronen and Gehrke, Johannes and Horvitz, Eric and Kamar, Ece and Lee, Peter and Lee, Yin Tat and Li, Yuanzhi and Lundberg, Scott and Nori, Harsha and Palangi, Hamid and Ribeiro, Marco Tulio and Zhang, Yi, Sparks of Artificial General Intelligence: Early experiments with GPT-4, 2023 [Paper]

  • OpenAI, GPT-4 Technical Report, 2023 [Paper]

  • Touvron, Hugo and Lavril, Thibaut and Izacard, Gautier and Martinet, Xavier and Lachaux, Marie-Anne and Lacroix, Timothée and Rozière, Baptiste and Goyal, Naman and Hambro, Eric and Azhar, Faisal and Rodriguez, Aurelien and Joulin, Armand and Grave, Edouard and Lample, Guillaume, LLaMA: Open and Efficient Foundation Language Models, 2023 [Paper]

  • Wei, Chengwei and Wang, Yun-Cheng and Wang, Bin and Kuo, C.-C. Jay, An Overview on Language Models: Recent Developments and Outlook, 2023 [Paper]

  • Amatriain, Xavier, Transformer models: an introduction and catalog, 2023 [Paper]

  • Zhou, Ce and Li, Qian and Li, Chen and Yu, Jun et al., A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT, 2023 [Paper]

  • Longpre, Shayne and Hou, Le and Vu, Tu and Webson, Albert and Chung, Hyung Won and Tay, Yi and Zhou, Denny and Le, Quoc V. and Zoph, Barret and Wei, Jason and Roberts, Adam, The Flan Collection: Designing Data and Methods for Effective Instruction Tuning, 2023 [Paper]

  • Iyer, Srinivasan and Lin, Xi Victoria and Pasunuru, Ramakanth et al., OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization, 2023 [Paper]

  • Radford, Alec and Narasimhan, Karthik and Salimans, Tim and Sutskever, Ilya, Improving Language Understanding by Generative Pre-Training, 2018 [Paper]

  • Radford, Alec and Wu, Jeffrey and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya, Language Models are Unsupervised Multitask Learners, 2019 [Paper]

  • Smith, Shaden and Patwary, Mostofa and Norick, Brandon et al., Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model, 2022 [Paper]

  • Du, Nan and Huang, Yanping and Dai, Andrew M. and Tong, Simon et al., GLaM: Efficient Scaling of Language Models with Mixture-of-Experts, 2022 [Paper]

  • Borgeaud, Sebastian and Mensch, Arthur and Hoffmann, Jordan et al., Improving language models by retrieving from trillions of tokens, 2022 [Paper]

  • Hoffmann, Jordan and Borgeaud, Sebastian and Mensch, Arthur et al., Training Compute-Optimal Large Language Models, 2022 [Paper]

  • BigScience Workshop and Scao, Teven Le et al., BLOOM: A 176B-Parameter Open-Access Multilingual Language Model, 2022 [Paper]

  • Bai, Yuntao and Jones, Andy and Ndousse, Kamal et al., Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback, 2022 [Paper]

  • Ouyang, Long and Wu, Jeff and Jiang, Xu and Almeida, Diogo et al., Training language models to follow instructions with human feedback, 2022 [Paper]

  • Chung, Hyung Won and Hou, Le and Longpre, Shayne and Zoph, Barret et al., Scaling Instruction-Finetuned Language Models, 2022 [Paper]

  • Scheurer, Jérémy and Campos, Jon Ander and Chan, Jun Shern and Chen, Angelica and Cho, Kyunghyun and Perez, Ethan, Training Language Models with Language Feedback, 2022 [Paper]

  • Zhang, Susan and Roller, Stephen and Goyal, Naman et al., OPT: Open Pre-trained Transformer Language Models, 2022 [Paper]

  • Chowdhery, Aakanksha and Narang, Sharan and Devlin, Jacob et al., PaLM: Scaling Language Modeling with Pathways, 2022 [Paper]

  • Bai, Yuntao and Kadavath, Saurav and Kundu, Sandipan et al., Constitutional AI: Harmlessness from AI Feedback, 2022 [Paper]

  • Chen, Mark and Tworek, Jerry and Jun, Heewoo and Yuan, Qiming et al., Evaluating Large Language Models Trained on Code, 2021 [Paper]

  • Raffel, Colin and Shazeer, Noam and Roberts, Adam and Lee, Katherine and Narang, Sharan and Matena, Michael and Zhou, Yanqi and Li, Wei and Liu, Peter J., Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer, 2020 [Paper]

  • Brown, Tom B. and Mann, Benjamin and Ryder, Nick and Subbiah, Melanie et al., Language Models are Few-Shot Learners, 2020 [Paper]

  • Lieber, Opher and Sharir, Or and Lentz, Barak and Shoham, Yoav, Jurassic-1: Technical Details and Evaluation, 2021 [Paper]

  • Lewis, Mike and Liu, Yinhan and Goyal, Naman and Ghazvininejad, Marjan and Mohamed, Abdelrahman and Levy, Omer and Stoyanov, Ves and Zettlemoyer, Luke, BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension, 2019 [Paper]

Alignment and RLHF

  • Dong, Hanze and Xiong, Wei and Goyal, Deepanshu and Pan, Rui and Diao, Shizhe and Zhang, Jipeng and Shum, Kashun and Zhang, Tong, RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment, 2023 [Paper]

  • Scheurer, Jérémy and Campos, Jon Ander and Korbak, Tomasz and Chan, Jun Shern and Chen, Angelica and Cho, Kyunghyun and Perez, Ethan, Training Language Models with Language Feedback at Scale, 2023 [Paper]

  • Kwon, Minae and Xie, Sang Michael and Bullard, Kalesha and Sadigh, Dorsa, Reward Design with Language Models, 2023 [Paper]

  • Go, Dongyoung and Korbak, Tomasz and Kruszewski, Germán and Rozen, Jos and Ryu, Nahyeon and Dymetman, Marc, Aligning Language Models with Preferences through f-divergence Minimization, 2023 [Paper]

  • Myers, Vivek and Bıyık, Erdem and Sadigh, Dorsa, Active Reward Learning from Online Preferences, 2023 [Paper]

  • Hartmann, Jochen and Schwenzow, Jasper and Witte, Maximilian, The political ideology of conversational AI: Converging evidence on ChatGPT's pro-environmental, left-libertarian orientation, 2023 [Paper]

  • Liu, Ruibo and Jia, Chenyan and Zhang, Ge and Zhuang, Ziyu and Liu, Tony X. and Vosoughi, Soroush, Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits, 2023 [Paper]

  • Mechergui, Malek and Sreedharan, Sarath, Goal Alignment: A Human-Aware Account of Value Alignment Problem, 2023 [Paper]

  • Ganguli, Deep and Lovitt, Liane and Kernion, Jackson et al., Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned, 2023 [Paper]

  • Liu, Hao and Sferrazza, Carmelo and Abbeel, Pieter, Languages are Rewards: Hindsight Finetuning using Human Feedback, 2023 [Paper]

  • Prabhumoye, Shrimai and Patwary, Mostofa and Shoeybi, Mohammad and Catanzaro, Bryan, Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models, 2023 [Paper]

  • Ganguli, Deep and Askell, Amanda and Schiefer, Nicholas et al., The Capacity for Moral Self-Correction in Large Language Models, 2023 [Paper]

  • Korbak, Tomasz and Shi, Kejian and Chen, Angelica and Bhalerao, Rasika and Buckley, Christopher L. and Phang, Jason and Bowman, Samuel R. and Perez, Ethan, Pretraining Language Models with Human Preferences, 2023 [Paper]

  • Lee, Kimin and Liu, Hao and Ryu, Moonkyung and Watkins, Olivia and Du, Yuqing and Boutilier, Craig and Abbeel, Pieter and Ghavamzadeh, Mohammad and Gu, Shixiang Shane, Aligning Text-to-Image Models using Human Feedback, 2023 [Paper]

  • Bakker, Michiel A. and Chadwick, Martin J. and Sheahan, Hannah R. and Tessler, Michael Henry and Campbell-Gillingham, Lucy and Balaguer, Jan and McAleese, Nat and Glaese, Amelia and Aslanides, John and Botvinick, Matthew M. and Summerfield, Christopher, Fine-tuning language models to find agreement among humans with diverse preferences, 2022 [Paper]

  • Glaese, Amelia and McAleese, Nat and Trębacz, Maja et al., Improving alignment of dialogue agents via targeted human judgements, 2022 [Paper]

  • Stiennon, Nisan and Ouyang, Long and Wu, Jeff and Ziegler, Daniel M. and Lowe, Ryan and Voss, Chelsea and Radford, Alec and Amodei, Dario and Christiano, Paul, Learning to summarize from human feedback, 2022 [Paper]

  • Bai, Yuntao and Jones, Andy and Ndousse, Kamal et al., Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback, 2022 [Paper]

  • Bai, Yuntao and Kadavath, Saurav and Kundu, Sandipan et al., Constitutional AI: Harmlessness from AI Feedback, 2022 [Paper]

  • Zhao, Lingjun and Nguyen, Khanh and Daumé III, Hal, A Cognitive Evaluation of Instruction Generation Agents tl;dr They Need Better Theory-of-Mind Capabilities, 2022 [Paper]

  • Askell, Amanda and Bai, Yuntao and Chen, Anna et al., A General Language Assistant as a Laboratory for Alignment, 2021 [Paper]
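
Several of the RLHF papers above (e.g., Stiennon et al. 2022; Ouyang et al. 2022; Bai et al. 2022) first fit a reward model on pairwise human preferences. As a rough orientation only, here is a minimal PyTorch sketch of that pairwise (Bradley-Terry) preference loss; the tiny bag-of-embeddings reward model and the random token batches are toy stand-ins, not any paper's actual setup:

```python
# Minimal sketch of the pairwise preference loss used when fitting RLHF
# reward models: maximize log sigmoid(r_chosen - r_rejected). The "reward
# model" below is a toy mean-pooled embedding + linear head, standing in
# for a real transformer with a scalar head; all data is fabricated.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyRewardModel(nn.Module):
    def __init__(self, vocab_size: int = 1000, dim: int = 64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.head = nn.Linear(dim, 1)  # scalar reward per sequence

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        pooled = self.embed(token_ids).mean(dim=1)  # (batch, dim)
        return self.head(pooled).squeeze(-1)        # (batch,)

model = ToyRewardModel()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

chosen = torch.randint(0, 1000, (4, 16))    # responses the labeler preferred
rejected = torch.randint(0, 1000, (4, 16))  # responses the labeler rejected

opt.zero_grad()
# Push the reward of the preferred response above the rejected one.
loss = -F.logsigmoid(model(chosen) - model(rejected)).mean()
loss.backward()
opt.step()
print(f"pairwise preference loss: {loss.item():.4f}")
```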

Dialogue Model

  • Thoppilan, Romal and De Freitas, Daniel and Hall, Jamie et al., LaMDA: Language Models for Dialog Applications, 2022 [Paper]

  • Shuster, Kurt and Xu, Jing and Komeili, Mojtaba and Ju, Da et al., BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage, 2022 [Paper]

  • Glaese, Amelia and McAleese, Nat and Trębacz, Maja et al., Improving alignment of dialogue agents via targeted human judgements, 2022 [Paper]

  • Zhang, Yizhe and Sun, Siqi and Galley, Michel and Chen, Yen-Chun and Brockett, Chris and Gao, Xiang and Gao, Jianfeng and Liu, Jingjing and Dolan, Bill, DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation, 2020 [Paper]

Prompt

Hard Prompt

  • Zhou, Yulin and Zhao, Yiren and Shumailov, Ilia and Mullins, Robert and Gal, Yarin, Revisiting Automated Prompting: Are We Actually Doing Better?, 2023 [Paper]

  • Liu, Chi and Wang, Haochun and Xi, Nuwa and Zhao, Sendong and Qin, Bing, Global Prompt Cell: A Portable Control Module for Effective Prompt, 2023 [Paper]

  • Pitis, Silviu and Zhang, Michael R. and Wang, Andrew and Ba, Jimmy, Boosted Prompt Ensembles for Large Language Models, 2023 [Paper]

  • Wu, Tongshuang and Jiang, Ellen and Donsbach, Aaron and Gray, Jeff and Molina, Alejandra and Terry, Michael and Cai, Carrie J., PromptChainer: Chaining Large Language Model Prompts through Visual Programming, 2022 [Paper]

  • Chen, Jiuhai and Chen, Lichang and Huang, Heng and Zhou, Tianyi, When do you need Chain-of-Thought Prompting for ChatGPT?, 2023 [Paper]

  • Madaan, Aman and Yazdanbakhsh, Amir, Text and Patterns: For Effective Chain of Thought, It Takes Two to Tango, 2022 [Paper]

  • Orhan, A. Emin, Recognition, recall, and retention of few-shot memories in large language models, 2023 [Paper]

  • Zhou, Wenxuan and Zhang, Sheng and Poon, Hoifung and Chen, Muhao, Context-faithful Prompting for Large Language Models, 2023 [Paper]

  • Lyu, Xinxi and Min, Sewon and Beltagy, Iz and Zettlemoyer, Luke and Hajishirzi, Hannaneh, Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations, 2022 [Paper]

  • Beurer-Kellner, Luca and Fischer, Marc and Vechev, Martin, Prompting Is Programming: A Query Language For Large Language Models, 2022 [Paper]

  • Kim, Seungone and Joo, Se June and Jang, Yul and Chae, Hyungjoo and Yeo, Jinyoung, CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification, 2023 [Paper]

  • Wei, Jerry and Wei, Jason and Tay, Yi and Tran, Dustin and Webson, Albert and Lu, Yifeng and Chen, Xinyun and Liu, Hanxiao and Huang, Da and Zhou, Denny and Ma, Tengyu, Larger language models do in-context learning differently, 2023 [Paper]

  • Singh, Chandan and Morris, John X. and Aneja, Jyoti and Rush, Alexander M. and Gao, Jianfeng, Explaining Patterns in Data with Language Models via Interpretable Autoprompting, 2023 [Paper]

  • Zhang, Renrui and Hu, Xiangfei and Li, Bohao and Huang, Siyuan and Deng, Hanqiu and Li, Hongsheng and Qiao, Yu and Gao, Peng, Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners, 2023 [Paper]

  • Wu, Zhiyong and Wang, Yaoxiang and Ye, Jiacheng and Kong, Lingpeng, Self-adaptive In-context Learning, 2022 [Paper]

  • Lu, Yao and Bartolo, Max and Moore, Alastair and Riedel, Sebastian and Stenetorp, Pontus, Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity, 2022 [Paper]

  • Wu, Zhenyu and Wang, YaoXiang and Ye, Jiacheng and Feng, Jiangtao and Xu, Jingjing and Qiao, Yu and Wu, Zhiyong, OpenICL: An Open-Source Framework for In-context Learning, 2023 [Paper]

  • Ye, Seonghyeon and Hwang, Hyeonbin and Yang, Sohee and Yun, Hyeongu and Kim, Yireun and Seo, Minjoon, In-Context Instruction Learning, 2023 [Paper]

  • Liu, Hao and Sferrazza, Carmelo and Abbeel, Pieter, Chain of Hindsight Aligns Language Models with Feedback, 2023 [Paper]

  • Shum, KaShun and Diao, Shizhe and Zhang, Tong, Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data, 2023 [Paper]

  • Diao, Shizhe and Wang, Pengcheng and Lin, Yong and Zhang, Tong, Active Prompting with Chain-of-Thought for Large Language Models, 2023 [Paper]

  • Li, Xiaonan and Qiu, Xipeng, Finding Supporting Examples for In-Context Learning, 2023 [Paper]

  • Wang, Yunlong and Shen, Shuyuan and Lim, Brian Y., RePrompt: Automatic Prompt Editing to Refine AI-Generative Art Towards Precise Expressions, 2023 [Paper]

  • Wei, Xiang and Cui, Xingyu and Cheng, Ning and Wang, Xiaobin and Zhang, Xin and Huang, Shen and Xie, Pengjun and Xu, Jinan and Chen, Yufeng and Zhang, Meishan and Jiang, Yong and Han, Wenjuan, Zero-Shot Information Extraction via Chatting with ChatGPT, 2023 [Paper]

  • Li, Zekun and Peng, Baolin and He, Pengcheng and Galley, Michel and Gao, Jianfeng and Yan, Xifeng, Guiding Large Language Models via Directional Stimulus Prompting, 2023 [Paper]

  • Zhang, Zhuosheng and Zhang, Aston and Li, Mu and Zhao, Hai and Karypis, George and Smola, Alex, Multimodal Chain-of-Thought Reasoning in Language Models, 2023 [Paper]

  • Wen, Yuxin and Jain, Neel and Kirchenbauer, John and Goldblum, Micah and Geiping, Jonas and Goldstein, Tom, Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery, 2023 [Paper]

  • Goswami, Koustava and Lange, Lukas and Araki, Jun and Adel, Heike, SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains, 2023 [Paper]

  • Cheng, Zhoujun and Kasai, Jungo and Yu, Tao, Batch Prompting: Efficient Inference with Large Language Model APIs, 2023 [Paper]

  • Lyu, Qing and Havaldar, Shreya and Stein, Adam and Zhang, Li and Rao, Delip and Wong, Eric and Apidianaki, Marianna and Callison-Burch, Chris, Faithful Chain-of-Thought Reasoning, 2023 [Paper]

  • Shao, Zhihong and Gong, Yeyun and Shen, Yelong and Huang, Minlie and Duan, Nan and Chen, Weizhu, Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models, 2023 [Paper]

  • Wang, Yizhong and Kordi, Yeganeh and Mishra, Swaroop and Liu, Alisa and Smith, Noah A. and Khashabi, Daniel and Hajishirzi, Hannaneh, Self-Instruct: Aligning Language Model with Self Generated Instructions, 2022 [Paper]

  • Deng, Mingkai and Wang, Jianyu and Hsieh, Cheng-Ping and Wang, Yihan and Guo, Han and Shu, Tianmin and Song, Meng and Xing, Eric P. and Hu, Zhiting, RLPrompt: Optimizing Discrete Text Prompts With Reinforcement Learning, 2022 [Paper]

  • Khot, Tushar and Trivedi, Harsh and Finlayson, Matthew and Fu, Yao and Richardson, Kyle and Clark, Peter and Sabharwal, Ashish, Decomposed Prompting: A Modular Approach for Solving Complex Tasks, 2022 [Paper]

  • Khot, Tushar and Richardson, Kyle and Khashabi, Daniel and Sabharwal, Ashish, Hey AI, Can You Solve Complex Tasks by Talking to Agents?, 2022 [Paper]

  • Honovich, Or and Shaham, Uri and Bowman, Samuel R. and Levy, Omer, Instruction Induction: From Few Examples to Natural Language Task Descriptions, 2022 [Paper]

  • Yuan, Weizhe and Liu, Pengfei, reStructured Pre-training, 2022 [Paper]

  • Zhong, Wanjun and Gao, Yifan and Ding, Ning and Qin, Yujia and Liu, Zhiyuan and Zhou, Ming and Wang, Jiahai and Yin, Jian and Duan, Nan, ProQA: Structural Prompt-based Pre-training for Unified Question Answering, 2022 [Paper]

  • Zhang, Zhuosheng and Zhang, Aston and Li, Mu and Smola, Alex, Automatic Chain of Thought Prompting in Large Language Models, 2022 [Paper]

  • Kojima, Takeshi and Gu, Shixiang Shane and Reid, Machel and Matsuo, Yutaka and Iwasawa, Yusuke, Large Language Models are Zero-Shot Reasoners, 2022 [Paper]

  • Weng, Jinta and Hu, Yue and Tian, Zhihong and Huang, Heyan, ConsPrompt: Easily Exploiting Contrastive Samples for Few-shot Prompt Learning, 2022 [Paper]

  • Wang, Xuezhi and Wei, Jason and Schuurmans, Dale and Le, Quoc and Chi, Ed and Narang, Sharan and Chowdhery, Aakanksha and Zhou, Denny, Self-Consistency Improves Chain of Thought Reasoning in Language Models, 2022 [Paper]

  • Zhou, Yongchao and Muresanu, Andrei Ioan and Han, Ziwen and Paster, Keiran and Pitis, Silviu and Chan, Harris and Ba, Jimmy, Large Language Models Are Human-Level Prompt Engineers, 2022 [Paper]

  • Meck, Anna-Maria and Precht, Lisa, How to Design the Perfect Prompt: A Linguistic Approach to Prompt Design in Automotive Voice Assistants – An Exploratory Study, 13th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 2022 [Paper]

  • Arora, Simran and Narayan, Avanika and Chen, Mayee F. and Orr, Laurel and Guha, Neel and Bhatia, Kush and Chami, Ines and Sala, Frederic and Ré, Christopher, Ask Me Anything: A simple strategy for prompting language models, 2022 [Paper]

  • Rubin, Ohad and Herzig, Jonathan and Berant, Jonathan, Learning To Retrieve Prompts for In-Context Learning, 2022 [Paper]

  • Liu, Pengfei and Yuan, Weizhe and Fu, Jinlan and Jiang, Zhengbao and Hayashi, Hiroaki and Neubig, Graham, Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing, 2021 [Paper]

  • Reynolds, Laria and McDonell, Kyle, Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm, 2021 [Paper]

  • Liu, Jiachang and Shen, Dinghan and Zhang, Yizhe and Dolan, Bill and Carin, Lawrence and Chen, Weizhu, What Makes Good In-Context Examples for GPT-3?, 2021 [Paper]

  • Perez, Ethan and Kiela, Douwe and Cho, Kyunghyun, True Few-Shot Learning with Language Models, 2021 [Paper]

  • Gao, Tianyu and Fisch, Adam and Chen, Danqi, Making Pre-trained Language Models Better Few-shot Learners, 2021 [Paper]

  • Han, Xu and Zhao, Weilin and Ding, Ning and Liu, Zhiyuan and Sun, Maosong, PTR: Prompt Tuning with Rules for Text Classification, 2021 [Paper]

  • Shin, Taylor and Razeghi, Yasaman and Logan IV, Robert L. and Wallace, Eric and Singh, Sameer, AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts, 2020 [Paper]

  • Wei, Jason and Wang, Xuezhi and Schuurmans, Dale and Bosma, Maarten and Chi, Ed and Le, Quoc and Zhou, Denny, Chain of Thought Prompting Elicits Reasoning in Large Language Models, 2022 [Paper]
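
For orientation, the chain-of-thought entries above boil down to a very small prompting pattern. Below is a minimal sketch of the zero-shot variant ("Let's think step by step", Kojima et al. 2022); the `generate` function is a placeholder for whatever LLM completion API you use:

```python
# Minimal sketch of zero-shot chain-of-thought prompting (Kojima et al., 2022):
# append a reasoning trigger, then extract the final answer in a second call.
# `generate` is a placeholder for any text-completion LLM API.

def generate(prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def zero_shot_cot(question: str) -> str:
    # Step 1: elicit an explicit reasoning chain.
    reasoning = generate(f"Q: {question}\nA: Let's think step by step.")
    # Step 2: condition on the chain to extract a short final answer.
    return generate(
        f"Q: {question}\nA: Let's think step by step. {reasoning}\n"
        "Therefore, the answer is"
    )

# Example: zero_shot_cot("A juggler has 16 balls; half are golf balls. How many golf balls?")
```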

Soft Prompt

  • Lei, Tao and Bai, Junwen and Brahma, Siddhartha and Ainslie, Joshua and Lee, Kenton and Zhou, Yanqi and Du, Nan et al., Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference, 2023 [Paper]

  • Hu, Zhiqiang and Lan, Yihuai and Wang, Lei and Xu, Wanyu and Lim, Ee-Peng and Lee, Roy Ka-Wei and Bing, Lidong and Poria, Soujanya, LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models, 2023 [Paper]

  • Zhang, Qingru and Chen, Minshuo and Bukharin, Alexander and He, Pengcheng and Cheng, Yu and Chen, Weizhu and Zhao, Tuo, Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning, 2023 [Paper]

  • Chen, Derek and Lee, Celine and Lu, Yunan and Rosati, Domenic and Yu, Zhou, Mixture of Soft Prompts for Controllable Data Generation, 2023 [Paper]

  • Yang, Xianjun and Cheng, Wei and Zhao, Xujiang and Petzold, Linda and Chen, Haifeng, Dynamic Prompting: A Unified Framework for Prompt Tuning, 2023 [Paper]

  • Xiao, Guangxuan and Lin, Ji and Han, Song, Offsite-Tuning: Transfer Learning without Full Model, 2023 [Paper]

  • Razdaibiedina, Anastasia and Mao, Yuning and Hou, Rui and Khabsa, Madian and Lewis, Mike and Almahairi, Amjad, Progressive Prompts: Continual Learning for Language Models, 2023 [Paper]

  • Chen, Chen and Zhang, Wei Emma and Shakeri, Alireza Seyed, Incorporating Knowledge into Document Summarization: an Application of Prefix-Tuning on GPT-2, 2023 [Paper]

  • Bowman, Benjamin and Achille, Alessandro and Zancato, Luca and Trager, Matthew and Perera, Pramuditha and Paolini, Giovanni and Soatto, Stefano, À-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting, 2023 [Paper]

  • Sun, Simeng and Liu, Yang and Iter, Dan and Zhu, Chenguang and Iyyer, Mohit, How Does In-Context Learning Help Prompt Tuning?, 2023 [Paper]

  • Wang, Zifeng and Zhang, Zizhao and Ebrahimi, Sayna and Sun, Ruoxi and Zhang, Han and Lee, Chen-Yu and Ren, Xiaoqi and Su, Guolong and Perot, Vincent and Dy, Jennifer and Pfister, Tomas, DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning, 2022 [Paper]

  • Nie, Xing and Ni, Bolin and Chang, Jianlong and Meng, Gaomeng and Huo, Chunlei and Zhang, Zhaoxiang and Xiang, Shiming and Tian, Qi and Pan, Chunhong, Pro-tuning: Unified Prompt Tuning for Vision Tasks, 2022 [Paper]

  • Wang, Shijie and Chang, Jianlong and Wang, Zhihui and Li, Haojie and Ouyang, Wanli and Tian, Qi, Fine-grained Retrieval Prompt Tuning, 2022 [Paper]

  • Santos, Cicero Nogueira dos and Dong, Zhe and Cer, Daniel and Nham, John and Shakeri, Siamak and Ni, Jianmo and Sung, Yun-hsuan, Knowledge Prompts: Injecting World Knowledge into Language Models through Soft Prompts, 2022 [Paper]

  • Ma, Fang and Zhang, Chen and Ren, Lei and Wang, Jingang and Wang, Qifan and Wu, Wei and Quan, Xiaojun and Song, Dawei, XPrompt: Exploring the Extreme of Prompt Tuning, 2022 [Paper]

  • Liu, Xiao and Ji, Kaixuan and Fu, Yicheng and Tam, Weng Lam and Du, Zhengxiao and Yang, Zhilin and Tang, Jie, P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks, 2022 [Paper]

  • Cui, Ganqu and Hu, Shengding and Ding, Ning and Huang, Longtao and Liu, Zhiyuan, Prototypical Verbalizer for Prompt-based Few-shot Tuning, 2022 [Paper]

  • He, Yun and Zheng, Huaixiu Steven and Tay, Yi and Gupta, Jai and Du, Yu and Aribandi, Vamsi and Zhao, Zhe and Li, YaGuang and Chen, Zhao and Metzler, Donald and Cheng, Heng-Tze and Chi, Ed H., HyperPrompt: Prompt-based Task-Conditioning of Transformers, 2022 [Paper]

  • Gu, Yuxian and Han, Xu and Liu, Zhiyuan and Huang, Minlie, PPT: Pre-trained Prompt Tuning for Few-shot Learning, 2021 [Paper]

  • Li, Xiang Lisa and Liang, Percy, Prefix-Tuning: Optimizing Continuous Prompts for Generation, 2021 [Paper]

  • Logan IV, Robert L. and Balažević, Ivana and Wallace, Eric and Petroni, Fabio and Singh, Sameer and Riedel, Sebastian, Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models, 2021 [Paper]

  • Peters, Matthew E. and Ruder, Sebastian and Smith, Noah A., To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks, 2019 [Paper]
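
The common thread in the soft-prompt papers above (prompt tuning, prefix-tuning, P-Tuning v2) is learning a handful of continuous vectors prepended to the input while the pretrained model stays frozen. A minimal PyTorch sketch of that idea; the small randomly initialized encoder and the extra classification head are toy stand-ins for a real pretrained model and task setup:

```python
# Minimal sketch of soft prompt tuning: learn only `n_prompt` prepended
# embedding vectors while the base model stays frozen. The "frozen" encoder
# here is a toy stand-in for a pretrained transformer; data is fabricated.
import torch
import torch.nn as nn
import torch.nn.functional as F

dim, n_prompt, vocab = 64, 8, 1000

frozen_embed = nn.Embedding(vocab, dim)
frozen_encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True),
    num_layers=2,
)
for p in list(frozen_embed.parameters()) + list(frozen_encoder.parameters()):
    p.requires_grad = False  # the base model is never updated

soft_prompt = nn.Parameter(torch.randn(n_prompt, dim) * 0.02)  # trainable
classifier = nn.Linear(dim, 2)                                  # trainable task head

token_ids = torch.randint(0, vocab, (4, 16))            # fabricated batch
tok = frozen_embed(token_ids)                           # (4, 16, dim)
prompt = soft_prompt.unsqueeze(0).expand(4, -1, -1)     # (4, n_prompt, dim)
hidden = frozen_encoder(torch.cat([prompt, tok], dim=1))
logits = classifier(hidden.mean(dim=1))                 # (4, 2)

loss = F.cross_entropy(logits, torch.randint(0, 2, (4,)))
loss.backward()  # gradients reach only soft_prompt and classifier
```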

Dialogue Prompt

  • Madaan, Aman and Tandon, Niket and Clark, Peter and Yang, Yiming, Memory-assisted prompt editing to improve GPT-3 after deployment, 2022 [Paper]

  • Chen, Maximillian and Papangelis, Alexandros and Tao, Chenyang and Kim, Seokhwan and Rosenbaum, Andy and Liu, Yang and Yu, Zhou and Hakkani-Tur, Dilek, PLACES: Prompting Language Models for Social Conversation Synthesis, 2023 [Paper]

  • White, Jules and Fu, Quchen and Hays, Sam and Sandborn, Michael and Olea, Carlos and Gilbert, Henry and Elnashar, Ashraf and Spencer-Smith, Jesse and Schmidt, Douglas C., A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT, 2023 [Paper]

  • Ross, Steven I. and Muller, Michael and Martinez, Fernando and Houde, Stephanie and Weisz, Justin D., A Case Study in Engineering a Conversational Programming Assistant's Persona, 2023 [Paper]

  • Lee, Young-Jun and Lim, Chae-Gyun and Choi, Ho-Jin, Does GPT-3 Generate Empathetic Dialogues? A Novel In-Context Example Selection Method and Automatic Evaluation Metric for Empathetic Dialogue Generation, 2022 [Paper]

  • Liu, Zihan and Patwary, Mostofa and Prenger, Ryan and Prabhumoye, Shrimai and Ping, Wei and Shoeybi, Mohammad and Catanzaro, Bryan, Multi-Stage Prompting for Knowledgeable Dialogue Generation, 2022 [Paper]

  • Mehri, Shikib and Altun, Yasemin and Eskenazi, Maxine, LAD: Language Models as Data for Zero-Shot Dialog, 2022 [Paper]

  • Gupta, Prakhar and Jiao, Cathy and Yeh, Yi-Ting and Mehri, Shikib and Eskenazi, Maxine and Bigham, Jeffrey P., InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning, 2022 [Paper]

  • Li, Zekun and Chen, Wenhu and Li, Shiyang and Wang, Hong and Qian, Jing and Yan, Xifeng, Dialogic: Controllable Dialogue Simulation with In-Context Learning, 2022 [Paper]

  • Han, Seungju and Kim, Beomsu and Yoo, Jin Yong and Seo, Seokjun and Kim, Sangbum and Erdenee, Enkhbayar and Chang, Buru, Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances, 2022 [Paper]

  • Hu, Yushi and Lee, Chia-Hsuan and Xie, Tianbao and Yu, Tao and Smith, Noah A. and Ostendorf, Mari, In-Context Learning for Few-Shot Dialogue State Tracking, 2022 [Paper]

  • Saha, Debjoy and Santra, Bishal and Goyal, Pawan, A Study on Prompt-based Few-Shot Learning Methods for Belief State Tracking in Task-oriented Dialog Systems, 2022 [Paper]

  • Kumar, Harsh and Musabirov, Ilya and Shi, Jiakai and Lauzon, Adele and Choy, Kwan Kiu and Gross, Ofek and Kulzhabayeva, Dana and Williams, Joseph Jay, Exploring The Design of Prompts For Applying GPT-3 based Chatbots: A Mental Wellbeing Case Study on Mechanical Turk, 2022 [Paper]

  • Mitsuda, Koh and Higashinaka, Ryuichiro and Saito, Kuniko, Combining Argumentation Structure and Language Model for Generating Natural Argumentative Dialogue, 2022 [Paper]

  • Valvoda, Josef and Fang, Yimai and Vandyke, David, Prompting for a conversation: How to control a dialog model?, 2022 [Paper]

  • He, Keqing and Wang, Jingang and Sun, Chaobo and Wu, Wei, Unified Knowledge Prompt Pre-training for Customer Service Dialogues, 2022 [Paper]

  • Reed, Lena and Li, Cecilia and Ramirez, Angela and Wu, Liren and Walker, Marilyn, Jurassic is (almost) All You Need: Few-Shot Meaning-to-Text Generation for Open-Domain Dialogue, 2021 [Paper]

  • Liu, Jiachang and Shen, Dinghan and Zhang, Yizhe and Dolan, Bill and Carin, Lawrence and Chen, Weizhu, What Makes Good In-Context Examples for GPT-3, 2021 [Paper]

  • Madotto, Andrea and Lin, Zhaojiang and Winata, Genta Indra and Fung, Pascale, Few-Shot Bot: Prompt-Based Learning for Dialogue Systems, 2021 [Paper]

  • Prodan, George P and Pelican, Elena, Prompt scoring system for dialogue summarization using GPT-3, 2021 [Paper]

Information Seeking

Extra Knowledge

  • Sun, Weiwei and Ren, Pengjie and Ren, Zhaochun, Generative Knowledge Selection for Knowledge-Grounded Dialogues, 2023 [Paper]

  • Chen, Zichen and Singh, Ambuj K. and Sra, Misha, LMExplainer: a Knowledge-Enhanced Explainer for Language Models, 2023 [Paper]

  • Peng, Baolin and Galley, Michel and He, Pengcheng and Cheng, Hao and Xie, Yujia and Hu, Yu and Huang, Qiuyuan and Liden, Lars and Yu, Zhou and Chen, Weizhu and Gao, Jianfeng, Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback, 2023 [Paper]

  • Khattab, Omar and Santhanam, Keshav and Li, Xiang Lisa and Hall, David and Liang, Percy and Potts, Christopher and Zaharia, Matei, Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP, 2023 [Paper]

  • Zhou, Shuyan and Alon, Uri and Xu, Frank F. and Wang, Zhiruo and Jiang, Zhengbao and Neubig, Graham, DocPrompting: Generating Code by Retrieving the Docs, 2023 [Paper]

  • Sun, Zhiqing and Wang, Xuezhi and Tay, Yi and Yang, Yiming and Zhou, Denny, Recitation-Augmented Language Models, 2023 [Paper]

  • Yu, Wenhao and Iter, Dan and Wang, Shuohang and Xu, Yichong and Ju, Mingxuan and Sanyal, Soumya and Zhu, Chenguang and Zeng, Michael and Jiang, Meng, Generate rather than Retrieve: Large Language Models are Strong Context Generators, 2023 [Paper]

  • Cohen, Roi and Geva, Mor and Berant, Jonathan and Globerson, Amir, Crawling the Internal Knowledge-Base of Language Models, 2023 [Paper]

  • Zamani, Hamed and Trippas, Johanne R. and Dalton, Jeff and Radlinski, Filip, Conversational Information Seeking, 2023 [Paper]

  • Guo, Zhixin and Yan, Mingxuan and Qi, Jiexing and Zhou, Jianping and He, Ziwei and Lin, Zhouhan and Zheng, Guanjie and Wang, Xinbing, Few-Shot Table-to-Text Generation with Prompt Planning and Knowledge Memorization, 2023 [Paper]

  • BehnamGhader, Parishad and Miret, Santiago and Reddy, Siva, Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model, 2022 [Paper]

  • Borgeaud, Sebastian and Mensch, Arthur and Hoffmann, Jordan et al., Improving language models by retrieving from trillions of tokens, 2022 [Paper]

  • Shaier, Sagi and Hunter, Lawrence and Kann, Katharina, Mind the Knowledge Gap: A Survey of Knowledge-enhanced Dialogue Systems, 2022 [Paper]

  • Majumder, Bodhisattwa Prasad and Jhamtani, Harsh and Berg-Kirkpatrick, Taylor and McAuley, Julian, Achieving Conversational Goals with Unsupervised Post-hoc Knowledge Injection, 2022 [Paper]

  • Pereira, Jayr and Fidalgo, Robson and Lotufo, Roberto and Nogueira, Rodrigo, Visconde: Multi-document QA with GPT-3 and Neural Reranking, 2022 [Paper]

  • Izacard, Gautier and Lewis, Patrick and Lomeli, Maria and Hosseini, Lucas et al., Atlas: Few-shot Learning with Retrieval Augmented Language Models, 2022 [Paper]

  • Trivedi, Harsh and Balasubramanian, Niranjan and Khot, Tushar and Sabharwal, Ashish, Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, 2022 [Paper]

  • AlKhamissi, Badr and Li, Millicent and Celikyilmaz, Asli and Diab, Mona and Ghazvininejad, Marjan, A Review on Language Models as Knowledge Bases, 2022 [Paper]

  • Dong, Qingxiu and Dai, Damai and Song, Yifan and Xu, Jingjing and Sui, Zhifang and Li, Lei, Calibrating Factual Knowledge in Pretrained Language Models, 2022 [Paper]

  • Santos, Cicero Nogueira dos and Dong, Zhe and Cer, Daniel and Nham, John and Shakeri, Siamak and Ni, Jianmo and Sung, Yun-hsuan, Knowledge Prompts: Injecting World Knowledge into Language Models through Soft Prompts, 2022 [Paper]

  • He, Hangfeng and Zhang, Hongming and Roth, Dan, Rethinking with Retrieval: Faithful Large Language Model Inference, 2022 [Paper]

  • Shuster, Kurt and Komeili, Mojtaba and Adolphs, Leonard and Roller, Stephen and Szlam, Arthur and Weston, Jason, Language Models that Seek for Knowledge: Modular Search & Generation for Dialogue and Prompt Completion, 2022 [Paper]

  • Xue, Qiang and Takiguchi, Tetsuya and Ariki, Yasuo, Building a Knowledge-Based Dialogue System with Text Infilling, 2022 [Paper]

  • Lewis, Patrick and Perez, Ethan and Piktus, Aleksandra and Petroni, Fabio and Karpukhin, Vladimir and Goyal, Naman and Küttler, Heinrich and Lewis, Mike and Yih, Wen-tau and Rocktäschel, Tim and Riedel, Sebastian and Kiela, Douwe, Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, 2021 [Paper]

  • Adolphs, Leonard and Shuster, Kurt and Urbanek, Jack and Szlam, Arthur and Weston, Jason, Reason first, then respond: Modular Generation for Knowledge-infused Dialogue, 2021 [Paper]

  • Shuster, Kurt and Poff, Spencer and Chen, Moya and Kiela, Douwe and Weston, Jason, Retrieval Augmentation Reduces Hallucination in Conversation, 2021 [Paper]

  • Zhao, Xueliang and Wu, Wei and Xu, Can and Tao, Chongyang and Zhao, Dongyan and Yan, Rui, Knowledge-Grounded Dialogue Generation with Pre-trained Language Models, 2020 [Paper]
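
Most retrieval-augmentation papers above share a retrieve-then-read skeleton: embed the question, pick the nearest passages, and condition generation on them. A minimal sketch of that skeleton; `embed` and `generate` are placeholders for a real embedding model and LLM, and the prompt wording is just one plausible choice:

```python
# Minimal retrieve-then-read sketch in the spirit of retrieval-augmented
# generation (Lewis et al., 2021). `embed` and `generate` are placeholders.
import numpy as np

def embed(text: str) -> np.ndarray:
    raise NotImplementedError("plug in an embedding model")

def generate(prompt: str) -> str:
    raise NotImplementedError("plug in an LLM")

def answer(question: str, passages: list[str], k: int = 3) -> str:
    q = embed(question)
    vecs = np.stack([embed(p) for p in passages])
    # Cosine similarity between the question and every candidate passage.
    sims = vecs @ q / (np.linalg.norm(vecs, axis=1) * np.linalg.norm(q) + 1e-8)
    top = [passages[i] for i in np.argsort(-sims)[:k]]
    context = "\n\n".join(top)
    return generate(
        f"Answer using only the context below.\n\nContext:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )
```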

Extra Modality

  • Liu, Haotian and Li, Chunyuan and Wu, Qingyang and Lee, Yong Jae, Visual Instruction Tuning, 2023 [Paper]

  • Zhu, Wanrong and Hessel, Jack and Awadalla, Anas and Gadre, Samir Yitzhak and Dodge, Jesse and Fang, Alex and Yu, Youngjae and Schmidt, Ludwig and Wang, William Yang and Choi, Yejin, Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text, 2023 [Paper]

  • Ge, Jiaxin and Luo, Hongyin and Qian, Siyuan and Gan, Yulu and Fu, Jie and Zhang, Shanghang, Chain of Thought Prompt Tuning in Vision Language Models, 2023 [Paper]

  • Noever, David and Noever, Samantha Elizabeth Miller, The Multimodal And Modular Ai Chef: Complex Recipe Generation From Imagery, 2023 [Paper]

  • Peng, Baolin and Li, Chunyuan and He, Pengcheng and Galley, Michel and Gao, Jianfeng, Instruction Tuning with GPT-4, 2023 [Paper]

  • West, Colin G., Advances in apparent conceptual physics reasoning in ChatGPT-4, 2023 [Paper]

  • Shukor, Mustafa and Dancette, Corentin and Cord, Matthieu, eP-ALM: Efficient Perceptual Augmentation of Language Models, 2023 [Paper]

  • Zhao, Ruochen and Chen, Hailin and Wang, Weishi and Jiao, Fangkai and Do, Xuan Long and Qin, Chengwei and Ding, Bosheng and Guo, Xiaobao and Li, Minzhi and Li, Xingxuan and Joty, Shafiq, Retrieving Multimodal Information for Augmented Generation: A Survey, 2023 [Paper]

  • Zhu, Deyao and Chen, Jun and Haydarov, Kilichbek and Shen, Xiaoqian and Zhang, Wenxuan and Elhoseiny, Mohamed, ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions, 2023 [Paper]

  • Zhao, Xufeng and Li, Mengdi and Weber, Cornelius and Hafez, Muhammad Burhan and Wermter, Stefan, Chat with the Environment: Interactive Multimodal Perception using Large Language Models, 2023 [Paper]

  • Driess, Danny and Xia, Fei and Sajjadi, Mehdi S. M. and Lynch, Corey and Chowdhery, Aakanksha and Ichter, Brian and Wahid, Ayzaan and Tompson, Jonathan and Vuong, Quan and Yu, Tianhe and Huang, Wenlong and Chebotar, Yevgen and Sermanet, Pierre and Duckworth, Daniel and Levine, Sergey and Vanhoucke, Vincent and Hausman, Karol and Toussaint, Marc and Greff, Klaus and Zeng, Andy and Mordatch, Igor and Florence, Pete, PaLM-E: An Embodied Multimodal Language Model, 2023 [Paper]

  • Wu, Chenfei and Yin, Shengming and Qi, Weizhen and Wang, Xiaodong and Tang, Zecheng and Duan, Nan, Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models, 2023 [Paper]

  • Camposampiero, Giacomo and Houmard, Loic and Estermann, Benjamin and Mathys, Joël and Wattenhofer, Roger, Visual Abstraction and Reasoning through Language, 2023 [Paper]

  • Shao, Zhenwei and Yu, Zhou and Wang, Meng and Yu, Jun, Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering, 2023 [Paper]

  • West, Colin G., AI and the FCI: Can ChatGPT Project an Understanding of Introductory Physics?, 2023 [Paper]

  • Huang, Shaohan and Dong, Li and Wang, Wenhui and Hao, Yaru and Singhal, Saksham and Ma, Shuming and Lv, Tengchao and Cui, Lei and Mohammed, Owais Khan and Liu, Qiang and Aggarwal, Kriti and Chi, Zewen and Bjorck, Johan and Chaudhary, Vishrav and Som, Subhojit and Song, Xia and Wei, Furu, Language Is Not All You Need: Aligning Perception with Language Models, 2023 [Paper]

  • Li, Junnan and Li, Dongxu and Savarese, Silvio and Hoi, Steven, BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models, 2023 [Paper]

  • Alayrac, Jean-Baptiste and Donahue, Jeff and Luc, Pauline et al., Flamingo: a Visual Language Model for Few-Shot Learning, 2022 [Paper]

  • Su, Yixuan and Lan, Tian and Liu, Yahui and Liu, Fangyu and Yogatama, Dani and Wang, Yan and Kong, Lingpeng and Collier, Nigel, Language Models Can See: Plugging Visual Controls in Text Generation, 2022 [Paper]

  • Yang, Zhengyuan and Gan, Zhe and Wang, Jianfeng and Hu, Xiaowei and Lu, Yumao and Liu, Zicheng and Wang, Lijuan, An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, 2022 [Paper]

  • Tsimpoukelli, Maria and Menick, Jacob and Cabi, Serkan and Eslami, S. M. Ali and Vinyals, Oriol and Hill, Felix, Multimodal Few-Shot Learning with Frozen Language Models, 2021 [Paper]

Web Search

  • Ram, Ori and Levine, Yoav and Dalmedigos, Itay and Muhlgay, Dor and Shashua, Amnon and Leyton-Brown, Kevin and Shoham, Yoav, In-Context Retrieval-Augmented Language Models, 2023 [Paper]

  • Khattab, Omar and Santhanam, Keshav and Li, Xiang Lisa and Hall, David and Liang, Percy and Potts, Christopher and Zaharia, Matei, Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP, 2023 [Paper]

  • Shi, Weijia and Min, Sewon and Yasunaga, Michihiro and Seo, Minjoon and James, Rich and Lewis, Mike and Zettlemoyer, Luke and Yih, Wen-tau, REPLUG: Retrieval-Augmented Black-Box Language Models, 2023 [Paper]

  • Lazaridou, Angeliki and Gribovskaya, Elena and Stokowiec, Wojciech and Grigorev, Nikolai, Internet-augmented language models through few-shot prompting for open-domain question answering, 2022 [Paper]

  • Piktus, Aleksandra and Petroni, Fabio and Karpukhin, Vladimir et al., The Web Is Your Oyster - Knowledge-Intensive NLP against a Very Large Web Corpus, 2022 [Paper]

  • Nakano, Reiichiro and Hilton, Jacob and Balaji, Suchir et al., WebGPT: Browser-assisted question-answering with human feedback, 2021 [Paper]

  • Komeili, Mojtaba and Shuster, Kurt and Weston, Jason, Internet-Augmented Dialogue Generation, 2021 [Paper]

Action and Plan

  • Lu, Pan and Peng, Baolin and Cheng, Hao and Galley, Michel and Chang, Kai-Wei and Wu, Ying Nian and Zhu, Song-Chun and Gao, Jianfeng, Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models, 2023 [Paper]

  • Qin, Yujia and Hu, Shengding and Lin, Yankai and Chen, Weize and Ding, Ning and Cui, Ganqu et al., Tool Learning with Foundation Models, 2023 [Paper]

  • Li, Minghao and Song, Feifan and Yu, Bowen and Yu, Haiyang and Li, Zhoujun and Huang, Fei and Li, Yongbin, API-Bank: A Benchmark for Tool-Augmented LLMs, 2023 [Paper]

  • Patel, Dhruvesh and Eghbalzadeh, Hamid and Kamra, Nitin and Iuzzolino, Michael Louis and Jain, Unnat and Desai, Ruta, Pretrained Language Models as Visual Planners for Human Assistance, 2023 [Paper]

  • Mai, Jinjie and Chen, Jun and Li, Bing and Qian, Guocheng and Elhoseiny, Mohamed and Ghanem, Bernard, LLM as A Robotic Brain: Unifying Egocentric Memory and Control, 2023 [Paper]

  • Ge, Yingqiang and Hua, Wenyue and Ji, Jianchao and Tan, Juntao and Xu, Shuyuan and Zhang, Yongfeng, OpenAGI: When LLM Meets Domain Experts, 2023 [Paper]

  • Chen, Xinyun and Lin, Maxwell and Schärli, Nathanael and Zhou, Denny, Teaching Large Language Models to Self-Debug, 2023 [Paper]

  • Park, Joon Sung and O'Brien, Joseph C. and Cai, Carrie J. and Morris, Meredith Ringel and Liang, Percy and Bernstein, Michael S., Generative Agents: Interactive Simulacra of Human Behavior, 2023 [Paper]

  • Khan, Muhammad Arshad and Kenney, Max and Painter, Jack and Kamale, Disha and Batista-Navarro, Riza and Ghalamzan-E, Amir, Natural Language Robot Programming: NLP integrated with autonomous robotic grasping, 2023 [Paper]

  • Francis, Jonathan and Kitamura, Nariaki and Labelle, Felix and Lu, Xiaopeng and Navarro, Ingrid and Oh, Jean, Core Challenges in Embodied Vision-Language Planning, 2023 [Paper]

  • Jia, Zhiwei and Liu, Fangchen and Thumuluri, Vineet and Chen, Linghao and Huang, Zhiao and Su, Hao, Chain-of-Thought Predictive Control, 2023 [Paper]

  • Tandon, Niket and Madaan, Aman and Clark, Peter and Yang, Yiming, Learning to Repair: Repairing model output errors after deployment using a dynamic memory of feedback, 2022 [Paper]

  • Liang, Yaobo and Wu, Chenfei and Song, Ting et al., TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs, 2023 [Paper]

  • Shen, Yongliang and Song, Kaitao and Tan, Xu and Li, Dongsheng and Lu, Weiming and Zhuang, Yueting, HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace, 2023 [Paper]

  • Kim, Geunwoo and Baldi, Pierre and McAleer, Stephen, Language Models can Solve Computer Tasks, 2023 [Paper]

  • Madaan, Aman and Tandon, Niket and Gupta, Prakhar and Hallinan, Skyler and Gao, Luyu et al., Self-Refine: Iterative Refinement with Self-Feedback, 2023 [Paper]

  • Shinn, Noah and Labash, Beck and Gopinath, Ashwin, Reflexion: an autonomous agent with dynamic memory and self-reflection, 2023 [Paper]

  • Yang, Zhengyuan and Li, Linjie and Wang, Jianfeng and Lin, Kevin and Azarnasab, Ehsan and Ahmed, Faisal and Liu, Zicheng and Liu, Ce and Zeng, Michael and Wang, Lijuan, MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action, 2023 [Paper]

  • Paranjape, Bhargavi and Lundberg, Scott and Singh, Sameer and Hajishirzi, Hannaneh and Zettlemoyer, Luke and Ribeiro, Marco Tulio, ART: Automatic multi-step reasoning and tool-use for large language models, 2023 [Paper]

  • Jiang, Xue and Dong, Yihong and Wang, Lecheng and Shang, Qiwei and Li, Ge, Self-planning Code Generation with Large Language Model, 2023 [Paper]

  • Luo, Zhezheng and Mao, Jiayuan and Wu, Jiajun and Lozano-Pérez, Tomás and Tenenbaum, Joshua B. and Kaelbling, Leslie Pack, Learning Rational Subgoals from Demonstrations and Instructions, 2023 [Paper]

  • Zhang, Shun and Chen, Zhenfang and Shen, Yikang and Ding, Mingyu and Tenenbaum, Joshua B. and Gan, Chuang, Planning with Large Language Models for Code Generation, 2023 [Paper]

  • Dorbala, Vishnu Sashank and Mullen Jr., James F. and Manocha, Dinesh, Can an Embodied Agent Find Your "Cat-shaped Mug"? LLM-Based Zero-Shot Object Navigation, 2023 [Paper]

  • Haldar, Siddhant and Pari, Jyothish and Rai, Anant and Pinto, Lerrel, Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations, 2023 [Paper]

  • Huang, Wenlong and Xia, Fei and Shah, Dhruv and Driess, Danny and Zeng, Andy and Lu, Yao and Florence, Pete and Mordatch, Igor and Levine, Sergey and Hausman, Karol and Ichter, Brian, Grounded Decoding: Guiding Text Generation with Grounded Models for Robot Control, 2023 [Paper]

  • Capitanelli, Alessio and Mastrogiovanni, Fulvio, A Framework to Generate Neurosymbolic PDDL-compliant Planners, 2023 [Paper]

  • Karamcheti, Siddharth and Nair, Suraj and Chen, Annie S. and Kollar, Thomas and Finn, Chelsea and Sadigh, Dorsa and Liang, Percy, Language-Driven Representation Learning for Robotics, 2023 [Paper]

  • Vemprala, Sai and Bonatti, Rogerio and Bucker, Arthur and Kapoor, Ashish, ChatGPT for Robotics: Design Principles and Model Abilities, 2023 [Paper]

  • Carta, Thomas and Romac, Clément and Wolf, Thomas and Lamprier, Sylvain and Sigaud, Olivier and Oudeyer, Pierre-Yves, Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning, 2023 [Paper]

  • Dasgupta, Ishita and Kaeser-Chen, Christine and Marino, Kenneth and Ahuja, Arun and Babayan, Sheila and Hill, Felix and Fergus, Rob, Collaborating with language models for embodied reasoning, 2023 [Paper]

  • Mialon, Grégoire and Dessì, Roberto and Lomeli, Maria and Nalmpantis, Christoforos and Pasunuru, Ram and Raileanu, Roberta and Rozière, Baptiste and Schick, Timo and Dwivedi-Yu, Jane and Celikyilmaz, Asli and Grave, Edouard and LeCun, Yann and Scialom, Thomas, Augmented Language Models: a Survey, 2023 [Paper]

  • Schick, Timo and Dwivedi-Yu, Jane and Dessì, Roberto and Raileanu, Roberta and Lomeli, Maria and Zettlemoyer, Luke and Cancedda, Nicola and Scialom, Thomas, Toolformer: Language Models Can Teach Themselves to Use Tools, 2023 [Paper]

  • Xie, Yaqi and Yu, Chen and Zhu, Tongyao and Bai, Jinbin and Gong, Ze and Soh, Harold, Translating Natural Language to Planning Goals with Large-Language Models, 2023 [Paper]

  • Du, Yuqing and Watkins, Olivia and Wang, Zihan and Colas, Cédric and Darrell, Trevor and Abbeel, Pieter and Gupta, Abhishek and Andreas, Jacob, Guiding Pretraining in Reinforcement Learning with Large Language Models, 2023 [Paper]

  • Wang, Zihao and Cai, Shaofei and Liu, Anji and Ma, Xiaojian and Liang, Yitao, Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents, 2023 [Paper]

  • Li, Belinda Z. and Chen, William and Sharma, Pratyusha and Andreas, Jacob, LaMPP: Language Models as Probabilistic Priors for Perception and Action, 2023 [Paper]

  • Yao, Shunyu and Zhao, Jeffrey and Yu, Dian and Du, Nan and Shafran, Izhak and Narasimhan, Karthik and Cao, Yuan, ReAct: Synergizing Reasoning and Acting in Language Models, 2022 [Paper]

  • Li, Shuang and Puig, Xavier and Paxton, Chris and Du, Yilun and Wang, Clinton and Fan, Linxi and Chen, Tao and Huang, De-An and Akyürek, Ekin and Anandkumar, Anima and Andreas, Jacob and Mordatch, Igor and Torralba, Antonio and Zhu, Yuke, Pre-Trained Language Models for Interactive Decision-Making, 2022 [Paper]

  • Valmeekam, Karthik and Sreedharan, Sarath and Marquez, Matthew and Olmo, Alberto and Kambhampati, Subbarao, On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark), 2023 [Paper]

  • Sharma, Pratyusha and Torralba, Antonio and Andreas, Jacob, Skill Induction and Planning with Latent Language, 2022 [Paper]

  • Xu, Jing and Ung, Megan and Komeili, Mojtaba and Arora, Kushal and Boureau, Y.-Lan and Weston, Jason, Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback, 2022 [Paper]

  • Press, Ofir and Zhang, Muru and Min, Sewon and Schmidt, Ludwig and Smith, Noah A. and Lewis, Mike, Measuring and Narrowing the Compositionality Gap in Language Models, 2022 [Paper]

  • Liu, Ruibo and Wei, Jason and Gu, Shixiang Shane and Wu, Te-Yen and Vosoughi, Soroush and Cui, Claire and Zhou, Denny and Dai, Andrew M., Mind's Eye: Grounded Language Model Reasoning through Simulation, 2022 [Paper]

  • Brohan, Anthony and Brown, Noah et al., RT-1: Robotics Transformer for Real-World Control at Scale, 2022 [Paper]

  • Huang, Wenlong and Xia, Fei and Xiao, Ted and Chan, Harris and Liang, Jacky and Florence, Pete and Zeng, Andy and Tompson, Jonathan and Mordatch, Igor and Chebotar, Yevgen and Sermanet, Pierre and Brown, Noah and Jackson, Tomas and Luu, Linda and Levine, Sergey and Hausman, Karol and Ichter, Brian, Inner Monologue: Embodied Reasoning through Planning with Language Models, 2022 [Paper]

  • Ahn, Michael and Brohan, Anthony et al., Do As I Can, Not As I Say: Grounding Language in Robotic Affordances, 2022 [Paper]

  • Kalakonda, Sai Shashank and Maheshwari, Shubh and Sarvadevabhatla, Ravi Kiran, Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Zero Shot Action Generation, 2022 [Paper]

  • Liang, Jacky and Huang, Wenlong and Xia, Fei and Xu, Peng and Hausman, Karol and Ichter, Brian and Florence, Pete and Zeng, Andy, Code as Policies: Language Model Programs for Embodied Control, 2022 [Paper]

  • Zelikman, Eric and Wu, Yuhuai and Mu, Jesse and Goodman, Noah D., STaR: Bootstrapping Reasoning With Reasoning, 2022 [Paper]

  • Nakamura, Taiki and Kawano, Seiya and Yuguchi, Akishige and Kawanishi, Yasutomo and Yoshino, Koichiro, What Should the System Do Next?: Operative Action Captioning for Estimating System Actions, 2022 [Paper]

  • Colas, Cédric and Karch, Tristan and Moulin-Frier, Clément and Oudeyer, Pierre-Yves, Language and Culture Internalisation for Human-Like Autotelic AI, 2022 [Paper]

  • Valmeekam, Karthik and Olmo, Alberto and Sreedharan, Sarath and Kambhampati, Subbarao, Large Language Models Still Can't Plan (A Benchmark for LLMs on Planning and Reasoning about Change), 2022 [Paper]

  • Song, Chan Hee and Wu, Jiaman and Washington, Clayton and Sadler, Brian M. and Chao, Wei-Lun and Su, Yu, LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models, 2022 [Paper]

  • Singh, Ishika and Blukis, Valts and Mousavian, Arsalan and Goyal, Ankit and Xu, Danfei and Tremblay, Jonathan and Fox, Dieter and Thomason, Jesse and Garg, Animesh, ProgPrompt: Generating Situated Robot Task Plans using Large Language Models, 2022 [Paper]

  • Huang, Wenlong and Abbeel, Pieter and Pathak, Deepak and Mordatch, Igor, Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents, 2022 [Paper]
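
Many of the agent papers above (ReAct, MRKL, Toolformer, HuggingGPT) reduce to one control loop: the model alternates free-form reasoning with tool calls whose results are fed back into the context. A minimal ReAct-style sketch of that loop, with `generate` as a placeholder LLM call and a single toy calculator tool; the Thought/Action/Observation line format is an assumption of this sketch:

```python
# Minimal sketch of a ReAct-style reason/act loop (Yao et al., 2022): the
# model emits Thought/Action lines, the controller runs the tool and feeds
# the Observation back. `generate` is a placeholder LLM call.
def generate(prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def calculator(expr: str) -> str:
    # Toy tool for the sketch only; never eval untrusted input in practice.
    return str(eval(expr, {"__builtins__": {}}))

def react(question: str, max_steps: int = 5) -> str:
    trace = (
        "Answer the question. Interleave lines of the form\n"
        "Thought: ... / Action: calculator[<expr>] / Observation: ...\n"
        "and finish with 'Final Answer: ...'.\n"
        f"Question: {question}\n"
    )
    for _ in range(max_steps):
        step = generate(trace)          # model proposes a thought/action
        trace += step + "\n"
        if "Final Answer:" in step:
            return step.split("Final Answer:")[-1].strip()
        if "Action: calculator[" in step:
            expr = step.split("Action: calculator[")[1].split("]")[0]
            trace += f"Observation: {calculator(expr)}\n"  # feed result back
    return "no answer within step budget"
```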

Reasoning

  • Zheng, Chuanyang and Liu, Zhengying and Xie, Enze and Li, Zhenguo and Li, Yu, Progressive-Hint Prompting Improves Reasoning in Large Language Models, 2023 [Paper]

  • Prystawski, Ben and Goodman, Noah D., Why think step-by-step? Reasoning emerges from the locality of experience, 2023 [Paper]

  • Yu, Fei and Zhang, Hongbo and Wang, Benyou, Natural Language Reasoning, A Survey, 2023 [Paper]

  • Fu, Yao and Peng, Hao and Sabharwal, Ashish and Clark, Peter and Khot, Tushar, Complexity-Based Prompting for Multi-Step Reasoning, 2023 [Paper]

  • Imani, Shima and Du, Liang and Shrivastava, Harsh, MathPrompter: Mathematical Reasoning using Large Language Models, 2023 [Paper]

  • Surís, Dídac and Menon, Sachit and Vondrick, Carl, ViperGPT: Visual Inference via Python Execution for Reasoning, 2023 [Paper]

  • Yang, Sherry and Nachum, Ofir and Du, Yilun and Wei, Jason and Abbeel, Pieter and Schuurmans, Dale, Foundation Models for Decision Making: Problems, Methods, and Opportunities, 2023 [Paper]

  • Yu, Hangyeol and Jeong, Myeongho and Shin, Jamin and Moon, Hyeongdon and Park, Juneyoung and Choi, Seungtaek, Towards Zero-Shot Functional Compositionality of Language Models, 2023 [Paper]

  • Reppert, Justin and Rachbach, Ben and George, Charlie and Stebbing, Luke and Byun, Jungwon and Appleton, Maggie and Stuhlmüller, Andreas, Iterated Decomposition: Improving Science Q&A by Supervising Reasoning Processes, 2023 [Paper]

  • Blair-Stanek, Andrew and Holzenberger, Nils and Van Durme, Benjamin, Can GPT-3 Perform Statutory Reasoning?, 2023 [Paper]

  • Huang, Jiaxin and Gu, Shixiang Shane and Hou, Le and Wu, Yuexin and Wang, Xuezhi and Yu, Hongkun and Han, Jiawei, Large Language Models Can Self-improve, 2023 [Paper]

  • Gurrapu, Sai and Kulkarni, Ajay and Huang, Lifu and Lourentzou, Ismini and Freeman, Laura and Batarseh, Feras A., Rationalization for Explainable NLP: A Survey, 2023 [Paper]

  • Gao, Luyu and Madaan, Aman and Zhou, Shuyan and Alon, Uri and Liu, Pengfei and Yang, Yiming and Callan, Jamie and Neubig, Graham, PAL: Program-aided Language Models, 2023 [Paper]

  • Li, Yifei and Lin, Zeqi and Zhang, Shizhuo and Fu, Qiang and Chen, Bei and Lou, Jian-Guang and Chen, Weizhu, On the Advance of Making Language Models Better Reasoners, 2022 [Paper]

  • Kazemi, Seyed Mehran and Kim, Najoung and Bhatia, Deepti and Xu, Xin and Ramachandran, Deepak, LAMBADA: Backward Chaining for Automated Reasoning in Natural Language, 2022 [Paper]

  • Marasović, Ana and Beltagy, Iz and Downey, Doug and Peters, Matthew E., Few-Shot Self-Rationalization with Natural Language Prompts, 2022 [Paper]

  • Chen, Yanda and Zhong, Ruiqi and Zha, Sheng and Karypis, George and He, He, Meta-learning via Language Model In-context Tuning, 2022 [Paper]

  • Lampinen, Andrew K. and Dasgupta, Ishita and Chan, Stephanie C. Y. and Matthewson, Kory and Tessler, Michael Henry and Creswell, Antonia and McClelland, James L. and Wang, Jane X. and Hill, Felix, Can language models learn from explanations in context?, 2022 [Paper]

  • Wei, Jason and Wang, Xuezhi and Schuurmans, Dale and Bosma, Maarten and Ichter, Brian and Xia, Fei and Chi, Ed and Le, Quoc and Zhou, Denny, Chain of Thought Prompting Elicits Reasoning in Large Language Models, 2022 [Paper]

  • Ye, Xi and Durrett, Greg, The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning, 2022 [Paper]

  • Jackson, Ilya and Saenz, Maria Jesus, From Natural Language to Simulations: Applying GPT-3 Codex to Automate Simulation Modeling of Logistics Systems, 2022 [Paper]

  • Lu, Pan and Qiu, Liang and Chang, Kai-Wei and Wu, Ying Nian and Zhu, Song-Chun and Rajpurohit, Tanmay and Clark, Peter and Kalyan, Ashwin, Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning, 2022 [Paper]

  • Wu, Tongshuang and Jiang, Ellen and Donsbach, Aaron and Gray, Jeff and Molina, Alejandra and Terry, Michael and Cai, Carrie J., PromptChainer: Chaining Large Language Model Prompts through Visual Programming, 2022 [Paper]

  • Creswell, Antonia and Shanahan, Murray, Faithful Reasoning Using Large Language Models, 2022 [Paper]

  • Taylor, Ross and Kardas, Marcin and Cucurull, Guillem and Scialom, Thomas and Hartshorn, Anthony and Saravia, Elvis and Poulton, Andrew and Kerkez, Viktor and Stojnic, Robert, Galactica: A Large Language Model for Science, 2022 [Paper]

  • Wu, Tongshuang and Terry, Michael and Cai, Carrie J., AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts, 2022 [Paper]

  • Wang, Boshi and Deng, Xiang and Sun, Huan, Iteratively Prompt Pre-trained Language Models for Chain of Thought, 2022 [Paper]

  • Khot, Tushar and Trivedi, Harsh and Finlayson, Matthew and Fu, Yao and Richardson, Kyle and Clark, Peter and Sabharwal, Ashish, Decomposed Prompting: A Modular Approach for Solving Complex Tasks, 2022 [Paper]

  • Dua, Dheeru and Gupta, Shivanshu and Singh, Sameer and Gardner, Matt, Successive Prompting for Decomposing Complex Questions, 2022 [Paper]

  • Zelikman, Eric and Wu, Yuhuai and Mu, Jesse and Goodman, Noah D., STaR: Bootstrapping Reasoning With Reasoning, 2022 [Paper]

  • Yang, Jingfeng and Jiang, Haoming and Yin, Qingyu and Zhang, Danqing and Yin, Bing and Yang, Diyi, SeqZero: Few-shot Compositional Semantic Parsing with Sequential Prompts and Zero-shot Models, 2022 [Paper]

  • Qiao, Shuofei and Ou, Yixin and Zhang, Ningyu and Chen, Xiang and Yao, Yunzhi and Deng, Shumin and Tan, Chuanqi and Huang, Fei and Chen, Huajun, Reasoning with Language Model Prompting: A Survey, 2022 [Paper]

  • Huang, Jie and Chang, Kevin Chen-Chuan, Towards Reasoning in Large Language Models: A Survey, 2022 [Paper]

  • Press, Ofir and Zhang, Muru and Min, Sewon and Schmidt, Ludwig and Smith, Noah A. and Lewis, Mike, Measuring and Narrowing the Compositionality Gap in Language Models, 2022 [Paper]

  • Creswell, Antonia and Shanahan, Murray and Higgins, Irina, Selection-Inference: Exploiting Large Language Models for Interpretable Logical Reasoning, 2022 [Paper]

  • Karpas, Ehud and Abend, Omri and Belinkov, Yonatan and Lenz, Barak et al., MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning, 2022 [Paper]

  • Nye, Maxwell and Andreassen, Anders Johan and Gur-Ari, Guy and Michalewski, Henryk et al., Show Your Work: Scratchpads for Intermediate Computation with Language Models, 2021 [Paper]
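
A common mechanic across the prompting papers above (e.g., Wei et al., 2022 on chain-of-thought; Nye et al., 2021 on scratchpads) is eliciting intermediate reasoning before the final answer. Below is a minimal, illustrative sketch of chain-of-thought prompting; the `complete` function is a hypothetical stand-in for any LLM completion API, not any paper's actual implementation.

```python
# Minimal chain-of-thought prompting sketch (illustrative only).
def complete(prompt: str) -> str:  # hypothetical LLM call
    raise NotImplementedError("plug in your model client here")

# A few-shot exemplar demonstrating step-by-step reasoning (from
# Wei et al., 2022), followed by the actual question. Ending with
# "A:" invites the model to emit its reasoning before the answer.
COT_PROMPT = """\
Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls.
Each can has 3 tennis balls. How many tennis balls does he have now?
A: Roger started with 5 balls. 2 cans of 3 balls each is 6 balls.
5 + 6 = 11. The answer is 11.

Q: {question}
A:"""

def answer_with_cot(question: str) -> str:
    # The output contains the reasoning chain; by the exemplar's
    # convention the final answer follows "The answer is".
    output = complete(COT_PROMPT.format(question=question))
    return output.split("The answer is")[-1].strip()
```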

Long-text Generation

  • Faltings, Felix and Galley, Michel and Peng, Baolin and Brantley, Kianté and Cai, Weixin and Zhang, Yizhe and Gao, Jianfeng and Dolan, Bill, Interactive Text Generation, 2023 [Paper]

  • Xie, Zhuohan and Cohn, Trevor and Lau, Jey Han, Can Very Large Pretrained Language Models Learn Storytelling With A Few Examples?, 2023 [Paper]

  • Peng, Xiangyu and Cui, Christopher and Zhou, Wei and Jia, Renee and Riedl, Mark, Story Shaping: Teaching Agents Human-like Behavior with Stories, 2023 [Paper]

  • Toplyn, Joe, Witscript: A System for Generating Improvised Jokes in a Conversation, 2023 [Paper]

  • Yang, Kevin and Tian, Yuandong and Peng, Nanyun and Klein, Dan, Re3: Generating Longer Stories With Recursive Reprompting and Revision, 2022 [Paper]

  • Li, Junyi and Tang, Tianyi and Zhao, Wayne Xin and Nie, Jian-Yun and Wen, Ji-Rong, Pretrained Language Models for Text Generation: A Survey, 2022 [Paper]

  • Yang, Kevin and Klein, Dan and Peng, Nanyun and Tian, Yuandong, DOC: Improving Long Story Coherence With Detailed Outline Control, 2022 [Paper]

  • Chang, Haw-Shiuan and Yuan, Jiaming and Iyyer, Mohit and McCallum, Andrew, Changing the Mind of Transformers for Topically-Controllable Language Generation, 2021 [Paper]

  • Li, Junyi and Tang, Tianyi and Zhao, Wayne Xin and Wen, Ji-Rong, Pretrained Language Models for Text Generation: A Survey, 2021 [Paper]

  • Garbacea, Cristina and Mei, Qiaozhu, Neural Language Generation: Formulation, Methods, and Evaluation, 2020 [Paper]

  • Dathathri, Sumanth and Madotto, Andrea and Lan, Janice and Hung, Jane and Frank, Eric and Molino, Piero and Yosinski, Jason and Liu, Rosanne, Plug and Play Language Models: A Simple Approach to Controlled Text Generation, 2020 [Paper]
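
Several of the story-generation papers above (Re3, DOC) share an outline-then-expand loop that re-prompts the model with its own intermediate plan. The sketch below illustrates that general idea only, reusing the same hypothetical `complete` helper; it is not the authors' actual method.

```python
# Outline-then-expand long-form generation, loosely in the spirit
# of the recursive-reprompting papers above (illustrative sketch).
def complete(prompt: str) -> str:  # hypothetical LLM call
    raise NotImplementedError("plug in your model client here")

def write_story(premise: str, n_sections: int = 4) -> str:
    # Step 1: ask the model for a numbered plot outline.
    outline = complete(
        f"Premise: {premise}\n"
        f"Write a numbered outline with {n_sections} plot points."
    )
    # Step 2: expand each plot point, re-prompting with the premise,
    # the full outline, and the story so far to keep it coherent.
    story = ""
    for i in range(1, n_sections + 1):
        story += complete(
            f"Premise: {premise}\nOutline:\n{outline}\n"
            f"Story so far:\n{story}\n"
            f"Continue the story by writing out plot point {i}."
        ) + "\n\n"
    return story
```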

Analysis and Boundary

  • Yang, Kailai and Ji, Shaoxiong and Zhang, Tianlin and Xie, Qianqian and Ananiadou, Sophia, On the Evaluations of ChatGPT and Emotion-enhanced Prompting for Mental Health Analysis, 2023 [Paper]

  • Liu, Hanmeng and Ning, Ruoxi and Teng, Zhiyang and Liu, Jian and Zhou, Qiji and Zhang, Yue, Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4, 2023 [Paper]

  • Wang, Zengzhi and Xie, Qiming and Ding, Zixiang and Feng, Yi and Xia, Rui, Is ChatGPT a Good Sentiment Analyzer? A Preliminary Study, 2023 [Paper]

  • Zhang, Chaoning and Zhang, Chenshuang and Li, Chenghao et al., One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era, 2023 [Paper]

  • Zhong, Wanjun and Cui, Ruixiang and Guo, Yiduo and Liang, Yaobo and Lu, Shuai and Wang, Yanlin and Saied, Amin and Chen, Weizhu and Duan, Nan, AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models, 2023 [Paper]

  • Puchert, Patrik and Poonam, Poonam and van Onzenoodt, Christian and Ropinski, Timo, LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models, 2023 [Paper]

  • Bian, Ning and Han, Xianpei and Sun, Le and Lin, Hongyu and Lu, Yaojie and He, Ben, ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models, 2023 [Paper]

  • Liu, Yang and Iter, Dan and Xu, Yichong and Wang, Shuohang and Xu, Ruochen and Zhu, Chenguang, GPTEval: NLG Evaluation using GPT-4 with Better Human Alignment, 2023 [Paper]

  • Jojic, Ana and Wang, Zhen and Jojic, Nebojsa, GPT is becoming a Turing machine: Here are some ways to program it, 2023 [Paper]

  • Jesse, Kevin and Ahmed, Toufique and Devanbu, Premkumar T. and Morgan, Emily, Large Language Models and Simple, Stupid Bugs, 2023 [Paper]

  • Chang, Tyler A. and Bergen, Benjamin K., Language Model Behavior: A Comprehensive Survey, 2023 [Paper]

  • Dhingra, Sifatkaur and Singh, Manmeet and SB, Vaisakh and Malviya, Neetiraj, Mind meets machine: Unravelling GPT-4's cognitive psychology, 2023 [Paper]

  • Ye, Junjie and Chen, Xuanting and Xu, Nuo and Zu, Can and Shao, Zekai and Liu, Shichun and Cui, Yuhan and Zhou, Zeyang and Gong, Chao and Shen, Yang and Zhou, Jie and Chen, Siming and Gui, Tao and Zhang, Qi and Huang, Xuanjing, A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models, 2023 [Paper]

  • Veseli, Blerta and Singhania, Sneha and Razniewski, Simon and Weikum, Gerhard, Evaluating Language Models for Knowledge Base Completion, 2023 [Paper]

  • Lyu, Qing and Tan, Josh and Zapadka, Mike E. and Ponnatapuram, Janardhana and Niu, Chuang and Wang, Ge and Whitlow, Christopher T., Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential, 2023 [Paper]

  • White, Jules and Hays, Sam and Fu, Quchen and Spencer-Smith, Jesse and Schmidt, Douglas C., ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design, 2023 [Paper]

  • Hahn, Michael and Goyal, Navin, A Theory of Emergent In-Context Learning as Implicit Structure Induction, 2023 [Paper]

  • Cao, Boxi and Lin, Hongyu and Han, Xianpei and Sun, Le, The Life Cycle of Knowledge in Big Language Models: A Survey, 2023 [Paper]

  • Tan, Yiming and Min, Dehai and Li, Yu and Li, Wenbo and Hu, Nan and Chen, Yongrui and Qi, Guilin, Evaluation of ChatGPT as a Question Answering System for Answering Complex Questions, 2023 [Paper]

  • Wang, Jiaan and Liang, Yunlong and Meng, Fandong and Shi, Haoxiang and Li, Zhixu and Xu, Jinan and Qu, Jianfeng and Zhou, Jie, Is ChatGPT a Good NLG Evaluator? A Preliminary Study, 2023 [Paper]

  • Amin, Mostafa M. and Cambria, Erik and Schuller, Björn W., Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT, 2023 [Paper]

  • Zhong, Ruiqi and Zhang, Peter and Li, Steve and Ahn, Jinwoo and Klein, Dan and Steinhardt, Jacob, Goal Driven Discovery of Distributional Differences via Language Descriptions, 2023 [Paper]

  • Wang, Yizhong and Mishra, Swaroop and Alipoormolabashi, Pegah and Kordi, Yeganeh et al., Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks, 2022 [Paper]

  • Chen, Xuanting and Ye, Junjie and Peng, Minlong and Zhou, Jie and Gui, Tao and Zhang, Qi and Huang, Xuanjing, How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks, 2023 [Paper]

  • Leiter, Christoph and Zhang, Ran and Chen, Yanran and Belouadi, Jonas and Larionov, Daniil and Fresen, Vivian and Eger, Steffen, ChatGPT: A Meta-Analysis after 2.5 Months, 2023 [Paper]

  • Bommarito, Jillian and Bommarito, Michael and Katz, Daniel Martin and Katz, Jessica, GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities, 2023 [Paper]

  • Shahriar, Sakib and Hayawi, Kadhim, Let's have a chat! A Conversation with ChatGPT: Technology, Applications, and Limitations, 2023 [Paper]

  • Michelmann, Sebastian and Kumar, Manoj and Norman, Kenneth A. and Toneva, Mariya, Large language models can segment narrative events similarly to humans, 2023 [Paper]

  • Shi, Freda and Chen, Xinyun and Misra, Kanishka and Scales, Nathan and Dohan, David and Chi, Ed and Schärli, Nathanael and Zhou, Denny, Large Language Models Can Be Easily Distracted by Irrelevant Context, 2023 [Paper]

  • Carta, Thomas and Romac, Clément and Wolf, Thomas and Lamprier, Sylvain and Sigaud, Olivier and Oudeyer, Pierre-Yves, Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning, 2023 [Paper]

  • Fu, Jinlan and Ng, See-Kiong and Jiang, Zhengbao and Liu, Pengfei, GPTScore: Evaluate as You Desire, 2023 [Paper]

  • Xie, Sang Michael and Santurkar, Shibani and Ma, Tengyu and Liang, Percy, Data Selection for Language Models via Importance Resampling, 2023 [Paper]

  • Jang, Joel and Kim, Seungone and Ye, Seonghyeon and Kim, Doyoung and Logeswaran, Lajanugen and Lee, Moontae and Lee, Kyungjae and Seo, Minjoon, Exploring the Benefits of Training Expert Language Models over Instruction Tuning, 2023 [Paper]

  • Borji, Ali, A Categorical Archive of ChatGPT Failures, 2023 [Paper]

  • Zhang, Tianjun and Liu, Fangchen and Wong, Justin and Abbeel, Pieter and Gonzalez, Joseph E., The Wisdom of Hindsight Makes Language Models Better Instruction Followers, 2023 [Paper]

  • Ullman, Tomer, Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks, 2023 [Paper]

  • Mökander, Jakob and Schuett, Jonas and Kirk, Hannah Rose and Floridi, Luciano, Auditing large language models: a three-layered approach, 2023 [Paper]

  • Huang, Fan and Kwak, Haewoon and An, Jisun, Is ChatGPT better than Human Annotators? Potential and Limitations of ChatGPT in Explaining Implicit Hate Speech, 2023 [Paper]

  • Kocoń, Jan and Cichecki, Igor and Kaszyca, Oliwier et al., ChatGPT: Jack of all trades, master of none, 2023 [Paper]

  • Wang, Jindong and Hu, Xixu and Hou, Wenxin and Chen, Hao and Zheng, Runkai and Wang, Yidong and Yang, Linyi and Huang, Haojun and Ye, Wei and Geng, Xiubo and Jiao, Binxin and Zhang, Yue and Xie, Xing, On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective, 2023 [Paper]

  • Akyürek, Ekin and Schuurmans, Dale and Andreas, Jacob and Ma, Tengyu and Zhou, Denny, What learning algorithm is in-context learning? Investigations with linear models, 2022 [Paper]

  • Dai, Damai and Sun, Yutao and Dong, Li and Hao, Yaru and Sui, Zhifang and Wei, Furu, Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers, 2022 [Paper]

  • Yang, Sen and Zhang, Yunchen and Cui, Leyang and Zhang, Yue, Do Prompts Solve NLP Tasks Using Natural Language?, 2022 [Paper]

  • Ross, Alexis and Peters, Matthew E. and Marasović, Ana, Does Self-Rationalization Improve Robustness to Spurious Correlations?, 2022 [Paper]

  • Wei, Jason and Tay, Yi and Bommasani, Rishi et al., Emergent Abilities of Large Language Models, 2022 [Paper]

  • Li, Jiaoda and Cotterell, Ryan and Sachan, Mrinmaya, Probing via Prompting, 2022 [Paper]

  • Zhang, Renrui and Zeng, Ziyao and Guo, Ziyu, Can Language Understand Depth?, 2022 [Paper]

  • Kann, Katharina and Ebrahimi, Abteen and Koh, Joewie and Dudy, Shiran and Roncone, Alessandro, Open-domain Dialogue Generation: What We Can Do, Cannot Do, And Should Do Next, 2022 [Paper]

  • Kadavath, Saurav and Conerly, Tom and Askell, Amanda et al., Language Models (Mostly) Know What They Know, 2022 [Paper]

  • Wei, Xiaokai and Wang, Shen and Zhang, Dejiao and Bhatia, Parminder and Arnold, Andrew, Knowledge Enhanced Pretrained Language Models: A Comprehensive Survey, 2021 [Paper]

  • Liu, Pengfei and Yuan, Weizhe and Fu, Jinlan and Jiang, Zhengbao and Hayashi, Hiroaki and Neubig, Graham, Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing, 2021 [Paper]

  • Webson, Albert and Pavlick, Ellie, Do Prompt-Based Models Really Understand the Meaning of their Prompts?, 2021 [Paper]

  • Elazar, Yanai and Kassner, Nora and Ravfogel, Shauli and Ravichander, Abhilasha and Hovy, Eduard and Schütze, Hinrich and Goldberg, Yoav, Measuring and Improving Consistency in Pretrained Language Models, 2021 [Paper]

  • Hase, Peter and Diab, Mona and Celikyilmaz, Asli and Li, Xian and Kozareva, Zornitsa and Stoyanov, Veselin and Bansal, Mohit and Iyer, Srinivasan, Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs, 2021 [Paper]

  • Da, Jeff and Bras, Ronan Le and Lu, Ximing and Choi, Yejin and Bosselut, Antoine, Analyzing Commonsense Emergence in Few-shot Knowledge Models, 2021 [Paper]

  • Jiang, Zhengbao and Araki, Jun and Ding, Haibo and Neubig, Graham, How Can We Know When Language Models Know? On the Calibration of Language Models for Question Answering, 2021 [Paper]

  • Le Scao, Teven and Rush, Alexander, How many data points is a prompt worth?, 2021 [Paper]

  • Jiang, Zhengbao and Xu, Frank F. and Araki, Jun and Neubig, Graham, How Can We Know What Language Models Know?, 2020 [Paper]

  • Henighan, Tom and Kaplan, Jared and Katz, Mor and Chen, Mark and Hesse, Christopher and Jackson, Jacob and Jun, Heewoo and Brown, Tom B. and Dhariwal, Prafulla and Gray, Scott and Hallacy, Chris and Mann, Benjamin and Radford, Alec and Ramesh, Aditya and Ryder, Nick and Ziegler, Daniel M. and Schulman, John and Amodei, Dario and McCandlish, Sam, Scaling Laws for Autoregressive Generative Modeling, 2020 [Paper]

Motivation

Emotion

  • Liu, Yiren and Kilicoglu, Halil, Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation, 2023 [Paper]

  • Gao, Pan and Han, Donghong and Zhou, Rui and Zhang, Xuejiao and Wang, Zikun, CAB: Empathetic Dialogue Generation with Cognition, Affection and Behavior, 2023 [Paper]

  • Qian, Yushan and Wang, Bo and Ma, Shangzhao and Wu, Bin and Zhang, Shuo and Zhao, Dongming and Huang, Kun and Hou, Yuexian, Think Twice: A Human-like Two-stage Conversational Agent for Emotional Response Generation, 2023 [Paper]

  • Kasahara, Tomohito and Kawahara, Daisuke and Tung, Nguyen and Li, Shengzhe and Shinzato, Kenta and Sato, Toshinori, Building a Personalized Dialogue System with Prompt-Tuning, 2022 [Paper]

  • Jiang, Chenglin and Zhang, Chunhong and Ji, Yang and Hu, Zheng and Zhan, Zhiqiang and Yang, Guanghua, An affective chatbot with controlled specific emotion expression, 2022 [Paper]

  • Cheng, Jiale and Sabour, Sahand and Sun, Hao and Chen, Zhuang and Huang, Minlie, PAL: Persona-Augmented Emotional Support Conversation Generation, 2022 [Paper]

  • Li, Qintong and Li, Piji and Ren, Zhaochun and Ren, Pengjie and Chen, Zhumin, Knowledge Bridging for Empathetic Dialogue Generation, 2021 [Paper]

Persona

  • Deshpande, Ameet and Murahari, Vishvak and Rajpurohit, Tanmay and Kalyan, Ashwin and Narasimhan, Karthik, Toxicity in ChatGPT: Analyzing Persona-assigned Language Models, 2023 [Paper]

  • Rao, Haocong and Leung, Cyril and Miao, Chunyan, Can ChatGPT Assess Human Personalities? A General Evaluation Framework, 2023 [Paper]

  • Ramirez, Angela and Alsalihy, Mamon and Aggarwal, Kartik and Li, Cecilia and Wu, Liren and Walker, Marilyn, Controlling Personality Style in Dialogue with Zero-Shot Prompt-Based Learning, 2023 [Paper]

  • Lim, Jungwoo and Kang, Myunghoon and Hur, Yuna and Jung, Seungwon and Kim, Jinsung and Jang, Yoonna and Lee, Dongyub and Ji, Hyesung and Shin, Donghoon and Kim, Seungryong and Lim, Heuiseok, You Truly Understand What I Need: Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona, 2023 [Paper]

  • Chen, Ruijun and Wang, Jin and Yu, Liang-Chih and Zhang, Xuejie, Learning to Memorize Entailment and Discourse Relations for Persona-Consistent Dialogues, 2023 [Paper]

  • Ross, Steven I. and Muller, Michael and Martinez, Fernando and Houde, Stephanie and Weisz, Justin D., A Case Study in Engineering a Conversational Programming Assistant's Persona, 2023 [Paper]

  • Kasahara, Tomohito and Kawahara, Daisuke and Tung, Nguyen and Li, Shengzhe and Shinzato, Kenta and Sato, Toshinori, Building a Personalized Dialogue System with Prompt-Tuning, 2022 [Paper]

  • Matveev, Yuri and Makhnytkina, Olesia and Posokhov, Pavel and Matveev, Anton and Skrylnikov, Stepan, Personalizing Hybrid-Based Dialogue Agents, 2022 [Paper]

  • Xu, Xinchao and Gou, Zhibin and Wu, Wenquan and Niu, Zheng-Yu and Wu, Hua and Wang, Haifeng and Wang, Shihang, Long Time No See! Open-Domain Conversation with Long-Term Persona Memory, 2022 [Paper]

  • Liu, Yifan and Wei, Wei and Liu, Jiayi and Mao, Xianling and Fang, Rui and Chen, Dangyang, Improving Personality Consistency in Conversation by Persona Extending, 2022 [Paper]

  • Natarajan, Bharatram and Nargund, Abhijit, MRE : Multi Relationship Extractor for Persona based Empathetic Conversational Model, 2021 [Paper]

  • Zhong, Peixiang and Zhang, Chen and Wang, Hao and Liu, Yong and Miao, Chunyan, Towards Persona-Based Empathetic Conversational Models, 2020 [Paper]

  • Liu, Qian and Chen, Yihong and Chen, Bei and Lou, Jian-Guang and Chen, Zixuan and Zhou, Bin and Zhang, Dongmei, You Impress Me: Dialogue Generation via Mutual Persona Perception, 2020 [Paper]

  • Song, Haoyu and Wang, Yan and Zhang, Wei-Nan and Liu, Xiaojiang and Liu, Ting, Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue Generation, 2020 [Paper]
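
A recurring pattern in the persona work above (e.g., the zero-shot prompt-based control of Ramirez et al., 2023) is conditioning a dialogue model on persona facts purely through the prompt. The following is a minimal, hypothetical sketch of that pattern, with the same stand-in `complete` helper as in the earlier sketches; the template wording is invented for illustration.

```python
# Zero-shot persona-conditioned reply (illustrative sketch only).
def complete(prompt: str) -> str:  # hypothetical LLM call
    raise NotImplementedError("plug in your model client here")

PERSONA_TEMPLATE = """\
You are {name}. Persona facts:
{facts}

Stay in character and keep every reply consistent with these facts.

User: {user_turn}
{name}:"""

def persona_reply(name: str, facts: list[str], user_turn: str) -> str:
    # The persona is injected as a bulleted fact list in the prompt;
    # no fine-tuning or memory module is involved in this sketch.
    prompt = PERSONA_TEMPLATE.format(
        name=name,
        facts="\n".join(f"- {fact}" for fact in facts),
        user_turn=user_turn,
    )
    return complete(prompt).strip()
```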

Application

  • Zhang, Haopeng and Liu, Xiao and Zhang, Jiawei, Extractive Summarization via ChatGPT for Faithful Summary Generation, 2023 [Paper]

  • Zhang, Cheng and Bauer, Stefan and Bennett, Paul and Gao, Jiangfeng and Gong, Wenbo and Hilmkil, Agrin and Jennings, Joel and Ma, Chao and Minka, Tom and Pawlowski, Nick and Vaughan, James, Understanding Causality with Large Language Models: Feasibility and Opportunities, 2023 [Paper]

  • Boiko, Daniil A. and MacKnight, Robert and Gomes, Gabe, Emergent autonomous scientific research capabilities of large language models, 2023 [Paper]

  • Nair, Varun and Schumacher, Elliot and Tso, Geoffrey and Kannan, Anitha, DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents, 2023 [Paper]

  • Liu, Yiheng and Han, Tianle and Ma, Siyuan et al., Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models, 2023 [Paper]

  • Gao, Mingqi and Ruan, Jie and Sun, Renliang and Yin, Xunjian and Yang, Shiping and Wan, Xiaojun, Human-like Summarization Evaluation with ChatGPT, 2023 [Paper]

  • Jourdan, Léane and Boudin, Florian and Dufour, Richard and Hernandez, Nicolas, Text revision in Scientific Writing Assistance: An Overview, 2023 [Paper]

  • He, Xingwei and Lin, Zhenghao and Gong, Yeyun and Jin, A.-Long and Zhang, Hang and Lin, Chen and Jiao, Jian and Yiu, Siu Ming and Duan, Nan and Chen, Weizhu, AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators, 2023 [Paper]

  • Li, Yunxiang and Li, Zihan and Zhang, Kai and Dan, Ruilong and Zhang, You, ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge, 2023 [Paper]

  • Peng, Keqin and Ding, Liang and Zhong, Qihuang and Shen, Li and Liu, Xuebo and Zhang, Min and Ouyang, Yuanxin and Tao, Dacheng, Towards Making the Most of ChatGPT for Machine Translation, 2023 [Paper]

  • Zhou, Pengyuan, Unleashing ChatGPT on the Metaverse: Savior or Destroyer?, 2023 [Paper]

  • Wu, Patrick Y. and Tucker, Joshua A. and Nagler, Jonathan and Messing, Solomon, Large Language Models Can Be Used to Estimate the Ideologies of Politicians in a Zero-Shot Learning Setting, 2023 [Paper]

  • Kutela, Boniphace and Li, Shoujia and Das, Subasish and Liu, Jinli, ChatGPT as the Transportation Equity Information Source for Scientific Writing, 2023 [Paper]

  • Liu, Zhengliang and Yu, Xiaowei and Zhang, Lu and Wu, Zihao and Cao, Chao and Dai, Haixing and Zhao, Lin and Liu, Wei and Shen, Dinggang and Li, Quanzheng and Liu, Tianming and Zhu, Dajiang and Li, Xiang, DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4, 2023 [Paper]

  • Wang, Jiaan and Liang, Yunlong and Meng, Fandong and Li, Zhixu and Qu, Jianfeng and Zhou, Jie, Cross-Lingual Summarization via ChatGPT, 2023 [Paper]

  • Imani, Shima and Du, Liang and Shrivastava, Harsh, Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification, 2023 [Paper]

  • Di Bartolomeo, Sara and Severi, Giorgio and Schetinger, Victor and Dunne, Cody, Ask and You Shall Receive (a Graph Drawing): Testing ChatGPT's Potential to Apply Graph Layout Algorithms, 2023 [Paper]

  • Burtell, Matthew and Woodside, Thomas, Artificial Influence: An Analysis Of AI-Driven Persuasion, 2023 [Paper]

  • Prodan, George and Pelican, Elena, Prompt scoring system for dialogue summarization using GPT-3, 2022 [Paper]

  • Kargaran, Amir Hossein and Nikeghbal, Nafiseh and Heydarnoori, Abbas and Schütze, Hinrich, MenuCraft: Interactive Menu System Design with Large Language Models, 2023 [Paper]

  • Polak, Maciej P. and Morgan, Dane, Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering -- Example of ChatGPT, 2023 [Paper]

  • Zhang, Dell and Schilder, Frank and Conrad, Jack G. and Makrehchi, Masoud and von Rickenbach, David and Moulinier, Isabelle, Making a Computational Attorney, 2023 [Paper]

  • Imani, Shima and Du, Liang and Shrivastava, Harsh, MathPrompter: Mathematical Reasoning using Large Language Models, 2023 [Paper]

  • Dai, Haixing and Liu, Zhengliang and Liao, Wenxiong and Huang, Xiaoke and Wu, Zihao and Zhao, Lin and Liu, Wei and Liu, Ninghao and Li, Sheng and Zhu, Dajiang and Cai, Hongmin and Li, Quanzheng and Shen, Dinggang and Liu, Tianming and Li, Xiang, ChatAug: Leveraging ChatGPT for Text Data Augmentation, 2023 [Paper]

  • Qin, Chengwei and Zhang, Aston and Zhang, Zhuosheng and Chen, Jiaao and Yasunaga, Michihiro and Yang, Diyi, Is ChatGPT a General-Purpose Natural Language Processing Task Solver?, 2023 [Paper]

  • Brocki, Lennart and Dyer, George C. and Gładka, Anna and Chung, Neo Christopher, Deep Learning Mental Health Dialogue System, 2023 [Paper]

  • Ott, Simon and Hebenstreit, Konstantin and Liévin, Valentin and Hother, Christoffer Egeberg and Moradi, Milad and Mayrhauser, Maximilian and Praas, Robert and Winther, Ole and Samwald, Matthias, ThoughtSource: A central hub for large language model reasoning data, 2023 [Paper]

  • Nguyen, Ha-Thanh, A Brief Report on LawGPT 1.0: A Virtual Legal Assistant Based on GPT-3, 2023 [Paper]

  • Argyle, Lisa P. and Busby, Ethan and Gubler, Joshua and Bail, Chris and Howe, Thomas and Rytting, Christopher and Wingate, David, AI Chat Assistants can Improve Conversations about Divisive Topics, 2023 [Paper]

  • Kocaballi, A. Baki, Conversational AI-Powered Design: ChatGPT as Designer, User, and Product, 2023 [Paper]

  • Shanahan, Murray, Talking About Large Language Models, 2023 [Paper]

  • Shibata, Hisaichi and Miki, Soichiro and Nakamura, Yuta, Playing the Werewolf game with artificial intelligence for language understanding, 2023 [Paper]

  • Megahed, Fadel M. and Chen, Ying-Ju and Ferris, Joshua A. and Knoth, Sven and Jones-Farmer, L. Allison, How Generative AI models such as ChatGPT can be (Mis)Used in SPC Practice, Education, and Research? An Exploratory Study, 2023 [Paper]

  • Lehman, Eric and Hernandez, Evan and Mahajan, Diwakar and Wulff, Jonas and Smith, Micah J. and Ziegler, Zachary and Nadler, Daniel and Szolovits, Peter and Johnson, Alistair and Alsentzer, Emily, Do We Still Need Clinical Language Models?, 2023 [Paper]

  • Zeng, Donghuo and Wu, Jianming and Wang, Yanan and Matsumoto, Kazunori and Hattori, Gen and Ikeda, Kazushi, Topic-switch adapted Japanese Dialogue System based on PLATO-2, 2023 [Paper]

  • Feng, Yutao and Qiang, Jipeng and Li, Yun and Yuan, Yunhao and Zhu, Yi, Sentence Simplification via Large Language Models, 2023 [Paper]

  • Noever, David and McKee, Forrest, Chatbots as Problem Solvers: Playing Twenty Questions with Role Reversals, 2022 [Paper]

  • Zhan, Haolan and Wang, Yufei and Feng, Tao and Hua, Yuncheng and Sharma, Suraj and Li, Zhuang and Qu, Lizhen and Haffari, Gholamreza, Let's Negotiate! A Survey of Negotiation Dialogue Systems, 2022 [Paper]

Safety

  • Li, Haoran and Guo, Dadi and Fan, Wei and Xu, Mingshi and Song, Yangqiu, Multi-step Jailbreaking Privacy Attacks on ChatGPT, 2023 [Paper]

  • Raman, Mrigank and Maini, Pratyush and Kolter, J. Zico and Lipton, Zachary C. and Pruthi, Danish, Model-tuning Via Prompts Makes NLP Models Adversarially Robust, 2023 [Paper]

  • Greshake, Kai and Abdelnabi, Sahar and Mishra, Shailesh and Endres, Christoph and Holz, Thorsten and Fritz, Mario, More than you've asked for: A Comprehensive Analysis of Novel Prompt Injection Threats to Application-Integrated Large Language Models, 2023 [Paper]

  • Kang, Daniel and Li, Xuechen and Stoica, Ion and Guestrin, Carlos and Zaharia, Matei and Hashimoto, Tatsunori, Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks, 2023 [Paper]
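
The prompt-injection papers above (e.g., Greshake et al., 2023) hinge on one structural weakness: applications concatenate untrusted text into the same prompt that carries the developer's instructions. Below is a minimal sketch of that vulnerable pattern only, with hypothetical names throughout and the same stand-in `complete` helper as in the earlier sketches.

```python
# Shape of an indirect prompt-injection vulnerability in an
# application-integrated LLM, as analyzed in the papers above.
def complete(prompt: str) -> str:  # hypothetical LLM call
    raise NotImplementedError("plug in your model client here")

SYSTEM_INSTRUCTIONS = "Summarize the following web page for the user."

def summarize_page(untrusted_page: str) -> str:
    # Vulnerable pattern: instructions and untrusted data share one
    # text channel, so content like "ignore previous instructions
    # and ..." embedded in the page competes with developer intent.
    return complete(
        f"{SYSTEM_INSTRUCTIONS}\n\nPage:\n{untrusted_page}\n\nSummary:"
    )
```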

Related Project
