📄 In-context Learning

Paper List

LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation (2024.06.18)

Seyedarmin Azizi, Souvik Kundu, M. Pedram


The Impact of Initialization on LoRA Finetuning Dynamics (2024.06.12)

Soufiane Hayou, Nikhil Ghosh, Bin Yu


An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models (2024.06.07)

Xiongtao Zhou, Jie He, Yuhua Ke, Guangyao Zhu, Víctor Gutiérrez-Basulto, etc


Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning (2024.06.04)

Alex Jinpeng Wang, Linjie Li, Yiqi Lin, Min Li, Lijuan Wang, etc


Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks (2024.06.04)

Tianyu He, Darshil Doshi, Aritra Das, Andrey Gromov


Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models (2024.05.28)

Longze Chen, Ziqiang Liu, Wanwei He, Yunshui Li, Run Luo, etc . - 【arXiv.org】


Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion (2024.05.19)

Pengxiang Lan, Enneng Yang, Yuting Liu, Guibing Guo, Linying Jiang, etc . - 【arXiv.org】


MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning (2024.05.19)

Sanchit Sinha, Yuguang Yue, Victor Soto, Mayank Kulkarni, Jianhua Lu, etc . - 【arXiv.org】


Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning (2024.04.25)

Tianhui Zhang, Bei Peng, D. Bollegala . - 【arXiv.org】


Stronger Random Baselines for In-Context Learning (2024.04.19)

Gregory Yauney, David M. Mimno . - 【arXiv.org】


Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction (2024.04.19)

Qinyuan Wu, Mohammad Aflah Khan, Soumi Das, Vedant Nanda, Bishwamittra Ghosh, etc . - 【arXiv.org】


Point-In-Context: Understanding Point Cloud via In-Context Learning (2024.04.18)

Mengyuan Liu, Zhongbin Fang, Xia Li, Joachim Buhmann, Xiangtai Li, etc . - 【arXiv.org】


AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models (2024.03.20)

Zeyu Liu, Souvik Kundu, Anni Li, Junrui Wan, Lianghao Jiang, etc


Towards Multimodal In-Context Learning for Vision & Language Models (2024.03.19)

Sivan Doveh, Shaked Perek, M. J. Mirza, Amit Alfassy, Assaf Arbelle, etc . - 【arXiv.org】


ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models (2024.03.14)

Runyu Ma, Jelle Luijkx, Zlatan Ajanovic, Jens Kober


Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling (2024.03.11)

W. G. C. Bandara, Vishal M. Patel


Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought (2024.03.08)

James Chua, Edward Rees, Hunar Batra, Samuel R. Bowman, Julian Michael, etc


Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models (2024.02.27)

Yunpeng Huang, Yaonan Gu, Jingwei Xu, Zhihong Zhu, Zhaorun Chen, etc


GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning (2024.02.26)

Aivin V. Solatorio . - 【arXiv.org】


DiffuCOMET: Contextual Commonsense Knowledge Diffusion (2024.02.26)

Silin Gao, Mete Ismayilzada, Mengjie Zhao, Hiromi Wakaki, Yuki Mitsufuji, etc


Long-Context Language Modeling with Parallel Context Encoding (2024.02.26)

Howard Yen, Tianyu Gao, Danqi Chen


Training Nonlinear Transformers for Efficient In-Context Learning: A Theoretical Learning and Generalization Analysis (2024.02.23)

Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen


Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models (2024.02.23)

Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He


In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization (2024.02.22)

Ruiqi Zhang, Jingfeng Wu, Peter L. Bartlett


Unlocking Instructive In-Context Learning with Tabular Prompting for Relational Triple Extraction (2024.02.21)

Guozheng Li, Wenjun Ke, Peng Wang, Zijie Xu, Ke Ji, etc


Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive (2024.02.20)

Arka Pal, Deep Karkhanis, Samuel Dooley, Manley Roberts, Siddartha Naidu, etc


Feedback Loops With Language Models Drive In-Context Reward Hacking (2024.02.09)

Alexander Pan, Erik Jones, Meena Jagadeesan, Jacob Steinhardt . - 【arXiv.org】


On the Convergence of Zeroth-Order Federated Tuning in Large Language Models (2024.02.08)

Zhenqing Ling, Daoyuan Chen, Liuyi Yao, Yaliang Li, Ying Shen . - 【arXiv.org】


EmojiCrypt: Prompt Encryption for Secure Communication with Large Language Models (2024.02.08)

Guo Lin, Wenyue Hua, Yongfeng Zhang . - 【arXiv.org】


Large Language Model Meets Graph Neural Network in Knowledge Distillation (2024.02.08)

Shengxiang Hu, Guobing Zou, Song Yang, Yanglan Gan, Bofeng Zhang, etc . - 【arXiv.org】


G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model (2023.12.18)

Jiahui Gao, Renjie Pi, Jipeng Zhang, Jiacheng Ye, Wanjun Zhong, etc


A mathematical perspective on Transformers (2023.12.17)

Borjan Geshkovski, Cyril Letrouit, Yury Polyanskiy, Philippe Rigollet


Mitigating Label Bias in Machine Learning: Fairness through Confident Learning (2023.12.14)

Yixuan Zhang, Boyu Li, Zenan Ling, Feng Zhou . - 【arXiv.org】


Control Risk for Potential Misuse of Artificial Intelligence in Science (2023.12.11)

Jiyan He, Weitao Feng, Yaosen Min, Jingwei Yi, Kunsheng Tang, etc


WonderJourney: Going from Anywhere to Everywhere (2023.12.06)

Hong-Xing Yu, Haoyi Duan, Junhwa Hur, Kyle Sargent, Michael Rubinstein, etc


Minimizing Factual Inconsistency and Hallucination in Large Language Models (2023.11.23)

I. Muneeswaran, Shreya Saxena, Siva Prasad, M. V. S. Prakash, Advaith Shankar, etc . - 【arXiv.org】


Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents (2023.11.20)

Zhuosheng Zhang, Yao Yao, Aston Zhang, Xiangru Tang, Xinbei Ma, etc . - 【arXiv.org】


An Embodied Generalist Agent in 3D World (2023.11.18)

Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, etc . - 【arXiv.org】


MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning (2023.11.16)

Xiangru Tang, Anni Zou, Zhuosheng Zhang, Yilun Zhao, Xingyao Zhang, etc . - 【arXiv.org】


Towards Verifiable Text Generation with Symbolic References (2023.11.15)

Lucas Torroba Hennigen, Zejiang Shen, Aniruddha Nrusimha, Bernhard Gapp, David Sontag, etc . - 【arXiv.org】


Learning skillful medium-range global weather forecasting (2023.11.14)

Remi Lam, Alvaro Sanchez-Gonzalez, Matthew Willson, Peter Wirnsberger, Meire Fortunato, etc . - 【Science】


u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model (2023.11.09)

Jinjin Xu, Liwu Xu, Yuzhe Yang, Xiang Li, Yanchun Xie, etc . - 【arXiv.org】


Levels of AGI: Operationalizing Progress on the Path to AGI (2023.11.04)

Meredith Ringel Morris, Jascha Narain Sohl-Dickstein, Noah Fiedel, T. Warkentin, Allan Dafoe, etc . - 【arXiv.org】


Which Examples to Annotate for In-Context Learning? Towards Effective and Efficient Selection (2023.10.30)

Costas Mavromatis, Balasubramaniam Srinivasan, Zhengyuan Shen, Jiani Zhang, H. Rangwala, etc . - 【arXiv.org】


CodeFusion: A Pre-trained Diffusion Model for Code Generation (2023.10.26)

Mukul Singh, J. Cambronero, Sumit Gulwani, Vu Le, Carina Negreanu, etc


SuperHF: Supervised Iterative Learning from Human Feedback (2023.10.25)

Gabriel Mukobi, Peter Chatain, Su Fong, Robert Windesheim, Gitta Kutyniok, etc


In-Context Learning Creates Task Vectors (2023.10.24)

Roee Hendel, Mor Geva, Amir Globerson . - 【Conference on Empirical Methods in Natural Language Processing】


Woodpecker: Hallucination Correction for Multimodal Large Language Models (2023.10.24)

Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, etc


In-context Learning with Transformer Is Really Equivalent to a Contrastive Learning Pattern (2023.10.20)

Ruifeng Ren, Yong Liu . - 【arXiv.org】


MemGPT: Towards LLMs as Operating Systems (2023.10.12)

Charles Packer, Vivian Fang, Shishir G. Patil, Kevin Lin, Sarah Wooders, etc


LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models (2023.09.21)

Yukang Chen, Shengju Qian, Haotian Tang, Xin Lai, Zhijian Liu, etc . - 【arXiv.org】


Adapting Large Language Models via Reading Comprehension (2023.09.18)

Daixuan Cheng, Shaohan Huang, Furu Wei . - 【arXiv.org】


Giraffe: Adventures in Expanding Context Lengths in LLMs (2023.08.21)

Arka Pal, Deep Karkhanis, Manley Roberts, S. Dooley, Arvind Sundararajan, etc . - 【arXiv.org】


Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval (2023.08.15)

Chaorui Deng, Qi Chen, Pengda Qin, Dave Zhenyu Chen, Qi Wu . - 【arXiv.org】


Exploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt Engineering (2023.08.14)

Edward Junprung . - 【arXiv.org】


PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification (2023.08.05)

Hongwei Yao, Jian Lou, Kui Ren, Zhan Qin . - 【arXiv.org】


Learning to Retrieve In-Context Examples for Large Language Models (2023.07.14)

Liang Wang, Nan Yang, Furu Wei


Brain in a Vat: On Missing Pieces Towards Artificial General Intelligence in Large Language Models (2023.07.07)

Yuxi Ma, Chi Zhang, Song-Chun Zhu . - 【arXiv.org】


Understanding In-Context Learning via Supportive Pretraining Data (2023.06.26)

Xiaochuang Han, Daniel Simig, Todor Mihaylov, Yulia Tsvetkov, Asli Celikyilmaz, etc . - 【Annual Meeting of the Association for Computational Linguistics】


Schema-learning and rebinding as mechanisms of in-context learning and emergence (2023.06.16)

Siva K. Swaminathan, A. Dedieu, Rajkumar Vasudeva Raju, M. Shanahan, M. Lázaro-Gredilla, etc . - 【arXiv.org】


MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models (2023.06.02)

Masoud Monajatipoor, Liunian Harold Li, Mozhdeh Rouhsedaghat, Lin F. Yang, Kai-Wei Chang . - 【arXiv.org】


Measuring and Mitigating Constraint Violations of In-Context Learning for Utterance-to-API Semantic Parsing (2023.05.24)

Shufan Wang, Sebastien Jean, Sailik Sengupta, James Gung, Nikolaos Pappas, etc


OverPrompt: Enhancing ChatGPT Capabilities through an Efficient In-Context Learning Approach (2023.05.24)

Jiazheng Li, Runcong Zhao, Yulan He, Lin Gui


Self-ICL: Zero-Shot In-Context Learning with Self-Generated Demonstrations (2023.05.24)

Wei-Lin Chen, Cheng-Kuang Wu, Hsin-Hsi Chen . - 【Conference on Empirical Methods in Natural Language Processing】


Adversarial Demonstration Attacks on Large Language Models (2023.05.24)

Jiongxiao Wang, Zichen Liu, Keun Hee Park, Muhao Chen, Chaowei Xiao


Frugal Prompting for Dialog Models (2023.05.24)

Bishal Santra, Sakya Basak, Abhinandan De, Manish Gupta, Pawan Goyal


Coverage-based Example Selection for In-Context Learning (2023.05.24)

Shivanshu Gupta, Sameer Singh, Matt Gardner


SummIt: Iterative Text Summarization via ChatGPT (2023.05.24)

Haopeng Zhang, Xiao Liu, Jiawei Zhang


Exploring Diverse In-Context Configurations for Image Captioning (2023.05.24)

Xu Yang, Yongliang Wu, Mingzhuo Yang, Haokun Chen


Mastering the ABCDs of Complex Questions: Answer-Based Claim Decomposition for Fine-grained Self-Evaluation (2023.05.24)

Nishant Balepur, Jie Huang, Samraj Moorjani, Hari Sundaram, Kevin Chen-Chuan Chang


In-Context Demonstration Selection with Cross Entropy Difference (2023.05.24)

Dan Iter, Reid Pryzant, Ruochen Xu, Shuohang Wang, Yang Liu, etc


ExpertPrompting: Instructing Large Language Models to be Distinguished Experts (2023.05.24)

Benfeng Xu, An Yang, Junyang Lin, Quan Wang, Chang Zhou, etc


Active Learning Principles for In-Context Learning with Large Language Models (2023.05.23)

Katerina Margatina, Timo Schick, Nikolaos Aletras, Jane Dwivedi-Yu


Skill-Based Few-Shot Selection for In-Context Learning (2023.05.23)

Shengnan An, Bo Zhou, Zeqi Lin, Qiang Fu, Bei Chen, etc


Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning (2023.05.23)

Lean Wang, Lei Li, Damai Dai, Deli Chen, Hao Zhou, etc


Make a Choice! Knowledge Base Question Answering with In-Context Learning (2023.05.23)

Chuanyuan Tan, Yuehe Chen, Wenbiao Shao, Wenliang Chen


Concept-aware Training Improves In-context Learning Ability of Language Models (2023.05.23)

Michal Štefánik, Marek Kadlčík


RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning (2023.05.23)

Alexander Scarlatos, Andrew Lan


Can We Edit Factual Knowledge by In-Context Learning? (2023.05.22)

Ce Zheng, Lei Li, Qingxiu Dong, Yuxuan Fan, Zhiyong Wu, etc


Iterative Forward Tuning Boosts In-context Learning in Language Models (2023.05.22)

Jiaxi Yang, Binyuan Hui, Min Yang, Binhua Li, Fei Huang, etc


Explaining How Transformers Use Context to Build Predictions (2023.05.21)

Javier Ferrando, Gerard I. Gállego, Ioannis Tsiamas, M. Costa-jussà


Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer (2023.05.20)

Kaige Xie, Tong Yu, Haoliang Wang, Junda Wu, Handong Zhao, etc


Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning (2023.05.20)

Liangming Pan, Alon Albalak, Xinyi Wang, William Yang Wang


RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought (2023.05.19)

Tianci Xue, Ziqi Wang, Zhenhailong Wang, Chi Han, Pengfei Yu, etc . - 【arXiv.org】


Do Models Really Learn to Follow Instructions? An Empirical Study of Instruction Tuning (2023.05.19)

Po-Nien Kung, Nanyun Peng . - 【arXiv.org】


AutoTrial: Prompting Language Models for Clinical Trial Design (2023.05.19)

Zifeng Wang, Cao Xiao, Jimeng Sun . - 【arXiv.org】


Efficient Prompting via Dynamic In-Context Learning (2023.05.18)

Wangchunshu Zhou, Yuchen Jiang, Ryan Cotterell, Mrinmaya Sachan . - 【arXiv.org】


Generalized Planning in PDDL Domains with Pretrained Large Language Models (2023.05.18)

Tom Silver, Soham Dan, Kavitha Srinivas, J. Tenenbaum, L. Kaelbling, etc . - 【arXiv.org】


Discriminative Diffusion Models as Few-shot Vision and Language Learners (2023.05.18)

Xuehai He, Weixi Feng, Tsu-Jui Fu, Varun Jampani, Arjun Reddy Akula, etc . - 【arXiv.org】


Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning (2023.05.17)

Dong-Ho Lee, Kian Ahrabian, Woojeong Jin, Fred Morstatter, J. Pujara . - 【arXiv.org】


What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning (2023.05.16)

Jane Pan, Tianyu Gao, Howard Chen, Danqi Chen . - 【arXiv.org】


Enriching language models with graph-based context information to better understand textual data (2023.05.10)

Albert Roethel, M. Ganzha, Anna Wróblewska . - 【arXiv.org】


Why Johnny Can’t Prompt: How Non-AI Experts Try (and Fail) to Design LLM Prompts (2023.04.19)

J. Zamfirescu-Pereira, Richmond Y. Wong, Bjoern Hartmann, Qiang Yang . - 【International Conference on Human Factors in Computing Systems】


SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language Models (2023.03.18)

Vithursan Thangarasa, Abhay Gupta, William Marshall, Tianda Li, Kevin Leong, etc . - 【Conference on Uncertainty in Artificial Intelligence】


Larger language models do in-context learning differently (2023.03.07)

Jerry W. Wei, Jason Wei, Yi Tay, Dustin Tran, Albert Webson, etc . - 【ArXiv】


Language Model Crossover: Variation through Few-Shot Prompting (2023.02.23)

Elliot Meyerson, M. Nelson, Herbie Bradley, Arash Moradi, Amy K. Hoover, etc . - 【ArXiv】


How Does In-Context Learning Help Prompt Tuning? (2023.02.22)

Simeng Sun, Yang Liu, Dan Iter, Chenguang Zhu, Mohit Iyyer . - 【ArXiv】


Compositional Exemplars for In-context Learning (2023.02.11)

Jiacheng Ye, Zhiyong Wu, Jiangtao Feng, Tao Yu, Lingpeng Kong . - 【International Conference on Machine Learning】


PLACES: Prompting Language Models for Social Conversation Synthesis (2023.02.07)

Maximillian Chen, A. Papangelis, Chenyang Tao, Seokhwan Kim, Andrew Rosenbaum, etc . - 【ArXiv】


Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning (2023.01.27)

Xinyi Wang, Wanrong Zhu, William Yang Wang . - 【ArXiv】


Transformers as Algorithms: Generalization and Stability in In-context Learning (2023.01.17)

Yingcong Li, M. E. Ildiz, Dimitris Papailiopoulos, Samet Oymak


OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization (2022.12.22)

S. Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, etc . - 【ArXiv】


Prompt-Augmented Linear Probing: Scaling Beyond The Limit of Few-shot In-Context Learners (2022.12.21)

Hyunsoo Cho, Hyuhng Joon Kim, Junyeob Kim, Sang-Woo Lee, Sang-goo Lee, etc . - 【ArXiv】


In-context Learning Distillation: Transferring Few-shot Learning Ability of Pre-trained Language Models (2022.12.20)

Yukun Huang, Yanda Chen, Zhou Yu, K. McKeown . - 【ArXiv】


Self-adaptive In-context Learning (2022.12.20)

Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong . - 【ArXiv】


Is GPT-3 a Good Data Annotator? (2022.12.20)

Bosheng Ding, Chengwei Qin, Linlin Liu, Lidong Bing, Shafiq R. Joty, etc . - 【ArXiv】


Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters (2022.12.20)

Boshi Wang, Sewon Min, Xiang Deng, Jiaming Shen, You Wu, etc . - 【ArXiv】


Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers (2022.12.20)

Damai Dai, Yutao Sun, Li Dong, Y. Hao, Zhifang Sui, etc . - 【ArXiv】


Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations (2022.12.19)

Xinxi Lyu, Sewon Min, Iz Beltagy, Luke Zettlemoyer, Hannaneh Hajishirzi . - 【Annual Meeting of the Association for Computational Linguistics】


Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale (2022.12.18)

Hritik Bansal, Karthik Gopalakrishnan, Saket Dingliwal, S. Bodapati, Katrin Kirchhoff, etc . - 【ArXiv】


Transformers learn in-context by gradient descent (2022.12.15)

J. Oswald, Eyvind Niklasson, E. Randazzo, J. Sacramento, A. Mordvintsev, etc . - 【ArXiv】


Structured Prompting: Scaling In-Context Learning to 1,000 Examples (2022.12.13)

Y. Hao, Yutao Sun, Li Dong, Zhixiong Han, Yuxian Gu, etc . - 【ArXiv】


Diverse Demonstrations Improve In-context Compositional Generalization (2022.12.13)

Itay Levy, Ben Bogin, Jonathan Berant . - 【ArXiv】


What learning algorithm is in-context learning? Investigations with linear models (2022.11.28)

Ekin Akyürek, D. Schuurmans, Jacob Andreas, Tengyu Ma, Denny Zhou . - 【International Conference on Learning Representations】


Complementary Explanations for Effective In-Context Learning (2022.11.25)

Xi Ye, Srini Iyer, Asli Celikyilmaz, Ves Stoyanov, Greg Durrett, etc . - 【Annual Meeting of the Association for Computational Linguistics】


Large Language Models with Controllable Working Memory (2022.11.09)

Daliang Li, A. Rawat, M. Zaheer, Xin Wang, M. Lukasik, etc . - 【ArXiv】


Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning (2022.11.06)

Yu Meng, Martin Michalski, Jiaxin Huang, Yu Zhang, T. Abdelzaher, etc . - 【ArXiv】


Large Language Models Are Human-Level Prompt Engineers (2022.11.03)

Yongchao Zhou, Andrei Ioan Muresanu, Ziwen Han, Keiran Paster, Silviu Pitis, etc . - 【ArXiv】


ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback (2022.10.22)

Jiacheng Ye, Jiahui Gao, Jiangtao Feng, Zhiyong Wu, Tao Yu, etc . - 【Conference on Empirical Methods in Natural Language Processing】


Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them (2022.10.17)

Mirac Suzgun, Nathan Scales, Nathanael Scharli, Sebastian Gehrmann, Yi Tay, etc . - 【ArXiv】


In-context Learning and Induction Heads (2022.09.24)

Catherine Olsson, Nelson Elhage, Neel Nanda, Nicholas Joseph, Nova DasSarma, etc . - 【ArXiv】


Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation (2022.09.22)

Xingdi Yuan, Tong Wang, Yen-Hsiang Wang, Emery Fine, Rania Abdelghani, etc . - 【Annual Meeting of the Association for Computational Linguistics】


On the Relation between Sensitivity and Accuracy in In-context Learning (2022.09.16)

Yanda Chen, Chen Zhao, Zhou Yu, K. McKeown, He He . - 【ArXiv】


What Can Transformers Learn In-Context? A Case Study of Simple Function Classes (2022.08.01)

Shivam Garg, Dimitris Tsipras, Percy Liang, G. Valiant . - 【ArXiv】


Rationale-Augmented Ensembles in Language Models (2022.07.02)

Xuezhi Wang, Jason Wei, D. Schuurmans, Quoc Le, E. Chi, etc . - 【ArXiv】


Self-Generated In-Context Learning: Leveraging Auto-regressive Language Models as a Demonstration Generator (2022.06.16)

Hyuhng Joon Kim, Hyunsoo Cho, Junyeob Kim, Taeuk Kim, Kang Min Yoo, etc . - 【ArXiv】


Instruction Induction: From Few Examples to Natural Language Task Descriptions (2022.05.22)

Or Honovich, Uri Shaham, Samuel R. Bowman, Omer Levy . - 【ArXiv】


Prototypical Calibration for Few-shot Learning of Language Models (2022.05.20)

Zhixiong Han, Y. Hao, Li Dong, Yutao Sun, Furu Wei . - 【ArXiv】


Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning (2022.05.11)

Haokun Liu, Derek Tam, Mohammed Muqeeth, Jay Mohta, Tenghao Huang, etc . - 【ArXiv】


Improving In-Context Few-Shot Learning via Self-Supervised Training (2022.05.03)

Mingda Chen, Jingfei Du, Ramakanth Pasunuru, Todor Mihaylov, Srini Iyer, etc . - 【North American Chapter of the Association for Computational Linguistics】


On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model (2022.04.28)

Seongjin Shin, Sang-Woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, etc . - 【North American Chapter of the Association for Computational Linguistics】


Data Distributional Properties Drive Emergent In-Context Learning in Transformers (2022.04.22)

Stephanie C. Y. Chan, Adam Santoro, Andrew Kyle Lampinen, Jane X. Wang, Aaditya K Singh, etc . - 【ArXiv】


Can language models learn from explanations in context? (2022.04.05)

Andrew Kyle Lampinen, I. Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, etc . - 【Conference on Empirical Methods in Natural Language Processing】


Self-Consistency Improves Chain of Thought Reasoning in Language Models (2022.03.21)

Xuezhi Wang, Jason Wei, D. Schuurmans, Quoc Le, E. Chi, etc . - 【ArXiv】


Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? (2022.02.25)

Sewon Min, Xinxi Lyu, Ari Holtzman, Mikel Artetxe, M. Lewis, etc . - 【Conference on Empirical Methods in Natural Language Processing】


Co-training Improves Prompt-based Learning for Large Language Models (2022.02.02)

Hunter Lang, Monica Agrawal, Yoon Kim, D. Sontag . - 【International Conference on Machine Learning】


Black-Box Tuning for Language-Model-as-a-Service (2022.01.10)

Tianxiang Sun, Yunfan Shao, Hong Qian, Xuanjing Huang, Xipeng Qiu . - 【International Conference on Machine Learning】


Learning To Retrieve Prompts for In-Context Learning (2021.12.16)

Ohad Rubin, Jonathan Herzig, Jonathan Berant . - 【North American Chapter of the Association for Computational Linguistics】


True Few-Shot Learning with Prompts—A Real-World Perspective (2021.11.26)

Timo Schick, Hinrich Schütze . - 【Transactions of the Association for Computational Linguistics】


An Explanation of In-context Learning as Implicit Bayesian Inference (2021.11.03)

Sang Michael Xie, Aditi Raghunathan, Percy Liang, Tengyu Ma . - 【International Conference on Learning Representations】


MetaICL: Learning to Learn In Context (2021.10.29)

Sewon Min, M. Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi . - 【North American Chapter of the Association for Computational Linguistics】


Meta-learning via Language Model In-context Tuning (2021.10.15)

Yanda Chen, Ruiqi Zhong, Sheng Zha, G. Karypis, He He . - 【Annual Meeting of the Association for Computational Linguistics】


Multitask Prompted Training Enables Zero-Shot Task Generalization (2021.10.15)

Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, etc . - 【International Conference on Learning Representations】


The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design (2021.10.09)

Yoav Levine, Noam Wies, Daniel Jannai, D. Navon, Yedid Hoshen, etc . - 【International Conference on Learning Representations】


Reframing Instructional Prompts to GPTk’s Language (2021.09.16)

Swaroop Mishra, Daniel Khashabi, Chitta Baral, Yejin Choi, Hannaneh Hajishirzi . - 【Findings】


Noisy Channel Language Model Prompting for Few-Shot Text Classification (2021.08.09)

Sewon Min, Michael Lewis, Hannaneh Hajishirzi, Luke Zettlemoyer . - 【Annual Meeting of the Association for Computational Linguistics】


Multimodal Few-Shot Learning with Frozen Language Models (2021.06.25)

Maria Tsimpoukelli, Jacob Menick, Serkan Cabi, S. Eslami, Oriol Vinyals, etc . - 【Neural Information Processing Systems】


Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity (2021.04.18)

Yao Lu, Max Bartolo, Alastair Moore, S. Riedel, Pontus Stenetorp . - 【Annual Meeting of the Association for Computational Linguistics】


Calibrate Before Use: Improving Few-Shot Performance of Language Models (2021.02.19)

Tony Zhao, Eric Wallace, Shi Feng, D. Klein, Sameer Singh . - 【International Conference on Machine Learning】


What Makes Good In-Context Examples for GPT-3? (2021.01.17)

Jiachang Liu, Dinghan Shen, Yizhe Zhang, Bill Dolan, L. Carin, etc . - 【Workshop on Knowledge Extraction and Integration for Deep Learning Architectures; Deep Learning Inside Out】


Few-Shot Text Generation with Pattern-Exploiting Training (2020.12.22)

Timo Schick, Hinrich Schütze . - 【ArXiv】


InContext: A mobile application for the improvement of learning strategies at University (2020.06.01)

Claudia-A. Lerma-Noriega, María-L. Flores-Palacios, Genaro Rebolledo-Méndez


Language Models are Few-Shot Learners (2020.05.28)

Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, J. Kaplan, etc . - 【Neural Information Processing Systems】


Does GPT-3 Generate Empathetic Dialogues? A Novel In-Context Example Selection Method and Automatic Evaluation Metric for Empathetic Dialogue Generation

Young-Jun Lee, Chae-Gyun Lim, Ho-Jin Choi . - 【International Conference on Computational Linguistics】


Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning

Yingcong Li, M. E. Ildiz, Dimitris Papailiopoulos, Samet Oymak . - 【ArXiv】


Search-in-the-Chain: Towards the Accurate, Credible and Traceable Content Generation for Complex Knowledge-intensive Tasks

Shicheng Xu, Liang Pang, Huawei Shen, Xueqi Cheng, Tat-Seng Chua . - 【arXiv.org】

CONTINUE...