paper list about large language model (LLM)
- "LIMA: Less Is More for Alignment"
Arxiv (2023).
[paper]
Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy
- "A Survey of Large Language Models"
Arxiv (2023).
[paper]
Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie, Ji-Rong Wen
-
"Is ChatGPT a General-Purpose Natural Language Processing Task Solver?" Arxiv (2023). [paper]
Chengwei Qin, Aston Zhang, Zhuosheng Zhang, Jiaao Chen, Michihiro Yasunaga, Diyi Yang -
"A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity" Arxiv (2023). [paper]
Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung -
"We’re Afraid Language Models Aren’t Modeling Ambiguity" Arxiv (2023). [paper]
Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah A. Smith, Yejin Choi
-
"VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" CVPR (2022). [paper]
Yi-Lin Sung, Jaemin Cho, Mohit Bansal -
"Multimodal Few-Shot Learning with Frozen Language Models" NIPS (2021). [paper]
Maria Tsimpoukelli, Jacob L Menick, Serkan Cabi, S. M. Ali Eslami, Oriol Vinyals, Felix Hill -
"Modular and Parameter-Efficient Multimodal Fusion with Prompting" Findings of ACL (2022). [paper]
Sheng Liang, Mengjie Zhao, Hinrich Schuetze -
"Learning to prompt for vision-language models" IJCV (2022). [paper]
Kaiyang Zhou, Jingkang Yang, Chen Change Loy, Ziwei Liu -
"Conditional prompt learning for vision-language models" CVPR (2022). [paper]
Kaiyang Zhou, Jingkang Yang, Chen Change Loy, Ziwei Liu -
"Maple: Multi-modal prompt learning" Arxiv (2022). [paper]
Muhammad Uzair Khattak, Hanoona Rasheed, Muhammad Maaz, Salman Khan, Fahad Shahbaz Khan -
"AIM: Adapting Image Models for Efficient Video Understanding" ICLR (2023). [paper]
Taojiannan Yang, Yi Zhu, Yusheng Xie, Aston Zhang, Chen Chen, Mu Li
-
"See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning" Arxiv (2023). [paper]
Zhenfang Chen, Qinhong Zhou, Yikang Shen, Yining Hong, Hao Zhang, Chuang Gan -
"Multimodal Chain-of-Thought Reasoning in Language Models" Arxiv (2023). [paper]
Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola -
"Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering" CVPR (2023). [paper] [code]
Zhenwei Shao, Zhou Yu, Meng Wang, Jun Yu -
"Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners" CVPR (2023). [paper] [code]
Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao -
"An empirical study of gpt-3 for few-shot knowledge-based vqa" AAAI (2022). [paper]
Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Yumao Lu, Zicheng Liu, Lijuan Wang -
"KAT: A Knowledge Augmented Transformer for Vision-and-Language" NAACL (2022). [paper]
Liangke Gui, Borui Wang, Qiuyuan Huang, Alexander Hauptmann, Yonatan Bisk, Jianfeng Gao -
"REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering" NIPS (2022). [paper]
Yuanze Lin, Yujia Xie, Dongdong Chen, Yichong Xu, Chenguang Zhu, Lu Yuan -
"DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training" ICLR (2023). [paper]
Wei Li, Linchao Zhu, Longyin Wen, Yi Yang -
"Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models" Arxiv (2023). [paper]
Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan -
"MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action" Arxiv (2023). [paper]
Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab, Faisal Ahmed, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang -
"ViperGPT: Visual Inference via Python Execution for Reasoning" Arxiv (2023). [paper]
Dídac Surís, Sachit Menon, Carl Vondrick -
"MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models" Arxiv (2023). [paper]
Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny -
"InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning" Arxiv (2023). [paper]
Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi -
"BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models" ICML (2023). [paper]
Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi -
“VideoChat: Chat-Centric Video Understanding” Arxiv (2023). [paper]
KunChang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao -
"Self-Chained Image-Language Model for Video Localization and Question Answering" Arxiv (2023). [paper]
Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal -
"GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest" Arxiv (2023). [Paper]
Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Kai Chen, Ping Luo
-
"Generate labeled training data using Prompt Programming and GPT-3. An example of Big Five Personality Classification" Arxiv (2023). [paper]
Eason Chen -
"AugGPT: Leveraging ChatGPT for Text Data Augmentation" Arxiv (2023). [paper]
Haixing Dai, Zhengliang Liu, Wenxiong Liao, Xiaoke Huang, Yihan Cao, Zihao Wu, Lin Zhao, Shaochen Xu, Wei Liu, Ninghao Liu, Sheng Li, Dajiang Zhu, Hongmin Cai, Lichao Sun, Quanzheng Li, Dinggang Shen, Tianming Liu, Xiang Li -
"Reward Design with Language Models" ICLR (2023). [paper]
Minae Kwon, Sang Michael Xie, Kalesha Bullard, Dorsa Sadigh -
"Is a prompt and a few samples all you need? Using GPT-4 for data augmentation in low-resource classification tasks" Arxiv (2023). [paper]
Anders Giovanni Møller, Jacob Aarup Dalsgaard, Arianna Pera, Luca Maria Aiello
-
"SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models" Arxiv (2023). [paper]
Potsawee Manakul, Adian Liusie, Mark J. F. Gales -
"Reflexion: an autonomous agent with dynamic memory and self-reflection" Arxiv (2023). [paper]
Noah Shinn, Beck Labash, Ashwin Gopinath
-
"Fairness-guided Few-shot Prompting for Large Language Models" Arxiv (2023). [paper]
Huan Ma, Changqing Zhang, Yatao Bian, Lemao Liu, Zhirui Zhang, Peilin Zhao, Shu Zhang, Huazhu Fu, Qinghua Hu, Bingzhe Wu -
"Larger language models do in-context learning differently" Arxiv (2023). [paper]
Jerry Wei, Jason Wei, Yi Tay, Dustin Tran, Albert Webson, Yifeng Lu, Xinyun Chen, Hanxiao Liu, Da Huang, Denny Zhou, Tengyu Ma -
"Automatic Chain of Thought Prompting in Large Language Models" ICLR (2023). [paper]
Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola