The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained IndoGPT and IndoBART models, and a starter code! (EMNLP 2021)
nlp qa benchmark natural-language-processing deep-learning dataset bart summarization gpt indonesian bahasa chit-chat javanese gpt2 dialogue-system sundanese indonlg
-
Updated
Nov 16, 2024 - Python