Skip to content

Latest commit

 

History

History
30 lines (16 loc) · 1.47 KB

question_answering.md

File metadata and controls

30 lines (16 loc) · 1.47 KB

Question Answering

UIT-ViQuAD (version 1.0)

A Vietnamese Dataset for Evaluating Machine Reading Comprehension.

This dataset comprises over 23,000 human-generated question-answer pairs based on 5,109 passages of 174 Vietnamese articles from Wikipedia. In particular, we propose a new process of dataset creation for Vietnamese MRC.

UIT-ViNewsQA

New Vietnamese Corpus for Machine Reading Comprehension of Health News Articles.

The corpus comprises 22,057 human-generated question-answer pairs. Crowd-workers create the questions and their answers based on a collection of over 4,416 online Vietnamese healthcare news articles, where the answers comprise spans extracted from the corresponding articles. In particular, we develop a process of creating a corpus for the Vietnamese machine reading comprehension.

Miscellaneous

📜 Papers

📁 Open sources