Teaching-Machines-Reading-Comprehension

Abstract:

Inference is a fundamental skill that a human develops when he performs reading comprehension problems. The challenge is not only to identify the relevant portion of a text but also to extract the inferred context. This is what represents intelligence and is a necessary and important area to cover when trying to develop artificial intelligence. A machine tackling reading comprehension may be a step in the direction of artificial intelligence.

Traditional approaches to automatic reading comprehension are rigid as they depend on a set of handcrafted rules that use lexical and syntactic patterns to extract the right answer from a paragraph. Language itself is ambiguous and rules are never sufficient to make decisions. Machine learning may be a solution here because the machine automatically computes the rules without any interference from humans.

Dataset:

In this project, we tackle Reading Comprehension using SciQ Dataset released by Allen Institute for Artificial Intelligence.

Approach:

Representing language as mathematical models is a challenge and there are very few machine learning models that can help analyze it. For this project, we have decided to use a neural network architecture. This will give us the required depth in analysis to overcome the challenges which come with inherent ambiguity of language. In particular, we plan to use LSTM. Our dataset comprises of a question and a set of possible answers for a support paragraph.

Typically, a reading comprehension task consists of a long paragraph (long sequence of words) followed by a question which calls for retention of memory. One of the advantages of LSTMs is they are capable of retaining memory over very long span.

One of the key challenges we might face in this project is embedding real world context into a machine. For example, the question in the above sample asks for “an evolutionary model contrasted by ….” but the support paragraph says “... is better supported by the fossil record than is gradualism.” Clearly, a lot of linguistic tools such as synonym, context and paraphrasing are used here to match the two pieces of text. Hopefully, the LSTM model is able to encode this information while trying to extract the answers from the paragraph in scope.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.gitignore		.gitignore
Final_Model.png		Final_Model.png
LICENSE		LICENSE
Preprocess.ipynb		Preprocess.ipynb
README.md		README.md
TeachingMachinesReadingComprehension.pptx		TeachingMachinesReadingComprehension.pptx
Visualize.ipynb		Visualize.ipynb
Visualize.py		Visualize.py
keras_implementation.ipynb		keras_implementation.ipynb
sentence_length_plots.ipynb		sentence_length_plots.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Teaching-Machines-Reading-Comprehension

Abstract:

Dataset:

Approach:

About

Releases

Packages

Languages

License

PrajnyaSatish/Teaching-Machines-Reading-Comprehension

Folders and files

Latest commit

History

Repository files navigation

Teaching-Machines-Reading-Comprehension

Abstract:

Dataset:

Approach:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages