http://ceur-ws.org/Vol-2125/paper_171.pdf
This paper describes the VGG-Seq2Seq system for the Medical Domain Visual Question Answering (VQA-Med) Task of ImageCLEF 2018. The proposed system follows the encoder-decoder architecture: the encoder fuses a pretrained VGG network with an LSTM network that has a pretrained word embedding layer to encode the input, and another LSTM network decodes the fused representation to generate the output. With the pretrained VGG network, the VGG-Seq2Seq model achieved reasonable results: BLEU, WBSS, and CBSS scores of 0.06, 0.12, and 0.03, respectively. Moreover, VGG-Seq2Seq is inexpensive to train.
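To make the described architecture concrete, the following is a minimal, hypothetical PyTorch sketch of such an encoder-decoder. The abstract does not specify layer sizes, the fusion scheme, or the framework, so everything here (hidden/embedding dimensions, vocabulary sizes, the choice to concatenate the VGG image feature with the question LSTM's final hidden state, and the class name VGGSeq2Seq) is an illustrative assumption, not the authors' exact configuration.

```python
# Hypothetical sketch of a VGG + LSTM encoder with an LSTM decoder,
# in the spirit of the VGG-Seq2Seq description. All sizes and the
# fusion scheme are assumptions for illustration.
import torch
import torch.nn as nn
from torchvision import models


class VGGSeq2Seq(nn.Module):
    def __init__(self, vocab_size=10000, embed_dim=300, hidden_dim=512):
        super().__init__()
        # Pretrained VGG16 as a frozen image encoder; the classifier head
        # is dropped and the convolutional features are flattened.
        vgg = models.vgg16(weights=models.VGG16_Weights.IMAGENET1K_V1)
        self.cnn = nn.Sequential(vgg.features, vgg.avgpool, nn.Flatten())
        for p in self.cnn.parameters():
            p.requires_grad = False
        self.img_proj = nn.Linear(512 * 7 * 7, hidden_dim)

        # Question encoder: word embedding (pretrained in the paper) + LSTM.
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.q_lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)

        # Decoder LSTM initialized from the fused image/question state
        # (fusion by concatenation + projection is an assumption here).
        self.fuse = nn.Linear(2 * hidden_dim, hidden_dim)
        self.dec_lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, image, question_ids, answer_ids):
        img_feat = self.img_proj(self.cnn(image))            # (B, H)
        _, (q_h, _) = self.q_lstm(self.embed(question_ids))  # q_h: (1, B, H)
        h0 = torch.tanh(self.fuse(torch.cat([img_feat, q_h[0]], dim=-1)))
        state = (h0.unsqueeze(0), torch.zeros_like(h0).unsqueeze(0))
        # Teacher forcing: the decoder consumes the gold answer tokens.
        dec_out, _ = self.dec_lstm(self.embed(answer_ids), state)
        return self.out(dec_out)                             # (B, T, vocab)


# Smoke test with random inputs: a batch of 2 images, 12-token questions,
# and 8-token answers.
model = VGGSeq2Seq()
logits = model(torch.randn(2, 3, 224, 224),
               torch.randint(0, 10000, (2, 12)),
               torch.randint(0, 10000, (2, 8)))
print(logits.shape)  # torch.Size([2, 8, 10000])
```

Freezing the VGG weights is consistent with the abstract's note that the model is inexpensive to train: only the embedding, the two LSTMs, and the projection layers receive gradients.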