Gated Recurrent Unit (GRU) Paper: Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation.