Este repositorio contiene el material para una charla de computación lingüística en el contexto del Laboratorio de Ciencia de Datos de la Facultad de Ingeniería.
- algoritmos de embeddings de palabras
- propiedades y regularidades en los embeddings
- alineación de embeddings
- traducción por máquina: modelos no supervisados
- traducción por máquina: modelos supervisados
Mikolov, Tomas, et al. "Distributed representations of words and phrases and their compositionality." Advances in neural information processing systems. 2013.
Mikolov, Tomas, Wen-tau Yih, and Geoffrey Zweig. "Linguistic regularities in continuous space word representations." Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2013.
Bojanowski, Piotr, et al. "Enriching word vectors with subword information." arXiv preprint arXiv:1607.04606 (2016).
Bahdanau, Dzmitry, Kyunghyun Cho, and Yoshua Bengio. "Neural machine translation by jointly learning to align and translate." arXiv preprint arXiv:1409.0473 (2014).
Conneau, Alexis, et al. "Word translation without parallel data." arXiv preprint arXiv:1710.04087 (2017).
Lample, Guillaume, Ludovic Denoyer, and Marc'Aurelio Ranzato. "Unsupervised machine translation using monolingual corpora only." arXiv preprint arXiv:1711.00043 (2017).