1. "Efficient Estimation of Word Representations in Vector Space". Tomas Mikolov
2. "Distributed Representations of Words and Phrases and their Compositionality". Tomas Mikolov
3. "Deep Learning Embeddings for Discontinuous Linguistic Units"