1. Efficient Estimation of Word Representations in Vector Space. Tomas Mikolov
2. Distributed Representations of Words and Phrases and their Compositionality. Tomas Mikolov
3. Deep Learning Embeddings for Discontinuous Linguistic Units