Better Word Representations with Recursive Neural Networks for Morphology

Minh-Thang Luong, Richard Socher, and Christopher D. Manning

Thanks for reading our paper and visiting this project page! If you have any questions, feel free to email us.

Dataset:

The Stanford Rare Word (RW) Similarity Dataset could now be downloaded here.

Morphologically-trained word vectors:

Based on Huang et al. (2012)'s embeddings (HSMN+csmRNN): [ embeddings (text) ] [ words (text) ] [ parameters (mat) ].

Based on Collobert et al. (2011)'s embeddings (CW+csmRNN): [ embeddings (text) ] [ words (text) ] [ parameters (mat) ].

Note:

Citation: