Vector Representations of Words

Vector Representations of Words

In this tutorial we look at the word2vec model by Mikolov et al. This model is used for learning vector representations of words, called "word embeddings".

Highlights

This tutorial is meant to highlight the interesting, substantive parts of building a word2vec model in TensorFlow.

  • We start by giving the motivation for why we would want to represent words as vectors.
  • We look at the intuition behind the model and how it is trained (with a splash of math for good measure).
  • We also show a simple implementation of the model in TensorFlow.
  • Finally, we look at ways to make the naive version scale better.

We wal