Word2Vec

Source: Word2Vec Tutorial - The Skip-Gram Model

Consider a neural network: given an input word, it outputs, for every other word in the vocabulary, the probability that that word occurs near the input word:

NN: word -> P(word occurs nearby) ∈ [0, 1]
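
A minimal NumPy sketch of that network, assuming the 10,000-word vocabulary and 300 hidden units described below; the random weights and the word index 42 are placeholders for illustration:

import numpy as np

VOCAB_SIZE = 10_000
EMBED_DIM = 300

rng = np.random.default_rng(0)
M1 = rng.normal(0, 0.01, (VOCAB_SIZE, EMBED_DIM))  # hidden-layer weights
M2 = rng.normal(0, 0.01, (EMBED_DIM, VOCAB_SIZE))  # output-layer weights

def forward(word_index: int) -> np.ndarray:
    """Return P(each vocabulary word occurs near the input word)."""
    hidden = M1[word_index]              # one-hot input selects a row: (300,)
    logits = hidden @ M2                 # one score per vocabulary word: (10000,)
    exp = np.exp(logits - logits.max())  # numerically stable softmax
    return exp / exp.sum()

probs = forward(42)  # hypothetical word index
assert probs.shape == (VOCAB_SIZE,) and abs(probs.sum() - 1.0) < 1e-6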

Training data: (input word, nearby word) pairs taken from a sliding window over the text.

The hidden-layer weight matrix M1 holds the "features" of the 10,000 vocabulary words: one row per word, 300 features each (10,000 × 300).
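
Because the input is one-hot, multiplying it by M1 simply selects one row, so M1 doubles as a lookup table of word vectors. A sketch with placeholder values:

import numpy as np

rng = np.random.default_rng(0)
M1 = rng.normal(size=(10_000, 300))  # 10,000 words x 300 features

one_hot = np.zeros(10_000)
one_hot[42] = 1.0                    # hypothetical word index

# Matrix multiplication and direct row lookup give the same 300-d word vector.
assert np.allclose(one_hot @ M1, M1[42])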

Word Embedding

One-hot encoding of the sentence "I eat apple":

I eat apple
I     -> [1, 0, 0]
eat   -> [0, 1, 0]
apple -> [0, 0, 1]
--> [
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 1]
]
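
The same encoding built programmatically (a sketch; the vocabulary order comes straight from the sentence):

import numpy as np

sentence = "I eat apple".split()
vocab = {word: i for i, word in enumerate(sentence)}  # I=0, eat=1, apple=2

def one_hot(word: str) -> np.ndarray:
    vec = np.zeros(len(vocab))
    vec[vocab[word]] = 1.0
    return vec

print(one_hot("eat"))  # [0. 1. 0.]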

Attention

Good article: Visualizing A Neural Machine Translation Model (Mechanics of Seq2seq Models With Attention)
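
The core mechanism from that article can be sketched as dot-product attention: score each encoder state against the current decoder state, softmax the scores, and take the weighted sum of encoder states. The shapes and values below are made up for illustration:

import numpy as np

def attention(decoder_state, encoder_states):
    """decoder_state: (d,); encoder_states: (seq_len, d)."""
    scores = encoder_states @ decoder_state  # one score per source position
    exp = np.exp(scores - scores.max())
    weights = exp / exp.sum()                # softmax over source positions
    context = weights @ encoder_states       # weighted sum of encoder states: (d,)
    return context, weights

rng = np.random.default_rng(0)
ctx, w = attention(rng.normal(size=4), rng.normal(size=(3, 4)))
print(w)  # one weight per source position, summing to 1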

Changelog

Just observe 👀