Differences
This shows you the differences between two versions of the page.
data_mining:neural_network:word_embeddings [2018/06/09 16:24] – [GloVe word vectors] phreazer | data_mining:neural_network:word_embeddings [2018/06/09 16:40] (current) – [Debiasing word embeddings] phreazer | ||
---|---|---|---|
Line 96: | Line 96: | ||
$e^{final}_w = \frac{e_w + \Theta_w}{2}$ | $e^{final}_w = \frac{e_w + \Theta_w}{2}$ | ||
- | ==== Featurization view of word embeddings ==== | + | ===== Application ===== |
+ | |||
+ | ==== Sentiment classification ==== | ||
+ | |||
+ | === Simple model === | ||
+ | |||
+ | * Extract embedding vector for each word | ||
+ | * Sum or average those vectors | ||
+ | * Pass the result to a softmax layer to predict the rating (1-5 stars) | ||
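
The three steps above can be sketched in plain Python. This is a minimal illustration, not a trained model: the `EMBEDDINGS` table, the `WEIGHTS`/`BIASES` parameters and the helper names are all hypothetical and hand-picked so the example is easy to follow.

```python
import math

# Hypothetical 3-dimensional embeddings; real ones would come from a trained model.
EMBEDDINGS = {
    "good": [0.9, 0.1, 0.3],
    "bad": [-0.8, 0.2, 0.1],
    "not": [-0.1, 0.7, -0.4],
    "food": [0.0, -0.3, 0.6],
}

# Hand-picked (untrained) softmax parameters: 5 star classes x 3 embedding dims.
WEIGHTS = [[-2.0, 0.0, 0.0], [-1.0, 0.0, 0.0], [0.0, 0.0, 0.0],
           [1.0, 0.0, 0.0], [2.0, 0.0, 0.0]]
BIASES = [0.0, 0.0, 0.0, 0.0, 0.0]


def average_embedding(words):
    """Look up each known word and average the vectors component-wise."""
    vectors = [EMBEDDINGS[w] for w in words if w in EMBEDDINGS]
    if not vectors:
        raise ValueError("no known words in input")
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_stars(words):
    """Return a probability for each rating from 1 to 5 stars."""
    features = average_embedding(words)
    logits = [sum(w * x for w, x in zip(row, features)) + b
              for row, b in zip(WEIGHTS, BIASES)]
    return softmax(logits)


probs = predict_stars(["good", "food"])
best_rating = probs.index(max(probs)) + 1  # ratings are 1-based
```

Because averaging does not depend on the order of its inputs, `predict_stars(["not", "good"])` and `predict_stars(["good", "not"])` return the same distribution.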
+ | |||
+ | Problem: Doesn't take word order into account. | ||
+ | |||
+ | === RNN for sentiment classification === | ||
+ | |||
+ | * Extract embedding vector for each word | ||
+ | * Feed into RNN with softmax output | ||
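
A rough sketch of the forward pass, assuming a simple tanh RNN in a many-to-one layout where only the last hidden state goes into the softmax. The dimensions, the randomly initialised (untrained) parameters and the function names are assumptions made for illustration.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def random_matrix(rows, cols, rng):
    """Small random weights; a real model would learn these."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def rnn_classify(inputs, params):
    """Run the word vectors through the RNN one step at a time,
    then classify the final hidden state with a softmax."""
    w_xh, w_hh, b_h, w_hy, b_y = params
    h = [0.0] * len(b_h)  # initial hidden state
    for x in inputs:
        pre = [a + b + c for a, b, c in zip(matvec(w_xh, x), matvec(w_hh, h), b_h)]
        h = [math.tanh(p) for p in pre]
    logits = [a + b for a, b in zip(matvec(w_hy, h), b_y)]
    return softmax(logits)


EMBED_DIM, HIDDEN_DIM, NUM_CLASSES = 3, 4, 5
rng = random.Random(0)
params = (
    random_matrix(HIDDEN_DIM, EMBED_DIM, rng),    # input-to-hidden
    random_matrix(HIDDEN_DIM, HIDDEN_DIM, rng),   # hidden-to-hidden
    [0.0] * HIDDEN_DIM,
    random_matrix(NUM_CLASSES, HIDDEN_DIM, rng),  # hidden-to-output
    [0.0] * NUM_CLASSES,
)

# Hypothetical embedding vectors for a two-word review.
sentence = [[-0.1, 0.7, -0.4], [0.9, 0.1, 0.3]]
probs = rnn_classify(sentence, params)
```

Unlike the averaging model, the hidden state is updated word by word, so the sequence order can influence the output.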
+ | |||
+ | |||
+ | ===== Debiasing word embeddings ===== | ||
+ | |||
+ | Bias in text | ||
+ | |||
+ | Addressing bias in word embeddings: | ||
+ | |||
+ | - Identify bias direction (e.g. gender) | ||
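+ | * Compute difference vectors such as $e_{he} - e_{she}$ for several definitional pairs and average them | ||
+ | - Neutralize: For every word that is not definitional (i.e. has no legitimate gender component), project out the bias direction | ||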
+ | - Equalize pairs: Only difference should be gender (e.g. grandfather vs. grandmother); | ||
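
The three steps can be sketched with toy vectors. Everything here is an assumption for illustration: the 3-dimensional embeddings are made up, and `equalize` is a simplified variant that gives both words the same component orthogonal to the bias direction and mirrored components along it. It is not necessarily the exact formula from the literature.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def add(u, v):
    return [a + b for a, b in zip(u, v)]


def sub(u, v):
    return [a - b for a, b in zip(u, v)]


def scale(c, v):
    return [c * a for a in v]


def bias_direction(pairs):
    """Step 1: average the difference vectors of definitional pairs."""
    total = [0.0] * len(pairs[0][0])
    for first, second in pairs:
        total = add(total, sub(first, second))
    return scale(1.0 / len(pairs), total)


def project(v, g):
    """Component of v that lies along direction g."""
    return scale(dot(v, g) / dot(g, g), g)


def neutralize(v, g):
    """Step 2: remove the component of v along the bias direction."""
    return sub(v, project(v, g))


def equalize(e1, e2, g):
    """Step 3 (simplified): shared orthogonal part, mirrored part along g."""
    mu_perp = neutralize(scale(0.5, add(e1, e2)), g)
    half_gap = scale(0.5, sub(project(e1, g), project(e2, g)))
    return add(mu_perp, half_gap), sub(mu_perp, half_gap)


# Made-up embeddings; the first axis loosely plays the role of "gender".
E = {
    "he": [1.0, 0.2, 0.1], "she": [-1.0, 0.2, 0.1],
    "man": [0.8, 0.5, 0.0], "woman": [-0.8, 0.5, 0.0],
    "doctor": [0.3, 0.1, 0.9],
    "grandfather": [0.9, 0.4, 0.3], "grandmother": [-0.7, 0.5, 0.3],
}

g = bias_direction([(E["he"], E["she"]), (E["man"], E["woman"])])
doctor = neutralize(E["doctor"], g)
grandfather, grandmother = equalize(E["grandfather"], E["grandmother"], g)


def sq_dist(u, v):
    d = sub(u, v)
    return dot(d, d)
```

After these steps the neutralized `doctor` vector has no component along `g` and is equally far from the equalized `grandfather` and `grandmother` vectors.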