data_mining:neural_network:word_embeddings

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
data_mining:neural_network:word_embeddings [2018/06/09 18:27] – [Application] phreazerdata_mining:neural_network:word_embeddings [2018/06/09 18:40] (current) – [Debiasing word embeddings] phreazer
Line 99: Line 99:
  
 ==== Sentiment classification ==== ==== Sentiment classification ====
 +
 +=== Simple model ===
 +
 +  * Extract embedding vector for each word
 +  * Sum or Avg those vectors
 +  * Pass to softmax to gain output (1-5 stars)
 +
 +Problem: Doesn't include order/sequence of words
 +
 +=== RNN for sentiment classification ===
 +
 +  * Extract embedding vector for each word
 +  * Feed into RNN with softmax output
 +
 +
 +===== Debiasing word embeddings =====
 +
 +Bias in text
 +
 +Addressing bias in word embessing:
 +
 +  - Identify bias direction (e.g. gender)
 +    * $e_{he} - e_{she}$, average them
 +  -  Neutralize: For every word that is not definitial (legitimate gender component), project
 +  - Equalize pairs: Only difference should be gender (e.g. grandfather vs. grandmother); equidistant
  
  
  • data_mining/neural_network/word_embeddings.1528561677.txt.gz
  • Last modified: 2018/06/09 18:27
  • by phreazer