Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
data_mining:neural_network:autoencoder [2017/05/04 13:36] – [Semantic hashing] phreazer | data_mining:neural_network:autoencoder [2017/07/30 16:02] (current) – [Autoencoder] phreazer | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Autoencoder ====== | ====== Autoencoder ====== | ||
+ | |||
+ | * Unsupervised learning: Feature extraction, Generative models, Compression, | ||
+ | * Loss as evaluation metric | ||
+ | * Difference to RBM: Deterministic approach (not stochastic). | ||
+ | * Encoder compresses to few dimensions, Decoder maps back to full dimensionality | ||
+ | * Building block for deep belief networks | ||
+ | ===== Comparison with PCA ===== | ||
+ | |||
PCA: | PCA: | ||
Line 59: | Line 67: | ||
Query (supermaket search): Hash, get address, get nearby addresses (semantically similar documents). | Query (supermaket search): Hash, get address, get nearby addresses (semantically similar documents). | ||
+ | |||
+ | ===== Learn binary codes for image retrieval ===== | ||
+ | |||
+ | Matching real-values vectors is slow => short binary code faster. | ||
+ | |||
+ | Use semantic hashing with 28-bit binary code to get a long shortlist of promising images. The nuser 265 bit binary code to do a serial search for good matches. | ||
+ | |||
+ | Krizhevsky' | ||
+ | |||
+ | Reconstructing 32x32 color images from 256 bit codes. | ||
+ | |||
+ | ===== Shallow autoencoders for pre-training ===== | ||
+ | |||
+ | Just have 1 layer. RBMs can be seen as shallow autoencoders. | ||
+ | |||
+ | Train RBM with one-step constrastive divergence: Makses resconstruction look like data. | ||
+ | |||
+ | |||
+ | ===== Conclusion about pre-training ===== | ||
+ | |||
+ | For data sets without huge number of labeled cases: Pre-training helps subsequent discriminative learning, espescially if unlabeled extra data is available. | ||
+ | |||
+ | For very large, labeled datasets: Not necessary, but if nets get much larger pre-training is necessary again. |