Crossvalidation
Train/Dev/Test sets
Training set | Hold-out / Dev set | Test set
- Goal of dev set: Which algorithm does better
- Goal of test set: Performance estimate of final classifier
For NNs with large data sets: Often 98%/1%/1%
Mismatched train/test distribution
- Should have same distribution, but sometimes training set needs more data which comes from different distribution
- Then dev/test set should come from same distribution
Sometimes there is no test set, so you don't have a unbiased estimation (might overfit to dev set).