===== Data set partitioning =====

Train / Dev / Test set

Traditionally: 60 / 20 / 20

Big data (> 1 million cases): 99.5 / 0.4 / 0.1 (the dev and test shares can shrink because 0.4 % of a million is still 4,000 cases)
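
A minimal sketch of such a split, assuming the data fits in memory as a plain Python sequence. The function name ''split_dataset'' and its parameters are made up for illustration and do not come from any library:

```python
import random

def split_dataset(examples, fractions=(0.6, 0.2, 0.2), seed=0):
    """Shuffle examples and split them into train / dev / test lists.

    fractions is (train, dev, test) and must sum to 1.
    split_dataset is a hypothetical helper, not a library function.
    """
    if abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("fractions must sum to 1")

    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split reproducible

    n = len(shuffled)
    n_train = int(round(n * fractions[0]))
    n_dev = int(round(n * fractions[1]))

    train = shuffled[:n_train]
    dev = shuffled[n_train:n_train + n_dev]
    test = shuffled[n_train + n_dev:]  # the test set takes whatever remains
    return train, dev, test

# Traditional split on a small data set
train, dev, test = split_dataset(range(1000))

# Big-data style split: tiny dev and test shares
big_train, big_dev, big_test = split_dataset(range(100_000), (0.995, 0.004, 0.001))
```

Shuffling before slicing matters here: if the data is sorted (by date, class, or source), an unshuffled slice would give the three sets different distributions.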

Another problem: make sure that the dev and test sets come from the same distribution (e.g. not low-quality user-uploaded pictures in one and high-resolution pictures in the other).
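
One simple way to get this, sketched under the assumption that all examples from the target distribution are available as a single list: shuffle that pool once and cut it in half, so dev and test are drawn from the same source. The helper name ''dev_test_from_same_pool'' and the file names are hypothetical:

```python
import random

def dev_test_from_same_pool(target_examples, seed=0):
    """Draw dev and test from one shuffled pool so both share a distribution.

    dev_test_from_same_pool is a hypothetical helper for illustration only.
    """
    pool = list(target_examples)
    random.Random(seed).shuffle(pool)
    half = len(pool) // 2
    return pool[:half], pool[half:]

# Hypothetical file names standing in for user-uploaded pictures
user_uploads = [f"upload_{i}.jpg" for i in range(10)]
dev, test = dev_test_from_same_pool(user_uploads)
```

The point of the design is that no source-specific rule decides which example lands in dev and which in test; only the random shuffle does.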