Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | |||
data_mining:error_analysis [2018/05/21 20:11] – [Problems with different train and dev/test set dist] phreazer | data_mining:error_analysis [2018/05/21 20:24] (current) – [Problems with different train and dev/test set dist] phreazer | ||
---|---|---|---|
Line 184: | Line 184: | ||
* Degree of overfitting to dev set (if to high => bigger dev set) | * Degree of overfitting to dev set (if to high => bigger dev set) | ||
* Test: 12% | * Test: 12% | ||
+ | |||
+ | ====== Data mismatch problems ====== | ||
+ | |||
+ | * Error analysis to understand difference between training and dev/test set | ||
+ | * Make training more similar / collect more data similar to dev/test set (e.g. simulate audio environment) | ||
+ | * Artificial data synthesis | ||
+ | * Problems: Possible that sampling from too few data (for human it might appear ok) |