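====== Multi-task learning ======

Example: Predict whether a picture contains pedestrians, cars, and stop signs, using one output vector with a separate yes/no entry per object (several entries can be positive at once).

Also works when some labels are missing (NA) for an example: the loss is summed only over the labels that are present.

When to use:

  * The tasks share lower-level features
  * The amount of data for each task is similar (data from one task can help the other tasks)
  * It is possible to train a network big enough to do well on all tasks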
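
The handling of missing labels can be sketched as a masked loss. This is only a minimal illustration, not a reference implementation: the function name ''masked_multitask_loss'', the use of NaN to mark a missing label, the choice of binary cross-entropy per output, and the example numbers are all assumptions made for this sketch.

```python
import numpy as np

def masked_multitask_loss(probs, labels):
    """Mean binary cross-entropy over the known labels only.

    probs  : predicted probabilities, shape (examples, tasks)
    labels : 0/1 targets of the same shape; NaN marks a missing label (assumption)
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=float)
    mask = ~np.isnan(labels)              # True where a label is present
    y = np.where(mask, labels, 0.0)       # placeholder value, masked out below
    p = np.clip(probs, 1e-7, 1 - 1e-7)    # avoid log(0)
    bce = -(y * np.log(p) + (1 - y) * np.log(1 - p))
    # Sum only over present labels; guard against an all-missing batch
    return float((bce * mask).sum() / max(mask.sum(), 1))

# Hypothetical data. Columns: pedestrian, car, stop sign
labels = np.array([[1.0, 0.0, np.nan],
                   [0.0, np.nan, 1.0]])
probs = np.array([[0.9, 0.2, 0.5],
                  [0.1, 0.7, 0.8]])
loss = masked_multitask_loss(probs, labels)
```

Because the missing entries are multiplied by zero, the predictions at those positions do not affect the loss, so an image labelled for only some of the objects still contributes to training the shared layers.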