Multi-task learning
Example: Predict if picture contains pedestrians, cars, stop signs as one output vector
Also works for NA features (if not all are present).
When to use:
- Shared lower-level features
- Amount of data for each task is similar (data could be helpful for other tasks)
- Training a big enough network possible