# AI Data Sets
- Training set: dataset containing valid inputs to outputs mapping. The training set is used for training the models
- `m` represents the size of the training set. The number of training examples
- Test set: dataset to validate against and measure performance
- Data that was not seen during training
As more and more (labeled) data is available, the performance of Deep Learning approaches rises. Scaling helps improve performance.