# SECTION 2 * you need to split the training set and a test set to balance the machine learning (you train on the test set and test those assumptions on the test set) ? what is categorical data, why whould you use it? * feature scaling: put all values on the same scale so the larger number do not destroy other numbers => standardisation ou normalisation