How does OOB error work? When bootstrap aggregation is used, two separate sets are produced. The data chosen to be “in-the-bag” by sampling with replacement is one set, the bootstrap sample. The out-of-bag set contains all data that was not picked during the sampling procedure. When ...
The table below shows a subset of the whole training dataset. Features X0-X8 are categorical variables and we have to transform them into in a form that is useable by our model – numerical values. This was done usingscikit-learn’sLabelEncoder class. fromsklearn.preprocessingimportLabelEncoder ...