So we cannot take scale = 2 either, because it makes the decision range too inclusive and flags too few points as outliers. In other words, the decision range becomes so wide (wider than 3σ) that some genuine outliers are treated as ordinary data points, which is not desirable either.
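To make the comparison with 3σ concrete, here is a minimal sketch that expresses the upper IQR fence, Q3 + scale × IQR, in units of σ. It assumes the data follow a standard normal distribution; the helper names `upper_fence_in_sigma` and `outlier_fraction` are made up for this illustration and are not from any library.

```python
from statistics import NormalDist

# Assumption: the data are Gaussian, so we work with a standard normal
# (mean 0, sigma 1) and read the fence position directly in sigma units.
STD_NORMAL = NormalDist(mu=0.0, sigma=1.0)


def upper_fence_in_sigma(scale: float) -> float:
    """Return Q3 + scale * IQR for a standard normal, in units of sigma."""
    q1 = STD_NORMAL.inv_cdf(0.25)  # first quartile
    q3 = STD_NORMAL.inv_cdf(0.75)  # third quartile
    iqr = q3 - q1
    return q3 + scale * iqr


def outlier_fraction(scale: float) -> float:
    """Fraction of Gaussian data falling outside both fences (two-sided)."""
    fence = upper_fence_in_sigma(scale)
    return 2.0 * (1.0 - STD_NORMAL.cdf(fence))


for scale in (1.0, 1.5, 2.0):
    fence = upper_fence_in_sigma(scale)
    print(f"scale={scale}: fence at {fence:.3f} sigma, "
          f"{outlier_fraction(scale):.4%} of points flagged")
```

Under this Gaussian assumption, scale = 2 puts the fence at roughly 3.37σ, beyond the 3σ mark, while scale = 1.5 keeps it at roughly 2.7σ, just inside it.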