Statistics is a fundamental tool of data science because it underlies most machine learning algorithms, which makes it an important prerequisite for applied machine learning.
“Statistics is the science of learning generalizable knowledge from data.”
raw data → processed data: remove anomalies, improve comparability
population → sample (probability)
sample → population (inference)
Programming and notebook examples: how to do a reproducible analysis and how to intermix text with code (similar to Python's Jupyter notebooks) ...
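The three arrows in the notes above can be illustrated with a small, reproducible sketch. Everything in it is an assumption made for illustration and not part of the source: the synthetic population (heights with mean 170 and standard deviation 10), the injected recording error, Tukey's IQR rule as the anomaly filter, z-scores as the way to improve comparability, and the normal-approximation interval as the inference step. The helper names `remove_outliers`, `standardize`, and `mean_confidence_interval` are hypothetical.

```python
import random
import statistics


def remove_outliers(values, k=1.5):
    """Drop points outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule, one common choice)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


def standardize(values):
    """Rescale to mean 0 and sample standard deviation 1 so variables are comparable."""
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)
    return [(v - mean) / sd for v in values]


def mean_confidence_interval(sample, z=1.96):
    """Approximate 95% interval for the population mean (normal approximation)."""
    mean = statistics.fmean(sample)
    standard_error = statistics.stdev(sample) / len(sample) ** 0.5
    return mean - z * standard_error, mean + z * standard_error


# A fixed seed keeps the analysis reproducible from run to run.
rng = random.Random(0)

# population -> sample (probability): draw a random sample from a synthetic population.
population = [rng.gauss(170.0, 10.0) for _ in range(100_000)]
sample = rng.sample(population, 200)

# raw data -> processed data: remove an injected recording error, then standardize.
raw = sample + [1000.0]
clean = remove_outliers(raw)
z_scores = standardize(clean)

# sample -> population (inference): estimate the population mean from the sample.
low, high = mean_confidence_interval(clean)
print(f"kept {len(clean)} of {len(raw)} raw values")
print(f"estimated population mean lies in [{low:.2f}, {high:.2f}]")
print(f"true population mean: {statistics.fmean(population):.2f}")
```

Run in a notebook cell, the printed interval can be compared with the true population mean, which is known here only because the population is simulated. With real data only the sample is available, and the interval is what carries the inference.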
Next, we compare the statistical approach with those in computer science and machine learning and argue that the convergence of different methodologies for data analysis will be the core of the new field of data science. Then, we present two examples of Big Data analysis in which several new ...