Explore and run machine learning code with Kaggle Notebooks | Using data from Titanic - Machine Learning from Disaster
1. Number of times pregnant 2. Plasma glucose concentration a 2 hours in an oral glucose tolerance test 3. Diastolic blood pressure (mm Hg) 4. Triceps skin fold thickness (mm) 5. 2-Hour serum insulin (mu U/ml) 6. Body mass index (weight in kg/(height in m)^2) 7. Diabetes pedi...
清洗数据 Cleaning missing values 基本的统计分析 Performing some basic statistical analysis 数据转化/操...
This tutorial will introduce the use of Python for statistical data analysis, using data stored as Pandas DataFrame objects. Much of the work involved in analyzing data resides in importing, cleaning and transforming data in preparation for analysis. Therefore, the first half of the course is comp...
ELS, 《Elements of statistical learning》; Downey,《Think Bayes: Bayesian Statistics in Python》 (有待补充) 二、机器学习 PRML,《Pattern Recognition And Machine Learning》; MLAPP,《Machine Learning: A Probabilistic Perspective》; FML,《Foundations of Machine Learning》; ...
In statistical analysis, there are three main fundamental concepts associated with describing the data: location or Central tendency, Dissemination or spread, and Shape or distribution. A raw dataset is difficult to describe; descriptive statistics describe the dataset in a way simpler manner through;...
Wiki:In statistics, exploratory data analysis is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us bey...
—Statsmodels is a library for statistical and econometric analysis in Python. This paper discusses the current relationship between statistics and Python and open source more generally, outlining how the statsmodels package fills a gap in this relationship. An overview of statsmodels is provided, inclu...
Experiment or elapsed time; each timestamp is a measure of time relative to a particular start time (e.g., the diameter of a cookie baking each second since being placed in the oven) 虽然很多技术都可用于处理实验型的时间序列,其索引可能是一个整数或浮点数(表示从实验开始所经过的时间),但本章...
QuTiP - Quantum Toolbox in Python. RDKit - Cheminformatics and Machine Learning Software. SciPy - A Python-based ecosystem of open-source software for mathematics, science, and engineering. SimPy - A process-based discrete-event simulation framework. statsmodels - Statistical modeling and econometrics...