Springer Berlin HeidelbergData preprocessing for web data mining. Zhang Wei,Chen Tinggui. Advances in Intelligent and Soft Computing . 2012Zhang, W., & Chen, T. (2012). Data preprocessing for web data mining. In: Jin, D. & Lin, S. (Eds.), Advances in Electronic Commerce, Web ...
There are many tools for doing data pre-processing available, such as R, STATA, SAS, and Python; each differs in the level of programming background required. R is a free tool that is supported by a range of statistical and data manipulation packages. In this section of the chapter, we ...
usage.Data preprocessing is the first step and also the key step in web log mining,which determines the efficiency and quality of mining.This thesis elaborates on the general process of data preprocessing,and discusses on common techniques of data preprocessing employed currently at home and abroad...
Web log mining is the most important application in the research of Web data mining, and data preprocessing on Web log is the key technology. This paper introduces the definition and the main process of Web log mining ,using a real example, probes into the main task and the method of data...
S. (2017). Review of data preprocessing techniques in data mining. Journal of Engineering and Applied Sciences, 12(16), 4102–4107. doi:10.3923/jeasci.2017.4102.4107 (Open in a new window)Google Scholar Albawi, S., Mohammed, T. A., & Al-Zawi, S. (2018). Understanding of a ...
Tutorial on practical tips of the most influential data preprocessing algorithms in data mining. Knowl.-Based Syst. 98, 1–29 (2016). Article Google Scholar Alejo, R., Garcia, V. & Pacheco-Sanchez, J. H. An efficient over-sampling approach based on mean square error back-propagation for...
Handbook of Educational Data Mining (Chapman & Hall/CRC Data Mining and Knowledge Discovery Series) C Romero,S Ventura,VS Rita,... 被引量: 182发表: 2010年 Biological Knowledge Discovery Handbook: Preprocessing, Mining and Postprocessing of Biological Data In this chapter, the authors propose a ...
This paper focuses not only on the data preprocessing strategies and the effects on the quality of the models’ results, but also on the attribute selection. This topic is widely discussed in most, if not all papers on topics like data-driven ROP modeling. In this paper we compared attribute...
Data Preprocessing Abstract In real world applications, data usually contain errors and noise, need to be scaled and transformed, or need to be collected from different and possibly heterogeneous information sources. We distinguish deterministic and stochastic errors. Deterministic errors can sometimes be...
At present, the study on Web Usage Mining mainly focuses on pattern discovery (including Association Rules, sequence pattern, etc) and pattern analysis. However, the study on the main data sources, that is to say, the study on web-log pre-process is rela