For large volumes of data, searching on meta-data only will not be adequate. Scientists often need to search for data based on actual data values. This is a particular kind of data mining, which searches for data sets based on data content.In this chapter, we first describe the system ...
A method and apparatus for mining data relationships from an integrated database and data-mining system are disclosed. A set of frequent 1-itemsets is generated using a group-by query on data transact
Also, an expert may have questionable or incorrect rules while the data/image base may have questionable or incorrect records. Moreover, data mining discovery may take the form different from If-Then rules and these rules may need to be decoded before they are compared to expert rules. For ...
(National Institute of Astrophysics) Astronomical Observatories of Napoli and the California Institute of Technology, aims at creating a single, sustainable, distributed e-infrastructure for data mining and exploration in massive data sets, to be offered to the astronomical (but not only) community as...
With the increasing availability of large-scale biology data in crop plants, there is an urgent demand for a versatile platform that fully mines and utilizes the data for modern molecular breeding. We present Crop-GPA ( https://crop-gpa.aielab.net ), a c
with the increasing individual data collected by location-aware technologies. This paper proposes an integrated space-time pattern classification approach for individuals' travel trajectories, which differentiates itself from traditional data mining techniques, such as clustering, frequent pattern discovery and...
By accepting optional cookies, you consent to the processing of your personal data - including transfers to third parties. Some third parties are outside of the European Economic Area, with varying standards of data protection. See our privacy policy for more information on the use of your perso...
Both machine learning and integrated learning have high requirements for data sets, and a good data set can greatly improve the accuracy of experimental results [18,19,20]. However, the recognition problem based on medical insurance is a category imbalance problem. For binary classification, the ...
For each ri , we define the differential/mutual distances between its members as d~mni=rim−rind~mni=rim−rin , ∀m ,n ,m ≠n. The set of the mutual/differential between the members of riri is denoted by D~iD~i. We measure the distance between the sets D and D~iD~i by:...
We also use optional cookies for advertising, personalisation of content, usage analysis, and social media. By accepting optional cookies, you consent to the processing of your personal data - including transfers to third parties. Some third parties are outside of the European Economic Area, with...