Data lakes are widely used to store extensive and heterogeneous datasets for advanced analytics. However, the unstructured nature of data in these repositories introduces complexities in exploiting them and ext
We conducted our experiment using four real-world datasets from Yelp.com. The results demonstrated that there is a strong negative correlation between concept drift and the performance of fake review detection/prediction models, which indicates the difficulty of building more efficient models....
Formal concept analysis (FCA) is a theory of data analysis that identifies conceptual structures among datasets. A strong feature of FCA is its capability of producing graphical visualizations of the inherent structures among data. High-risk systems have some special characteristics that result in ...
It can be used to discover knowledge patterns and implication rules in multi-relational datasets. The classification output by RCA is a family of lattices whose graphical representation facilitates the analysis by an expert. However, RCA comes with specific complexity issues. It iterates on the ...
Repository for the AdaptiveRandomForest algorithm implemented in MOA 2016-04 random-forestclassificationensembleensemble-learningdecision-treesconcept-driftmoadatastream UpdatedOct 18, 2017 Java concept drift datasets edited to work with scikit-multiflow directly ...
cytometry, aberration metrology, long-range imaging and coherent X-ray nanoscopy. A collection of datasets and reconstruction codes is provided for readers interested in implementing FP themselves. Key points Fourier ptychography (FP) is a computational method for synthesizing raw data into a high-...
Data depth can be also used to screen for outliers. The ability of the new notions of depth to detect shape outliers is presented. Several real datasets are considered to illustrate this new concept of depth, including applications to microarray observations, weather data, and growth curves. ...
In this paper, we present an approach to facilitate the journals classification of the DBLP datasets. For the analysis, the DBLP data sets were preprocessed by assigning each journal attributes defined by its topics and then the theory of formal concept analysis is introduced. It is subsequently ...
In addition, we tested TransformerCPI2.0 and the baseline models on the other two external datasets: a large external set containing new proteins and molecules, and a time-split test set named the ChEMBL27 dataset containing the new data that were deposited online after the training set (Fig....
Intersectionality is a concept that originated in Black feminist movements in the US-American context of the 1970s and 1980s, particularly in the work of feminist scholar and lawyer Kimberlé W. Crenshaw. Intersectional approaches aim to highlight the in