Joshua Rosen, Neoklis Polyzotis, Vinayak Borkar, Yingyi Bu, Michael J Carey, Markus Weimer, Tyson Condie, and Raghu Ramakrishnan. Iterative mapreduce for large scale machine learning. arXiv preprint arXiv:1303.3517, 2013.J. Rosen, N. Polyzotis, V. Borkar, Y. Bu, M. J. Carey, M. ...
Code Issues Pull requests An iterative computing framework for both Hadoop MapReduce and Hadoop YARN. machine-learning yarn hadoop in-memory iterative Updated May 20, 2022 Java cylondata / twister2 Star 57 Code Issues Pull requests A composable framework for fast and scalable data analytics ...
Twister: A runtime for iterative MapReduce. In Proc. the 19th ACM International Symposium on High Performance Distributed Computing, Jun. 2010, pp.810-818. DOI: 10.1145/1851476.1851593. Bu Y, Howe B, Balazinska M, Ernst M D. HaLoop: Efficient iterative data processing on large clusters. ...
The experimental results show that BSP-based applications have higher efficiency than that of MapReduce-based applications when the volume of data can be put in the memory during the course of processing; on the contrary the latter are better than the former, and the performance of BC-BSP ...
M. Rafique, "On exploiting data locality for iterative mapreduce ap- plications in hybrid clouds," in BDCAT'16: 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, Shanghai, China, 2016, pp. 118-122....
IEEE/ACM International Symposium Cluster, Cloud and Grid ComputingF. J. Clemente-Castello, B. Nicolae, M. M. Rafique, R. Mayo, and J. C. Fernandez, "Evaluation of data locality strategies for hybrid cloud bursting of iterative mapreduce," in 2017 17th IEEE/ACM International Symposium on ...
Our basic and intuitive idea is to partition a large data set recursively into smaller data partitions using MapReduce until all partitions can fit in memory of each computing node. A tree indexing structure is proposed to achieve recursive computation on MapReduce for data partitioning in ...
Cloud TechnologiesMapReduceData MiningIterative AlgorithmsMapReduce is a brilliant distributed computing strategy to process massive-scale data. However, for iterative applications, the general MapReduce needs to re-initialize runtime environment repetitively and re-load static data repetitively in every ...
Our basic and intuitive idea is to partition a large data set recursively into smaller data partitions using MapReduce until all partitions can fit in memory of each computing node. A tree indexing structure is proposed to achieve recursive computation on MapReduce for data partitioning in ...