Manuscript Number : ICASCT2533
Incremental-Parallel Data Stream Classification in Apache Spark Environment
Authors(2) :-S. Tamilarasi, Dr. KungumaRaj With notorious domain of big data age, the challenging task on data stream classification is high velocity conceptual infinite stream and perspective statistical properties of data which differs periodically. In this paper, we propose an Incremental Parallel Random Forest (IPRF) algorithm for data streams in spark cloud computing environment. The algorithm incrementally estimates the accuracy for classifying the data streams, which priors to parallelization process in order to reduce the training time and prediction process using random sampling and filtering approach, that improves the dynamic-data allocation and task-scheduling mechanism in a cloud environment. From the perspective of dynamic-data allocation, dynamically changes the data in a data stream environment, to reduce the communication cost, volume data using vertically data-partitioning, data-multiplexing method. From the perspective of task scheduling, an incremental-parallel technique is carried out in the training process of Random Forest and a task directed acyclic graph depends upon resilient distributed data objects as static, redundant, and least data object appending to re-organize the mapping relationship between successor task and slaves. The details and the results of evaluating the proposed mechanism using benchmark datasets are presented in this paper.
S. Tamilarasi Big Data, Data Stream Classification, Incremental-Parallel Technique Publication Details
Published in : Volume 3 | Issue 5 | May-June 2017 Article Preview
Department of Computer science,Research Scholar of ,Mother Teresa University, Kodaikanal, Tamilnadu, India
Dr. KungumaRaj
Department of Computer Applications,Head , Assitant Professor of Arulmigu Palaniandavar College for Women Palani, Tamilnadu, India
Date of Publication : 2017-04-30
License: This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 210-213
Manuscript Number : ICASCT2533
Publisher : Technoscience Academy
Journal URL : https://ijsrst.com/ICASCT2533
Citation Detection and Elimination |
|
| BibTeX | RIS | CSV