B11221292S219 - International Journal of Innovative Technology and Exploring Engineering (IJITEE)

Inclusion of Pre-Processing and Time Series Algorithms in Map Reduce Environment using Big Data Analytics
P. Nagaraj¹, P. Deepalakshmi²

¹P. Nagaraj, Department of Computer Science and Engineering, Kalasalingam Academy of Research and Education, Krishnankoil (Tamil Nadu), India.

²P. Deepalakshmi, Department of Computer Science and Engineering, Kalasalingam Academy of Research and Education, Krishnankoil (Tamil Nadu), India.

Open Access | Editorial and Publishing Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open-access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Map Reduce is one of the most effective ways of handling Big Data. Many of existing Data Mining / AI algorithms was developed in Map Reduce to provide effective results. There are many more algorithms including preprocessing algorithms such as Binarization, Normalization etc., Time series algorithms such as Moving average, Sliding Window, Correlation etc., which are not yet implemented in Map Reduce. Although there are not major algorithms they play a vital role in preprocessing and processing chunk data to a meaningful data. In this paper, we proposed a model of including these algorithms in Map Reduce to improve preprocessing outcome of Big Data much faster. The processed data can then be trained by the regression algorithms using Machine learning techniques to preprocess the huge data in a long run automatically.

Keywords: Big Data, Map Reduce, Data Mining, Pre-processing, Time Series, Regression.
Scope of the Article: Big Data Analytics

Download PDF

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US