Enhhanced and Efficient Memory Model For Hadoop Mapreduce
Archana Bhaskar1, Rajeev Ranjan2
1Archana Bhaskar*, Research Scholar , REVA University, Bangalore.
2Dr. Rajeev Ranjan, Associate Professor, REVA University , Bangalore.
Manuscript received on October 12, 2019. | Revised Manuscript received on 21 October, 2019. | Manuscript published on November 10, 2019. | PP: 161-168 | Volume-9 Issue-1, November 2019. | Retrieval Number: A3958119119/2019©BEIESP | DOI: 10.35940/ijitee.A3958.119119
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Usage of high-performance computing (HPC) infrastructure adopting cloud-computing environment offers an efficient solution for executing data intensive application. MapReduce (MR) is the favored high performance parallel computing framework used in BigData study, scientific, and data intensive applications. Hadoop is one of the significantly used MR based parallel computing framework by various organization as it is freely available open source framework from Apache foundation. The existing Hadoop MapReduce (HMR) based makespan model incurs memory and I/O overhead. Thus, affecting makespan performance. For overcoming research issues and challenges, this manuscript presented an efficient parallel HMR (PHMR) makespan model. The PHMR includes a parallel execution scheme in virtual computing worker to reduce makespan times using cloud computing framework. The PHMR model provides efficient memory management design within the virtual computing workers to minimize memory allocation and transmission overheads. For evaluating performance of PHMR of over existing model experiment are conducted on public cloud environment using Azure HDInsight cloud platform. Different application such as bioinformatics, tex mining, stream, and non stream application is considered. The overall result obtained shows superior performance is attained by PHMR over existing model in term of makespan time reduction and correlation among practical and theoretical makespan values.
Keywords: Big Data, Data Intensive Application, Caching, Cloud Computing, High-performance Computing, HMR, Scientific Application.
Scope of the Article: Cloud Computing