Leveraging Cloud-Native Architectures for Enhanced Data Wrangling Efficiency: A Security and Performance Perspective
Prakash Somasundaram
Prakash Somasundaram, Department of Computer Science, Northeastern University, San Francisco, California, United States of America (USA).
Manuscript received on 27 January 2024 | Revised Manuscript received on 04 March 2024 | Manuscript Accepted on 15 March 2024 | Manuscript published on 30 March 2024 | PP: 17-21 | Volume-13 Issue-4, March 2024 | Retrieval Number: 100.1/ijitee.D982113040324 | DOI: 10.35940/ijitee.D9821.13040324
Open Access | Editorial and Publishing Policies | Cite | Zenodo | OJS | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: In the contemporary landscape of big data analytics, cloud computing environments have emerged as pivotal platforms for data-wrangling processes, catering to the ingestion and transformation of vast datasets. This research paper explores optimization strategies for data wrangling within cloud computing environments, a critical component in the realm of big data analytics. It addresses the significant security and performance challenges encountered during data pipeline execution in cloud platforms. By proposing a novel strategy that includes executing data pipelines within a customer’s Virtual Private Cloud (VPC) and employing pushdown optimization for data transformation tasks in cloud data warehouses and databases, this approach seeks to enhance security and performance. The paper examines the theoretical underpinnings and practical applications of these strategies, conducting a comparative analysis with traditional data-wrangling methods to underscore the benefits of performance and security. Additionally, it assesses the implications of this approach on cost, scalability, and manageability within cloud architectures. The findings offer valuable insights and recommendations for deploying these optimization techniques in practical scenarios, setting the stage for future research in refining data-wrangling practices in cloud environments.
Keywords: Data Wrangling, Cloud Computing, Virtual Private Cloud (VPC), Pushdown Optimization, Cloud Data Warehouses, Data Security, Performance Enhancement.
Scope of the Article: Cloud Computing