Preprint Article Version 2 Preserved in Portico This version is not peer-reviewed

Towards Improving YARN performance for Frugal Heterogeneous SBC-based Edge Clusters

Version 1 : Received: 30 March 2024 / Approved: 1 April 2024 / Online: 2 April 2024 (09:14:06 CEST)
Version 2 : Received: 3 May 2024 / Approved: 3 May 2024 / Online: 3 May 2024 (10:40:13 CEST)

How to cite: Qureshi, B. Towards Improving YARN performance for Frugal Heterogeneous SBC-based Edge Clusters. Preprints 2024, 2024040154. https://doi.org/10.20944/preprints202404.0154.v2 Qureshi, B. Towards Improving YARN performance for Frugal Heterogeneous SBC-based Edge Clusters. Preprints 2024, 2024040154. https://doi.org/10.20944/preprints202404.0154.v2

Abstract

Efficient resource allocation is crucial in clusters with frugal Single-Board Computers (SBCs) possessing limited computational resources. These clusters are increasingly being deployed in edge computing environments in resource-constrained settings where energy efficiency and cost-effectiveness are paramount. A major challenge in Hadoop YARN scheduling is load-balancing, as frugal nodes within the cluster can become overwhelmed, resulting in degraded performance and frequent occurrences of out-of-memory errors, ultimately leading to job failures. In this study, we introduce an Adaptive Multi-criteria Selection for Efficient Resource Allocation (AMS-ERA) in Frugal Heterogeneous Hadoop Clusters. Our criterion considers CPU, memory and disk requirements for jobs and aligns the requirements with available resources in the cluster for optimal resource allocation. To validate our approach, we deploy a heterogeneous SBC-based cluster consisting of 11 SBC nodes and conduct several experiments to evaluate the performance using Hadoop wordcount and terasort benchmark for various workload settings. The results are compared to the Hadoop-Fair, FOG and IDaPS scheduling strategies. Our results demonstrate a significant improvement in performance with the proposed AMS-ERA, reducing execution time by 27.2%, 17.4% and 7.6% respectively using terasort and wordcount benchmarks.

Keywords

 frugal hadoop clusters; dynamic analytical hierarchy process; locality aware data placement; single board computers 

Subject

Computer Science and Mathematics, Computer Science

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.