A Containerized Load-Balanced Microservice Scheduling Framework for Execution Time and Deployment Cost Optimization in Cloud Environments

Authors

  • Shamsuddeen Rabiu Federal University Dutsinma, Katsina
  • Abdulrahman Nasiru Sada

DOI:

https://doi.org/10.33003/

Keywords:

Microservice, Container, Load Balancing, Cloud Based, Docker, Algorithm

Abstract

Cloud computing has seen widespread adoption of containerized microservice architectures, largely owing to their scalability, deployment efficiency, and operational flexibility. Despite this growth, prevailing microservice scheduling methods continue to struggle with challenges such as imbalanced workload distribution, elevated deployment costs, server overload, network bottlenecks, and excessive execution latency, all of which compromise Quality of Service (QoS) and diminish resource utilization efficiency.To address these limitations, this research introduces the Containerized Microservice Load Balancing (CMLB) framework, designed to optimize microservice scheduling within cloud environments. The framework combines distributed resource management with a hierarchical load balancing architecture composed of a Master Load Balancer (MLB) and multiple Local Load Balancers (LLBs) operating across dedicated Microservice Controllers (MSCs). Complementing this architecture, a new CMLB scheduling algorithm is proposed to simultaneously improve deployment cost-efficiency, execution speed, resource allocation, and system scalability. The underlying optimization problem is cast as a multi-objective scheduling model that seeks to minimize both application deployment cost and microservice execution time, while respecting resource capacity and workload constraints. To validate the framework, experiments were carried out using Google Cluster Trace datasets within a Docker-based microservice environment built on Spring Boot, Spring Cloud, Netflix, and Eureka service discovery. The CMLB algorithm was benchmarked against established scheduling approaches, including EPTA, Spread, Binpack, Random, and Optimal-VM. The results show that the proposed framework markedly outperforms existing methods in terms of workload balance, deployment cost reduction, execution time, traffic spike mitigation, and server overload prevention. These findings affirm that the CMLB framework delivers meaningful gains in scalability

References

Bhamare, D., Samaka, M., Erbad, A., Jain, R., & Gupta, L. (2018). Exploring microservices for enhancing internet QoS. Transactions on Emerging Telecommunications Technologies, 29(11). https://doi.org/10.1002/ett.3445

Casalicchio, E., & Perciballi, V. (2017). Auto-Scaling of Containers: The Impact of Relative and Absolute Metrics. Proceedings - 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems, FAS*W 2017, 207–214. https://doi.org/10.1109/FAS-W.2017.149

Cojocaru, M. D., Oprescu, A., & Uta, A. (2019). Attributes assessing the quality of microservices automatically decomposed from monolithic applications. Proceedings - 2019 18th International Symposium on Parallel and Distributed Computing, ISPDC 2019, June, 84–93. https://doi.org/10.1109/ISPDC.2019.00021

Dave, A., Patel, B., Bhatt, G., & Vora, Y. (2018). Load balancing in cloud computing using particle swarm optimization on Xen Server. 2017 Nirma University International Conference on Engineering, NUiCONE 2017, 2018-Janua, 1–6. https://doi.org/10.1109/NUICONE.2017.8325618

Ding, Z., Wang, S., & Pan, M. (2020). QoS-Constrained Service Selection for Networked Microservices. IEEE Access, 8, 39285–39299. https://doi.org/10.1109/ACCESS.2020.2974188

Erl, T., Fontenla, J., Caeiro, M., & Llamas, M. (2005). Web Services and Contemporary SOA. Service-Oriented A Rch Itectu Re Connceptsts , Technology , and Design, 25–81.

Guan, X., Wan, X., Choi, B., Song, S., & Zhu, J. (2016). Application Oriented Dynamic Resource Allocation for Data Centers Using Docker Containers. 1(c), 1–4. https://doi.org/10.1109/LCOMM.2016.2644658

Heggem, C. (2019). Container Load Balancing. https://avinetworks.com/glossary/container-load-balancing/

Hussein, S., Lahami, M., & Torjmen, M. (2024). ASSESSING THE QUALITY OF MICROSERVICE AND MONOLITHIC-BASED ARCHITECTURES : A SYSTEMATIC LITERATURE REVIEW. 7(2), 417–446.

Jain, S., & Saxena, A. K. (2017). A survey of load balancing challenges in cloud environment. Proceedings of the 5th International Conference on System Modeling and Advancement in Research Trends, SMART 2016, 291–293. https://doi.org/10.1109/SYSMART.2016.7894537

Kaur, A., & Kaur, B. (2019). Load balancing optimization based on hybrid Heuristic-Metaheuristic techniques in cloud environment. Journal of King Saud University - Computer and Information Sciences, xxxx. https://doi.org/10.1016/j.jksuci.2019.02.010

Lera, I., & Juiz, C. (2018). Genetic algorithm for multi-objective optimization of container allocation in cloud architecture. Journal of Grid Computing, 16(1), 113–135. https://doi.org/10.1007/s10723-017-9419-x

Li, K., Xu, G., Zhao, G., Dong, Y., & Wang, D. (2011). Cloud task scheduling based on load balancing ant colony optimization. Proceedings - 2011 6th Annual ChinaGrid Conference, ChinaGrid 2011, 3–9. https://doi.org/10.1109/ChinaGrid.2011.17

Lin, M., Xi, J., Bai, W., & Wu, J. (2019). Ant Colony Algorithm for Multi-Objective Optimization of Container-Based Microservice Scheduling in Cloud. IEEE Access, 7, 83088–83100. https://doi.org/10.1109/ACCESS.2019.2924414

Mahāwitthayālai Būraphā. Faculty of Informatics, IEEE Thailand Section, & Institute of Electrical and Electronics Engineers. (2017). The 2017-9th International Conference on Knowledge and Smart Technology : “Crunching Information of Everything” : February 1-4, 2017 @Amari Ocean Pattaya, Chon Buri, Thailand. 370.

Netto, M. A. S., Cardonha, C., Cunha, R. L. F., & Assunc, M. D. (2014). Evaluating Auto-scaling Strategies for Cloud Computing Environments. 187–196. https://doi.org/10.1109/MASCOTS.2014.32

Ni, Z., Wei, C., Wood, T., & Choi, N. (2022). A SmartNIC-based Load Balancing and Auto Scaling Framework for Middlebox Edge Server. 21–27. https://doi.org/10.1109/nfv-sdn53031.2021.9665167

Pan, K., & Chen, J. (2015). Load balancing in cloud computing environment based on an improved particle swarm optimization. Proceedings of the IEEE International Conference on Software Engineering and Service Sciences, ICSESS, 2015-Novem, 595–598. https://doi.org/10.1109/ICSESS.2015.7339128

Rabiu, S., Yong, C. H., & Mohamad, S. M. S. (2022). A cloud-based container microservices: A review on load-balancing and auto-scaling issues. International Journal of Data Science, 3(2), 80–92. https://doi.org/10.18517/ijods.3.2.80-92.2022

Rusek, M., Rzegorz, D., & Orłowski, A. (2016). A decentralized system for load balancing of containerized microservices in the cloud. International Conference on Systems Science (ICSS), 539(November), 142–152. https://doi.org/10.1007/978-3-319-48944-5

Shafiq, D. A., Jhanjhi, N. Z., & Abdullah, A. (2022). Load balancing techniques in cloud computing environment: A review. Journal of King Saud University - Computer and Information Sciences, 34(7), 3910–3933. https://doi.org/10.1016/j.jksuci.2021.02.007

Srirama, S. N., Adhikari, M., & Paul, S. (2020). Application deployment using containers with auto-scaling for microservices in cloud environment. Journal of Network and Computer Applications, 160(August 2019). https://doi.org/10.1016/j.jnca.2020.102629

Stevant, B., Pazat, J. L., & Blanc, A. (2018). Optimizing the Performance of a Microservice-Based Application Deployed on User-Provided Devices. Proceedings - 17th International Symposium on Parallel and Distributed Computing, ISPDC 2018, 133–140. https://doi.org/10.1109/ISPDC2018.2018.00027

Sundberg, A. (2019). A study on load balancing within microservices architecture. https://www.mendeley.com/catalogue/a4814e2d-827e-3e18-93b5-3f91efa6d98b/?utm_source=desktop&utm_medium=1.19.4&utm_campaign=open_catalog&userDocumentId=%7B0ed39c18-97ea-4c9c-994a-2dd841333607%7D

Tupid, T. (2019). Basic Guide: Load Balancing and Auto-Scaling in Cloud Computing. https://medium.com/@tudip/basic-guide-load-balancing-and-auto-scaling-in-cloud-computing-219a5f0768a

Valdivia, J. A., Limon, X., & Cortes-Verdin, K. (2020). Quality attributes in patterns related to microservice architecture: a Systematic Literature Review. 181–190. https://doi.org/10.1109/conisoft.2019.00034

Viennot, N., Lécuyer, M., Bell, J., Geambasu, R., & Nieh, J. (2015). Synapse: A microservices architecture for heterogeneous-database web applications. Proceedings of the 10th European Conference on Computer Systems, EuroSys 2015. https://doi.org/10.1145/2741948.2741975

Villamizar, M., Garcés, O., Castro, H., Verano, M., Salamanca, L., & Gil, S. (2015). Evaluating the Monolithic and the Microservice Architecture Pattern to Deploy Web Applications in the Cloud Evaluando el Patrón de Arquitectura Monolítica y de Micro Servicios Para Desplegar Aplicaciones en la Nube. 10th Computing Colombian Conference, 583–590.

Villamizar, M., Garcés, O., Ochoa, L., Castro, H., Salamanca, L., Verano, M., Casallas, R., Gil, S., Valencia, C., Zambrano, A., & Lang, M. (2017). Cost comparison of running web applications in the cloud using monolithic, microservice, and AWS Lambda architectures. Service Oriented Computing and Applications, 11(2), 233–247. https://doi.org/10.1007/s11761-017-0208-y

Wan, X., Guan, X., Wang, T., Bai, G., & Choi, B. (2018). Journal of Network and Computer Applications Application deployment using Microservice and Docker containers : Framework and optimization. 119(December 2017), 97–109.

Microservice Architecture (Villamizar et al., 2025)

Downloads

Published

22-06-2026

How to Cite

Rabiu, S., & Sada, A. N. (2026). A Containerized Load-Balanced Microservice Scheduling Framework for Execution Time and Deployment Cost Optimization in Cloud Environments. FUDMA JOURNAL OF SCIENCES, 10(ANB-K), 15-25. https://doi.org/10.33003/