A Containerized Load-Balanced Microservice Scheduling Framework for Execution Time and Deployment Cost Optimization in Cloud Environments
DOI:
https://doi.org/10.33003/Keywords:
Microservice, Container, Load Balancing, Cloud Based, Docker, AlgorithmAbstract
Cloud computing has seen widespread adoption of containerized microservice architectures, largely owing to their scalability, deployment efficiency, and operational flexibility. Despite this growth, prevailing microservice scheduling methods continue to struggle with challenges such as imbalanced workload distribution, elevated deployment costs, server overload, network bottlenecks, and excessive execution latency, all of which compromise Quality of Service (QoS) and diminish resource utilization efficiency.To address these limitations, this research introduces the Containerized Microservice Load Balancing (CMLB) framework, designed to optimize microservice scheduling within cloud environments. The framework combines distributed resource management with a hierarchical load balancing architecture composed of a Master Load Balancer (MLB) and multiple Local Load Balancers (LLBs) operating across dedicated Microservice Controllers (MSCs). Complementing this architecture, a new CMLB scheduling algorithm is proposed to simultaneously improve deployment cost-efficiency, execution speed, resource allocation, and system scalability. The underlying optimization problem is cast as a multi-objective scheduling model that seeks to minimize both application deployment cost and microservice execution time, while respecting resource capacity and workload constraints. To validate the framework, experiments were carried out using Google Cluster Trace datasets within a Docker-based microservice environment built on Spring Boot, Spring Cloud, Netflix, and Eureka service discovery. The CMLB algorithm was benchmarked against established scheduling approaches, including EPTA, Spread, Binpack, Random, and Optimal-VM. The results show that the proposed framework markedly outperforms existing methods in terms of workload balance, deployment cost reduction, execution time, traffic spike mitigation, and server overload prevention. These findings affirm that the CMLB framework delivers meaningful gains in scalability
References
Bhamare, D., Samaka, M., Erbad, A., Jain, R., & Gupta, L. (2018). Exploring microservices for enhancing internet QoS. Transactions on Emerging Telecommunications Technologies, 29(11). https://doi.org/10.1002/ett.3445
Casalicchio, E., & Perciballi, V. (2017). Auto-Scaling of Containers: The Impact of Relative and Absolute Metrics. Proceedings - 2017 IEEE 2nd International Workshops on Foundations and Applications of Self* Systems, FAS*W 2017, 207–214. https://doi.org/10.1109/FAS-W.2017.149
Cojocaru, M. D., Oprescu, A., & Uta, A. (2019). Attributes assessing the quality of microservices automatically decomposed from monolithic applications. Proceedings - 2019 18th International Symposium on Parallel and Distributed Computing, ISPDC 2019, June, 84–93. https://doi.org/10.1109/ISPDC.2019.00021
Dave, A., Patel, B., Bhatt, G., & Vora, Y. (2018). Load balancing in cloud computing using particle swarm optimization on Xen Server. 2017 Nirma University International Conference on Engineering, NUiCONE 2017, 2018-Janua, 1–6. https://doi.org/10.1109/NUICONE.2017.8325618
Ding, Z., Wang, S., & Pan, M. (2020). QoS-Constrained Service Selection for Networked Microservices. IEEE Access, 8, 39285–39299. https://doi.org/10.1109/ACCESS.2020.2974188
Erl, T., Fontenla, J., Caeiro, M., & Llamas, M. (2005). Web Services and Contemporary SOA. Service-Oriented A Rch Itectu Re Connceptsts , Technology , and Design, 25–81.
Guan, X., Wan, X., Choi, B., Song, S., & Zhu, J. (2016). Application Oriented Dynamic Resource Allocation for Data Centers Using Docker Containers. 1(c), 1–4. https://doi.org/10.1109/LCOMM.2016.2644658
Heggem, C. (2019). Container Load Balancing. https://avinetworks.com/glossary/container-load-balancing/
Hussein, S., Lahami, M., & Torjmen, M. (2024). ASSESSING THE QUALITY OF MICROSERVICE AND MONOLITHIC-BASED ARCHITECTURES : A SYSTEMATIC LITERATURE REVIEW. 7(2), 417–446.
Jain, S., & Saxena, A. K. (2017). A survey of load balancing challenges in cloud environment. Proceedings of the 5th International Conference on System Modeling and Advancement in Research Trends, SMART 2016, 291–293. https://doi.org/10.1109/SYSMART.2016.7894537
Kaur, A., & Kaur, B. (2019). Load balancing optimization based on hybrid Heuristic-Metaheuristic techniques in cloud environment. Journal of King Saud University - Computer and Information Sciences, xxxx. https://doi.org/10.1016/j.jksuci.2019.02.010
Lera, I., & Juiz, C. (2018). Genetic algorithm for multi-objective optimization of container allocation in cloud architecture. Journal of Grid Computing, 16(1), 113–135. https://doi.org/10.1007/s10723-017-9419-x
Li, K., Xu, G., Zhao, G., Dong, Y., & Wang, D. (2011). Cloud task scheduling based on load balancing ant colony optimization. Proceedings - 2011 6th Annual ChinaGrid Conference, ChinaGrid 2011, 3–9. https://doi.org/10.1109/ChinaGrid.2011.17
Lin, M., Xi, J., Bai, W., & Wu, J. (2019). Ant Colony Algorithm for Multi-Objective Optimization of Container-Based Microservice Scheduling in Cloud. IEEE Access, 7, 83088–83100. https://doi.org/10.1109/ACCESS.2019.2924414
Mahāwitthayālai Būraphā. Faculty of Informatics, IEEE Thailand Section, & Institute of Electrical and Electronics Engineers. (2017). The 2017-9th International Conference on Knowledge and Smart Technology : “Crunching Information of Everything” : February 1-4, 2017 @Amari Ocean Pattaya, Chon Buri, Thailand. 370.
Netto, M. A. S., Cardonha, C., Cunha, R. L. F., & Assunc, M. D. (2014). Evaluating Auto-scaling Strategies for Cloud Computing Environments. 187–196. https://doi.org/10.1109/MASCOTS.2014.32
Ni, Z., Wei, C., Wood, T., & Choi, N. (2022). A SmartNIC-based Load Balancing and Auto Scaling Framework for Middlebox Edge Server. 21–27. https://doi.org/10.1109/nfv-sdn53031.2021.9665167
Pan, K., & Chen, J. (2015). Load balancing in cloud computing environment based on an improved particle swarm optimization. Proceedings of the IEEE International Conference on Software Engineering and Service Sciences, ICSESS, 2015-Novem, 595–598. https://doi.org/10.1109/ICSESS.2015.7339128
Rabiu, S., Yong, C. H., & Mohamad, S. M. S. (2022). A cloud-based container microservices: A review on load-balancing and auto-scaling issues. International Journal of Data Science, 3(2), 80–92. https://doi.org/10.18517/ijods.3.2.80-92.2022
Rusek, M., Rzegorz, D., & Orłowski, A. (2016). A decentralized system for load balancing of containerized microservices in the cloud. International Conference on Systems Science (ICSS), 539(November), 142–152. https://doi.org/10.1007/978-3-319-48944-5
Shafiq, D. A., Jhanjhi, N. Z., & Abdullah, A. (2022). Load balancing techniques in cloud computing environment: A review. Journal of King Saud University - Computer and Information Sciences, 34(7), 3910–3933. https://doi.org/10.1016/j.jksuci.2021.02.007
Srirama, S. N., Adhikari, M., & Paul, S. (2020). Application deployment using containers with auto-scaling for microservices in cloud environment. Journal of Network and Computer Applications, 160(August 2019). https://doi.org/10.1016/j.jnca.2020.102629
Stevant, B., Pazat, J. L., & Blanc, A. (2018). Optimizing the Performance of a Microservice-Based Application Deployed on User-Provided Devices. Proceedings - 17th International Symposium on Parallel and Distributed Computing, ISPDC 2018, 133–140. https://doi.org/10.1109/ISPDC2018.2018.00027
Sundberg, A. (2019). A study on load balancing within microservices architecture. https://www.mendeley.com/catalogue/a4814e2d-827e-3e18-93b5-3f91efa6d98b/?utm_source=desktop&utm_medium=1.19.4&utm_campaign=open_catalog&userDocumentId=%7B0ed39c18-97ea-4c9c-994a-2dd841333607%7D
Tupid, T. (2019). Basic Guide: Load Balancing and Auto-Scaling in Cloud Computing. https://medium.com/@tudip/basic-guide-load-balancing-and-auto-scaling-in-cloud-computing-219a5f0768a
Valdivia, J. A., Limon, X., & Cortes-Verdin, K. (2020). Quality attributes in patterns related to microservice architecture: a Systematic Literature Review. 181–190. https://doi.org/10.1109/conisoft.2019.00034
Viennot, N., Lécuyer, M., Bell, J., Geambasu, R., & Nieh, J. (2015). Synapse: A microservices architecture for heterogeneous-database web applications. Proceedings of the 10th European Conference on Computer Systems, EuroSys 2015. https://doi.org/10.1145/2741948.2741975
Villamizar, M., Garcés, O., Castro, H., Verano, M., Salamanca, L., & Gil, S. (2015). Evaluating the Monolithic and the Microservice Architecture Pattern to Deploy Web Applications in the Cloud Evaluando el Patrón de Arquitectura Monolítica y de Micro Servicios Para Desplegar Aplicaciones en la Nube. 10th Computing Colombian Conference, 583–590.
Villamizar, M., Garcés, O., Ochoa, L., Castro, H., Salamanca, L., Verano, M., Casallas, R., Gil, S., Valencia, C., Zambrano, A., & Lang, M. (2017). Cost comparison of running web applications in the cloud using monolithic, microservice, and AWS Lambda architectures. Service Oriented Computing and Applications, 11(2), 233–247. https://doi.org/10.1007/s11761-017-0208-y
Wan, X., Guan, X., Wang, T., Bai, G., & Choi, B. (2018). Journal of Network and Computer Applications Application deployment using Microservice and Docker containers : Framework and optimization. 119(December 2017), 97–109.
Downloads
Published
Issue
Section
Categories
License
Copyright (c) 2026 Shamsuddeen Rabiu, Abdulrahman Nasiru Sada

This work is licensed under a Creative Commons Attribution 4.0 International License.