📄
Abstract
Cloud native architectures are essential for modern software systems due to their ability to handle dynamic environments, scalability, and high availability. However, ensuring resilience in these systems remains a significant challenge, particularly under varying operational conditions such as high-load periods and failure scenarios. This study aims to assess the resilience of cloud native architectures using quantitative metrics that objectively evaluate key attributes such as availability, fault tolerance, recovery time, and scalability. Through the application of these metrics, the study identifies the strengths and weaknesses of the architecture, providing insights into how the system performs under stress and recovers from failures. The results show that while the architecture demonstrates strong availability and scalability under typical conditions, recovery time and scalability under extreme load conditions reveal areas for improvement. Specifically, issues with resource allocation and self-healing capabilities were identified as key weaknesses affecting the overall resilience of the system. These findings highlight the importance of using data-driven metrics to gain detailed insights into system resilience and to guide architectural improvements. The study also emphasizes the need for continuous monitoring and adaptation of the architecture to optimize fault tolerance and recovery processes. The implications of this research extend to cloud application developers and architects, offering actionable recommendations for improving system resilience. Future research could focus on integrating real-time monitoring systems, developing more advanced resilience metrics, and incorporating AI-driven scaling techniques to further enhance the adaptability and robustness of cloud native systems. By addressing these challenges, cloud native architectures can be better equipped to maintain high performance and reliability in dynamic, real-world environments.