📄
Abstract
Cloud-native systems are essential for modern software development, offering enhanced scalability, flexibility, and resilience through cloud computing environments. However, ensuring the reliability and performance of these systems presents a challenge due to their dynamic and distributed nature. Traditional testing methods, such as unit and integration testing, while valuable for detecting individual component defects and interactions, are insufficient for predicting failure rates in complex, cloud-native applications. This study explores the effectiveness of various testing techniques and quality metrics in predicting failure rates within scalable cloud-native systems. A comparative experimental study was conducted using three primary testing techniques: unit testing, integration testing, and chaos testing. The results indicate that chaos testing, when combined with advanced quality metrics such as migration rate and mismigration rate, significantly outperforms traditional methods in predicting failure rates and evaluating system resilience. These findings suggest that chaos testing offers a more comprehensive evaluation, simulating real-world disruptions to test system behavior under stress, which is essential for cloud-native environments where high availability and fault tolerance are critical. The study also highlights the importance of integrating predictive quality metrics, which improve the accuracy of failure predictions and enhance system reliability. The study concludes that for cloud-native systems, a combination of advanced testing techniques and predictive metrics is essential for ensuring high availability, scalability, and reliability in dynamic environments. Future research should focus on refining predictive testing approaches, developing standardized frameworks, and empirically validating new testing methods to address the growing complexity of cloud-native systems.