Skip to main content

Research Repository

Advanced Search

Towards Antifragility of Cloud Systems: An Adaptive Chaos driven Framework

Botros, Joseph S.; Al-Qora'n, Lamis F.; Al-Said Ahmad, Amro

Authors

Joseph S. Botros

Lamis F. Al-Qora'n



Abstract

Context
Unlike resilience, antifragility describes systems that get stronger rather than weaker under stress and chaos. Antifragile systems have the capacity to overcome stressors and come out stronger, whereas resilient systems are focused on their capacity to return to their previous state following a failure. As technology environments become increasingly complex, there is a great need for developing software systems that can benefit from failures while continuously improving. Most applications nowadays operate in cloud environments. Thus, with this increasing adoption of Cloud-Native Systems they require antifragility due to their distributed nature.

Objective
The paper proposes UNFRAGILE framework, which facilitates the transformation of existing systems into antifragile systems. The framework employs chaos engineering to introduce failures incrementally and assess the system's response under such perturbation and improves the quality of system response by removing fragilities and introducing adaptive fault tolerance strategies.

Method
The UNFRAGILE framework's feasibility has been validated by applying it to a cloud-native using a real-world architecture to enhance its antifragility towards long outbound service latencies. The empirical investigation of fragility is undertaken, and the results show how chaos affects application performance metrics and causes disturbances in them. To deal with chaotic network latency, an adaptation phase is put into effect.

Results
The findings indicate that the steady stage's behaviour is like the antifragile stage's behaviour. This suggests that the system could self-stabilise during the chaos without the need to define a static configuration after determining from the context of the environment that the dependent system was experiencing difficulties.

Conclusion
Overall, this paper contributes to ongoing efforts to develop antifragile software capable of adapting to the rapidly changing complex environment. Overall, the research provides an operational framework for engineering software systems that learn and improve through exposure to failures rather than just surviving them.

Citation

Botros, J. S., Al-Qora'n, L. F., & Al-Said Ahmad, A. (2024). Towards Antifragility of Cloud Systems: An Adaptive Chaos driven Framework. Information and Software Technology, Article 107519. https://doi.org/10.1016/j.infsof.2024.107519

Journal Article Type Article
Acceptance Date Jun 20, 2024
Online Publication Date Jun 22, 2024
Publication Date Jun 22, 2024
Deposit Date Jun 22, 2024
Journal Information and Software Technology
Print ISSN 0950-5849
Publisher Elsevier
Peer Reviewed Peer Reviewed
Article Number 107519
DOI https://doi.org/10.1016/j.infsof.2024.107519
Keywords Antifragility, Resilience, Chaos Engineering, Self-Adaptive, Software Resilience testing
Public URL https://keele-repository.worktribe.com/output/856761
Publisher URL https://www.sciencedirect.com/science/article/pii/S0950584924001241