Chaos Engineering provided by Nathan Claire Africa
Chaos Engineering builds resilience into your interdependent transaction processing environments by identifying and resolving vulnerabilities that could lead to performance degradation or catastrophic collapse in the event of component failure. It proactively ensures that your technology environment is engineered to withstand unexpected disruptions that will impact your revenues and your brand, and divert your best minds away from innovation and productivity.
Nathan Claire’s subject matter experts will work with your SRE teams to build resilience and reliability into your environment thereby protecting it against chaos. Scenarios will be designed and executed to see what will actually happen if critical components (production servers, services, processes, connections or even your disaster recovery systems) break for any reason including high loads. Tests can be halted or rolled back safely and automatically.
Rather than building failsafe mechanisms around what they think will happen, your SRE teams can see what actually happens when something breaks, and then build precise failsafe mechanisms such as code-based fixes, scaling of compute or storage, restarting a service, etc. Auto-remediation using orchestration tools can also be built into SRE. and the final result is a hardened environment, optimizing costs, service delivery, transaction processing and customer satisfaction.