conducting-chaos-engineering

Name: conducting-chaos-engineering
Author: jeremylongshore

by jeremylongshore

Installation

Pick a client and clone the repository into its skills directory.

Installation

Quick info

Author: jeremylongshore
Category: Testing

GitHub repo

About this skill

This skill enables Claude to design and execute chaos engineering experiments to test system resilience. It is used when the user requests help with failure injection, latency simulation, resource exhaustion testing, or resilience validation. The skill is triggered by discussions of chaos experiments (GameDays), failure injection strategies, resilience testing, and validation of recovery mechanisms like circuit breakers and retry logic. It leverages tools like Chaos Mesh, Gremlin, Toxiproxy, and AWS FIS to simulate real-world failures and assess system behavior.

How to use

Opisz swój system i cele testowania — powiedz mi, jaką usługę chcesz testować i jakie scenariusze awarii Cię interesują (np. symulacja opóźnień, wyczerpanie zasobów, przerwy w połączeniu).
Wspólnie definiujemy zakres eksperymentu — określamy docelowy komponent, typ awarii oraz metryki, które będziemy monitorować podczas testu.
Wybieram odpowiednie narzędzie — na podstawie Twojego środowiska (Kubernetes, AWS, lokalne) rekomenduje Chaos Mesh, Gremlin, Toxiproxy lub AWS FIS.
Pomagam skonfigurować eksperyment — przygotowuję konfigurację, skrypty lub parametry potrzebne do uruchomienia testu w Twoim systemie.
Monitorujemy zachowanie systemu — obserwujemy, jak system reaguje na symulowane awarie, zbierając dane o wydajności i błędach.
Analizuję wyniki i daję rekomendacje — identyfikuję odkryte słabe punkty i proponuję konkretne ulepszenia mechanizmów odporności, takich jak timeout'y, retry'e lub failover'y.

Related skills

dependency-upgrade

by wshobson

Manage major dependency version upgrades with compatibility analysis, staged rollout, and comprehensive testing. Use when upgrading framework versions, updating major dependencies, or managing breaking changes in libraries.

Testing

17138

playwright

by BloomBooks

How to make good playwright (e2e) tests for this project.

Testing

1298

nextjs-developer

by zenobi-us

Expert Next.js developer mastering Next.js 14+ with App Router and full-stack features. Specializes in server components, server actions, performance optimization, and production deployment with focus on building fast, SEO-friendly applications.

Testing

166226

code-review-excellence

by wshobson

Master effective code review practices to provide constructive feedback, catch bugs early, and foster knowledge sharing while maintaining team morale. Use when reviewing pull requests, establishing review standards, or mentoring developers.

Testing

1145

backtesting-frameworks

by wshobson

Build robust backtesting systems for trading strategies with proper handling of look-ahead bias, survivorship bias, and transaction costs. Use when developing trading algorithms, validating strategies, or building backtesting infrastructure.

Testing

12105

hono

by openstatusHQ

Efficiently develop Hono applications using Hono CLI. Supports documentation search, API reference lookup, request testing, and bundle optimization.

Testing

1257