Depending on the magnitude of planned changes (for example, infrastructure changes) or on global outages that are scheduled to happen, the Observability script should run in an advanced mode that detects multi-customer outages by sniffing similar alerts and situations and grouping them at the cluster level.
Requirements:
1) Proactive: In Proactive mode, the Observability script will read data from CRQ Insights through API calls.
2) Reactive: In Reactive mode, the Observability script will analyze already-created Situations/Alerts and predict whether they indicate a multi-customer outage.
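To make the Reactive requirement more concrete, here is a minimal sketch of one way the grouping could work. It is only an illustration: the `Alert` fields, the `find_multi_customer_outages` name, and the `min_customers` threshold are all hypothetical and not part of any existing Observability API. It treats alerts as "similar" when they share a cluster and a signature, which is a simplifying assumption.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Alert:
    # All field names are hypothetical; this request does not define an alert schema.
    alert_id: str
    customer: str
    cluster: str
    signature: str  # assumed: a normalized alert name used to decide "similar"


def find_multi_customer_outages(alerts, min_customers=2):
    """Group alerts by (cluster, signature) and keep groups that span
    at least `min_customers` distinct customers."""
    customers_by_group = defaultdict(set)
    for alert in alerts:
        customers_by_group[(alert.cluster, alert.signature)].add(alert.customer)

    # A group is a candidate multi-customer outage only if enough
    # different customers are affected by the same kind of alert.
    return {
        group: sorted(customers)
        for group, customers in customers_by_group.items()
        if len(customers) >= min_customers
    }
```

Called with a list of already-created alerts, the function returns a dictionary keyed by `(cluster, signature)` whose values are the affected customers. A real implementation would likely also need a time window and a fuzzier notion of similarity.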
I want it: No rush - I just think it's cool