Collecting messages from Click House. (Some gaps are identified and communicated).
08 Jan 2025 Matta, Isaac to provide full requirements and use cases where Click House Command Line interface would assist SRE Top Customers and NOC/MIM customers in following scenarios: 1) Outages (PPT) 2) Health Check Bridges 3) Critical Alerts/Situations 4) Focused Environments
28 Jan 2025 To be discussed with Sahil
11 Feb 2025 Matta, Isaac Will provide the Jira case. Patil, Manoj Will arrange a call. impact: Risk of losing import log messages, delayed resolution, remediation due to logs missing.
risk of reoccurrence: Anytime when there is a functional, partial, or total outage.
I want it... | Soon |
@Sumeet Gill We will not do it as there are other ways to achieve this use case.
@Jason Ferens @Guest you can use STP portal for downloading Threadump which are automatically collected based on the POD probe failure and auto pod restarts.
Screenshot for navigation.
You can use support assist tool to download all other logs including thread dumps.
@Jason Ferens Today support assist tools takes, 15-30 mins to download the all logs from one customer environment.
Expectation is to collect logs automatically as soon as bridge is raised to save the time and can be used later for RCA.
Please list PODS and directories logs to be collected every time.
All PODs from customer environments.