Doctor Droid reposted this
At Doctor Droid (YC W23) we are building an Open Source framework to enable engineers automate investigations for on-call issues. I know it's a big vision, but from current feedback this framework makes it 10x easier for engineers to investigate issues in production. How? * Senior engineers can define steps for investigation (with deep integration to 10+ observability tools -- query metrics, logs, DBs, run scripts, check configs) * The defined playbooks/runbooks can be connected to alert messages from Slack / PagerDuty / OpsGenie / etc.. No need to investigate manually from Google Docs. or trying to guess. * Engineer can see the investigation snapshot at the alerting moment / see the investigation from past alerts -- so they can correlate faster. We are currently inviting beta testers for feedback on the repo. If anyone is interested, please DM me -- I'll send you the repo link with more details! #debugging #startups #observability