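Show HN: Former Cloudflare SRE building a live map of what’s running
Hey HN, I’m Kenneth. I spent several years as a Senior SRE at Cloudflare.
One thing that became painfully clear over time is that most outages, security issues, and compliance fire drills don’t come from a lack of tools. They come from missing context. People don’t know what’s running, how things connect, or what changed recently, especially once systems sprawl across clouds, repos, and teams.
That’s why I’m building OpsCompanion.
The goal is simple: keep a live, shared picture of what’s actually running and how it fits together.
OpsCompanion helps engineers:
- See a live, visual map of services, infrastructure, and dependencies
- Answer “what changed?” without digging through five tools, Slack threads, or outdated docs
- Preserve operational context so the next person on call isn’t starting from zero
This isn’t about adding more logs or alerts, or slapping AI on top of existing dashboards. It’s about capturing the mental model experienced operators carry in their heads and keeping it shared and up to date.
It’s still early, and there are rough edges. I’ve opened it up to a small group of engineers who work close to production so I can get honest feedback. If it’s useful, great. If not, I genuinely want to understand why and what would make it better.
You can try it here: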
<a href="https://opscompanion.ai/?utm_source=hn&utm_medium=show_hn&utm_campaign=jan_2026" rel="nofollow">https://opscompanion.ai/?utm_source=hn&utm_medium=show_hn&ut...</a>
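I’ll be around in the comments. Happy to answer technical questions, hear skepticism, get a bit roasted, or talk about what actually breaks in real systems.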