HackerNews中文版

我一直在研究长时间运行的自动化/代理系统，而我不断遇到的问题是如何定义“完成”。<p>演示很简单：一项任务在顺利完成后就算结束。真正的系统则复杂得多——部分失败、重试、幂等性、不明确的终止状态，以及何时停止或升级的问题。<p>对于那些构建过调度程序、代理或其他长时间运行系统的人来说：你们在实践中是如何定义“完成”的？是状态机、不变式、超时、外部信号，还是仅仅是操作启发式？

查看原文

I've been working on long-running automation / agent systems, and one thing I keep running into is how hard it is to define "done".<p>Demos are easy: a task finishes once the happy path works. Real systems are messier — partial failures, retries, idempotency, unclear terminal states, and the question of when to stop or escalate.<p>For people who’ve built schedulers, agents, or other long-running systems: how do you define "done" in practice? Is it a state machine, invariants, timeouts, external signals, or just operational heuristics?

请问HN：对于长期运行的人工智能代理，您如何定义“完成”？