请问HN:对于长期运行的人工智能代理,您如何定义“完成”?
我一直在研究长时间运行的自动化/代理系统,而我不断遇到的问题是如何定义“完成”。<p>演示很简单:一项任务在顺利完成后就算结束。真正的系统则复杂得多——部分失败、重试、幂等性、不明确的终止状态,以及何时停止或升级的问题。<p>对于那些构建过调度程序、代理或其他长时间运行系统的人来说:你们在实践中是如何定义“完成”的?是状态机、不变式、超时、外部信号,还是仅仅是操作启发式?
查看原文
I've been working on long-running automation / agent systems, and one thing I keep running into is how hard it is to define "done".<p>Demos are easy: a task finishes once the happy path works.
Real systems are messier — partial failures, retries, idempotency, unclear terminal states, and the question of when to stop or escalate.<p>For people who’ve built schedulers, agents, or other long-running systems:
how do you define "done" in practice?
Is it a state machine, invariants, timeouts, external signals, or just operational heuristics?