HackerNews中文版

我在模型上下文管理方面遇到了瓶颈。在免费或低级别的API计划中，当你在传递来回的对话历史时，令牌限制消失得非常快。

查看原文

I'm hitting a wall with model context management. On free or lower-tier API plans, those token limits vanish incredibly fast when you're passing back-and-forth conversation history.

问HN：在切换大型语言模型以节省令牌时，你是如何管理上下文丢失的？