告诉HN:Pangram很容易被Claude击败
我一直在使用Pangram来检测我所阅读的内容是否完全是由人工智能生成的。今天,我想知道如何轻松地绕过这个检测。
我只需向它们展示生成的文本被检测为100%由人工智能生成,就能让它们生成一段“听起来像人类”写的文本。
Claude Sonnet: https://claude.ai/share/28080c8c-5647-43df-9671-91c9f9e46791
有趣的是,ChatGPT 5.4并不会这样做,至少在它使用的默认模型下是这样的:https://chatgpt.com/share/69c6c713-038c-832e-86be-689abd7b7ae1。我猜它可以被破解以实现这一点。
查看原文
I use Pangram all of the time to detect whether what I'm reading is fully AI-generated or not. Today, I wondered how easy it was to defeat it.<p>All I had to do was show them that the text they generated was detected as 100% AI generated to get them to generate a "human-sounding" text snippet.<p>Claude Sonnet: https://claude.ai/share/28080c8c-5647-43df-9671-91c9f9e46791<p>Interestingly, ChatGPT 5.4 won't do it, at least not with the default model it uses: https://chatgpt.com/share/69c6c713-038c-832e-86be-689abd7b7ae1. I'm guessing it can be jailbroken to do it though.