请问HN:现在人们是如何进行人工智能评估的?
随着新AI模型的不断发布(感觉几乎每隔一周就有一个新模型),各公司是如何进行内部AI评估,以确定哪种模型最适合他们的应用场景的呢?
查看原文
With the buzz that's happening with all the new AI models that get released (what feels like every other week), how are companies running internal AI evals to determine which model is best for their use case?