OpenAI’s new reasoning models see rise in hallucination rates

news image
The o3 model hallucinated 33% of the time in the PersonQA benchmark, up from 16% in o1 and 14.8% in o3-mini…
阅读更多(Read More)

作者 liangfm-2