LLMs show a “highly unreliable” capacity to describe their own internal processes
WHY ARE WE ALL YELLING?! Credit: Anthropic Unfortunately for AI self-awareness boosters, this demonstrated ability was extremely inconsistent and brittle across repeated tests. The best-performing models in Anthropic’s tests—Opus 4 and 4.1—topped out at correctly…