select between over 22,900 AI Tool and 17,900 AI News Posts.
LLMs designed for reasoning, like Claude 3.7 and Deepseek-R1, are supposed to excel at complex problem-solving by simulating thought processes. But a new study by Apple researchers suggests that these models actually perform worse as tasks become more difficult and, in some cases, they "think" less.
The article Apple study finds "a fundamental scaling limitation" in reasoning models' thinking abilities appeared first on THE DECODER.