Apple study finds "a fundamental scaling limitation" in reasoning models' thinking abilities

LLMs designed for reasoning, like Claude 3.7 and DeepSeek-R1, are supposed to excel at complex problem-solving by simulating thought processes. But a new study by Apple researchers suggests that these models actually perform worse as tasks become more difficult and, in some cases, even "think" less.
The article Apple study finds "a fundamental scaling limitation" in reasoning models' thinking abilities appeared first on THE DECODER.

