select between over 22,900 AI Tool and 17,900 AI News Posts.
A new physics benchmark called "CritPt" puts leading AI models to the test at the level of early-stage PhD research. The results show that even top systems like Gemini 3 Pro and GPT-5 still fall far short of acting as autonomous scientists.
The article Gemini 3 Pro and GPT-5 still fail at complex physics tasks designed for real scientific research appeared first on THE DECODER.