Grok 4 edges out GPT-5 in complex reasoning benchmark ARC-AGI

In the ARC-AGI-2 benchmark, which is designed to measure a language model's general reasoning skills, GPT-5 (High) scored 9.9 percent at a cost of $0.73 per task, according to ARC Prize.
The article "Grok 4 edges out GPT-5 in complex reasoning benchmark ARC-AGI" appeared first on THE DECODER.

We found items similar to what you are looking for. Check out our suggestions:

venturebeat
Grok 4.1 Fast's compelling dev access and Agent Tools API overshadowed

Elon Musk's frontier generative AI startup xAI formally opened developer access to its Grok 4.1 Fast models (https://x.ai/news/grok-4-1-fast) last n [...]

Match Score: 328.95

venturebeat
xAI launches Grok 4.3 at an aggressively low price and a new, fast, powerfu

While Elon Musk faces off against his former colleague and OpenAI co-founder Sam Altman in [...]

Match Score: 214.95

venturebeat
Musk's xAI launches Grok 4.1 with lower hallucination rate on the web

In what appeared to be a bid to soak up some of Google's limelight prior to the [...]

Match Score: 199.97

venturebeat
Microsoft built Phi-4-reasoning-vision-15B to know when to think — and wh

Microsoft (https://www.microsoft.com/en-us) on Tuesday released [...]

Match Score: 183.57

venturebeat
OpenAI launches GPT-5.4 with native computer use mode, financial plugins fo

The AI updates aren't slowing down. Literally two days after OpenAI launched a new underlying AI model for ChatGPT called [...]

Match Score: 176.08

venturebeat
AI IQ is here: a new site scores frontier AI models on the human IQ scale.

For decades, the IQ test has been one of the most familiar, and most contested, yardsticks for human intelligence. Now, a startup project called [...] (https://www.aiiq.org/)

Match Score: 153.13

venturebeat
Musk's xAI launches Grok Business and Enterprise with compelling vault

xAI has launched Grok Business and Grok Enterprise (https://x.ai/news/grok-business), positioning its flagship AI assistant as a secure, team-ready platform [...]

Match Score: 149.59

venturebeat
Samsung AI researcher's new, open reasoning model TRM outperforms mode

The trend of AI researchers developing new, [...]

Match Score: 137.58

venturebeat
DeepSeek-V4 arrives with near state-of-the-art intelligence at 1/6th the co

The whale has resurfaced. DeepSeek, the Chinese AI startup offshoot of High-Flyer Capital Management quantitative analysis firm, became a [...]

Match Score: 134.18