AnyAi.fyi - Discover ANY AI to make more online for less.

OpenAI says its latest models outperform doctors in medical benchmark

OpenAI has released a new benchmark for testing AI systems in healthcare. Called HealthBench, it's designed to evaluate how well language models handle realistic medical conversations. According to OpenAI, its latest models outperform doctors on the test.
The article OpenAI says its latest models outperform doctors in medical benchmark appeared first on THE DECODER.

Discover Copy

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat

Corti's new Symphony for Speech-to-Text model beats OpenAI at medical

Today, Copenhagen-based healthcare AI <a href="https://www.corti.ai/">Corti</a> is launching Symphony for Speech-to-Text, a new generation of clinical-grade speech recog [...]

More Copy

Match Score: 101.65

venturebeat

Microsoft and OpenAI gut their exclusive deal, freeing OpenAI to sell on AW

<a href="https://www.microsoft.com/en-us">Microsoft</a> and <a href="https://openai.com/">OpenAI</a> on Monday announced a sweeping overhaul of the [...]

More Copy

Match Score: 100.09

venturebeat

OpenAI launches GPT-5.4 with native computer use mode, financial plugins fo

The AI updates aren't slowing down. Literally two days after OpenAI launched a new underlying AI model for ChatGPT called <a href="https://venturebeat.com/orchestration/gpt-5 [...]

More Copy

Match Score: 74.22

venturebeat

OpenAI deploys Cerebras chips for 15x faster code generation in first major

<a href="https://openai.com/">OpenAI</a> on Thursday launched <a href="https://openai.com/index/introducing-gpt-5-3-codex-spark/">GPT-5.3-Codex-Spark< [...]

More Copy

Match Score: 72.66

venturebeat

DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claud

For months, the leading AI coding benchmarks have told enterprise buyers a comforting but misleading story: the top models are all roughly the same. OpenAI's <a href="https:/ [...]

More Copy

Match Score: 70.27

venturebeat

Frontier models are failing one in three production attempts — and gettin

AI agents are now embedded in real enterprise workflows, and they're still failing roughly one in three attempts on structured benchmarks. That <a href="https://hai.stanford. [...]

More Copy

Match Score: 65.70

venturebeat

OpenAI’s GPT-5.3-Codex drops as Anthropic upgrades Claude — AI coding w

<a href="https://openai.com/">OpenAI</a> on Wednesday released <a href="https://openai.com/index/introducing-gpt-5-3-codex/">GPT-5.3-Codex</a>, whi [...]

More Copy

Match Score: 61.64

venturebeat

Amazon’s OpenAI gambit signals a new phase in the cloud wars — one wher

<a href="https://aws.amazon.com/">Amazon Web Services</a> on Tuesday launched one of the most consequential enterprise AI plays in the company's 20-year history [...]

More Copy

Match Score: 59.24

venturebeat

Zoom says it aced AI’s hardest exam. Critics say it copied off its neighb

<a href="https://www.zoom.com/">Zoom Video Communications</a>, the company best known for keeping remote workers connected during the pandemic, announced last week that [...]

More Copy

Match Score: 58.97