OpenAI promises greater transparency on model hallucinations and harmful content

OpenAI has launched a new web page called the safety evaluations hub to publicly share information such as the hallucination rates of its models. The hub will also show whether a model produces harmful content, how well it follows instructions and how it responds to attempted jailbreaks.
The tech company claims this new page will provide additional transparency on OpenAI, a company that, for context, has faced multiple lawsuits alleging it illegally used copyrighted material to train its AI models. Oh, yeah, and it's worth mentioning that The New York Times claims the tech company accidentally deleted evidence in the newspaper's plagiarism case against it.
The safety evaluations hub is meant to expand on OpenAI's system cards. Those only outline a model's safety measures at launch, whereas the hub should provide ongoing updates.
"As the science of AI evaluation evolves, we aim to share our progress on developing more scalable ways to measure model capability and safety," OpenAI states in its announcement. "By sharing a subset of our safety evaluation results here, we hope this will not only make it easier to understand the safety performance of OpenAI systems over time, but also support community efforts to increase transparency across the field." OpenAI adds that it's working to communicate more proactively in this area throughout the company.
"Introducing the Safety Evaluations Hub—a resource to explore safety results for our models. While system cards share safety metrics at launch, the Hub will be updated periodically as part of our efforts to communicate proactively about safety. https://t.co/c8NgmXlC2Y" (OpenAI, @OpenAI, May 14, 2025)
Interested parties can look at each of the hub's sections and see information on relevant models, such as GPT-4.1 through 4.5. OpenAI notes that the information provided in this hub is only a "snapshot" and that interested parties should look at its system cards, assessments and other releases for further details.
One of the big buts to the entire safety evaluation hub is that OpenAI is the entity doing these tests and choosing what information to share publicly. As a result, there isn't any way to guarantee that the company will share all its issues or concerns with the public.
This article originally appeared on Engadget at https://www.engadget.com/ai/openai-promises-greater-transparency-on-model-hallucinations-and-harmful-content-184545691.html?src=rss

