Discover ANY AI to make more online for less.

select between over 22,900 AI Tool and 17,900 AI News Posts.


venturebeat
OpenAI admits prompt injection is here to stay as enterprises lag on defenses

It's refreshing when a leading AI company states the obvious. In a detailed post on hardening ChatGPT Atlas against prompt injection, OpenAI acknowledged what security practitioners have known for years: "Prompt injection, much like scams and social engineering on the web, is unlikely to ever be fully 'solved.'"What’s new isn’t the risk — it’s the admission. OpenAI, the company deploying one of the most widely used AI agents, confirmed publicly that agent mode “expands the security threat surface” and that even sophisticated defenses can’t offer deterministic guarantees. For enterprises already running AI in production, this isn’t a revelation. It’s validation — and a signal that the gap between how AI is deployed and how it’s defended is no longer theoretical.None of this surprises anyone running AI in production. What concerns security leaders is the gap between this reality and enterprise readiness. A VentureBeat survey of 100 technical decision-makers found that 34.7% of organizations have deployed dedicated prompt injection defenses. The remaining 65.3% either haven't purchased these tools or couldn't confirm they have.The threat is now officially permanent. Most enterprises still aren’t equipped to detect it, let alone stop it.OpenAI’s LLM-based automated attacker found gaps that red teams missed OpenAI's defensive architecture deserves scrutiny because it represents the current ceiling of what's possible. Most, if not all, commercial enterprises won't be able to replicate it, which makes the advances they shared this week all the more relevant to security leaders protecting AI apps and platforms in development.The company built an "LLM-based automated attacker" trained end-to-end with reinforcement learning to discover prompt injection vulnerabilities. Unlike traditional red-teaming that surfaces simple failures, OpenAI's system can "steer an agent into executing sophisticated, long-horizon harmful workflows that unfold over tens (or even hundreds) of steps" by eliciting specific output strings or triggering unintended single-step tool calls.Here's how it works. The automated attacker proposes a candidate injection and sends it to an external simulator. The simulator runs a counterfactual rollout of how the targeted victim agent would behave, returns a full reasoning and action trace, and the attacker iterates. OpenAI claims it discovered attack patterns that "did not appear in our human red-teaming campaign or external reports."One attack the system uncovered demonstrates the stakes. A malicious email planted in a user's inbox contained hidden instructions. When the Atlas agent scanned messages to draft an out-of-office reply, it followed the injected prompt instead, composing a resignation letter to the user's CEO. The out-of-office was never written. The agent resigned on behalf of the user.OpenAI responded by shipping "a newly adversarially trained model and strengthened surrounding safeguards." The company's defensive stack now combines automated attack discovery, adversarial training against newly discovered attacks, and system-level safeguards outside the model itself.Counter to how oblique and guarded AI companies can be about their red teaming results, OpenAI was direct about the limits: "The nature of prompt injection makes deterministic security guarantees challenging." In other words, this means “even with this infrastructure, they can't guarantee defense.”This admission arrives as enterprises move from copilots to autonomous agents — precisely when prompt injection stops being a theoretical risk and becomes an operational one.OpenAI defines what enterprises can do to stay secureOpenAI pushed significant responsibility back to enterprises and the users they support. It’s a long-standing pattern that security teams should recognize from cloud shared responsibility models.The company recommends explicitly using logged-out mode when the agent doesn't need access to authenticated sites. It advises carefully reviewing confirmation requests before the agent takes consequential actions like sending emails or completing purchases.And it warns against broad instructions. "Avoid overly broad prompts like 'review my emails and take whatever action is needed,'" OpenAI wrote. "Wide latitude makes it easier for hidden or malicious content to influence the agent, even when safeguards are in place."The implications are clear regarding agentic autonomy and its potential threats. The more independence you give an AI agent, the more attack surface you create. OpenAI is building defenses, but enterprises and the users they protect bear responsibility for limiting exposure.Where enterprises stand todayTo understand how prepared enterprises actually are, VentureBeat surveyed 100 technical decision-makers across company sizes, from startups to enterprises with 10,000+ employees. We asked a simple question: has your organization purchased and implemented dedicated solutions for prompt filtering and abuse detection?Only 34.7% said yes. The remaining 65.3% either said no or couldn't confirm their organization's status.That split matters. It shows that prompt injection defense is no longer an emerging concept; it’s a shipping product category with real enterprise adoption. But it also reveals how early the market still is. Nearly two-thirds of organizations running AI systems today are operating without dedicated protections, relying instead on default model safeguards, internal policies, or user training.Among the majority of organizations surveyed without dedicated defenses, the predominant response regarding future purchases was uncertainty. When asked about future purchases, most respondents could not articulate a clear timeline or decision path. The most telling signal wasn’t a lack of available vendors or solutions — it was indecision. In many cases, organizations appear to be deploying AI faster than they are formalizing how it will be protected.The data can’t explain why adoption lags — whether due to budget constraints, competing priorities, immature deployments, or a belief that existing safeguards are sufficient. But it does make one thing clear: AI adoption is outpacing AI security readiness.The asymmetry problemOpenAI's defensive approach leverages advantages most enterprises don't have. The company has white-box access to its own models, a deep understanding of its defense stack, and the compute to run continuous attack simulations. Its automated attacker gets "privileged access to the reasoning traces … of the defender," giving it "an asymmetric advantage, raising the odds that it can outrun external adversaries."Enterprises deploying AI agents operate at a significant disadvantage. While OpenAI leverages white-box access and continuous simulations, most organizations work with black-box models and limited visibility into their agents' reasoning processes. Few have the resources for automated red-teaming infrastructure. This asymmetry creates a compounding problem: As organizations expand AI deployments, their defensive capabilities remain static, waiting for procurement cycles to catch up. Third-party prompt injection defense vendors, including Robust Intelligence, Lakera, Prompt Security (now part of SentinelOne), and others are attempting to fill this gap. But adoption remains low. The 65.3% of organizations without dedicated defenses are operating on whatever built-in safeguards their model providers include, plus policy documents and awareness training.OpenAI's post makes clear that even sophisticated defenses can't offer deterministic guarantees.What CISOs should take from thisOpenAI's announcement doesn't change the threat model; it validates it. Prompt injection is real, sophisticated, and permanent. The company shipping the most advanced AI agent just told security leaders to expect this threat indefinitely.Three practical implications follow:The greater the agent autonomy, the greater the attack surface. OpenAI's guidance to avoid broad prompts and limit logged-in access applies beyond Atlas. Any AI agent with wide latitude and access to sensitive systems creates the same exposure. As Forrester noted during their annual security summit earlier this year, generative AI is a chaos agent. This prediction turned out to be prescient based on OpenAI’s testing results released this week.Detection matters more than prevention. If deterministic defense isn't possible, visibility becomes critical. Organizations need to know when agents behave unexpectedly, not just hope that safeguards hold.The buy-vs.-build decision is live. OpenAI is investing heavily in automated red-teaming and adversarial training. Most enterprises can't replicate this. The question is whether third-party tooling can close the gap, and whether the 65.3% without dedicated defenses will adopt before an incident forces the issue.Bottom lineOpenAI stated what security practitioners already knew: Prompt injection is a permanent threat. The company pushing hardest on agentic AI confirmed this week that “agent mode … expands the security threat surface” and that defense requires continuous investment, not a one-time fix.The 34.7% of organizations running dedicated defenses aren’t immune, but they’re positioned to detect attacks when they happen. The majority of organizations, by contrast, are relying on default safeguards and policy documents rather than purpose-built protections. OpenAI’s research makes clear that even sophisticated defenses cannot offer deterministic guarantees — underscoring the risk of that approach.OpenAI’s announcement this week underscores what the data already shows: the gap between AI deployment and AI protection is real — and widening. Waiting for deterministic guarantees is no longer a strategy. Security leaders need to act accordingly.

Rating

Innovation

Pricing

Technology

Usability

We have discovered similar tools to what you are looking for. Check out our suggestions for similar AI tools.

venturebeat
Red teaming LLMs exposes a harsh truth about the AI security arms race

<p>Unrelenting, persistent attacks on frontier models make them fail, with the patterns of failure varying by model and developer. Red teaming shows that it’s not the sophisticated, complex at [...]

Match Score: 122.07

venturebeat
Prompt Security's Itamar Golan on why generative AI security requires

<p>VentureBeat recently sat down (virtually) with Itamar Golan, co-founder and CEO of <a href="https://prompt.security/">Prompt Security</a>, to chat through the GenAI secu [...]

Match Score: 106.86

venturebeat
GitHub leads the enterprise, Claude leads the pack—Cursor’s speed can

<p>In the race to deploy generative AI for coding, the fastest tools are not winning enterprise deals. A new VentureBeat analysis, combining a comprehensive survey of 86 engineering teams with o [...]

Match Score: 91.98

venturebeat
Anthropic vs. OpenAI red teaming methods reveal different security prioriti

<p>M<!-- -->odel providers want to prove the security and robustness of their models, releasing system cards and conducting red-team exercises with each new release. But it can be difficul [...]

Match Score: 86.08

venturebeat
The 11 runtime attacks breaking AI security — and how CISOs are stopping

<p>Enterprise security teams are losing ground to AI-enabled attacks — not because defenses are weak, but because the threat model has shifted. As AI agents move into production, attackers are [...]

Match Score: 72.46

OpenAI admits prompt injection may never be fully solved, casting doubt on the agentic AI vision
OpenAI admits prompt injection may never be fully solved, casting doubt on

<p><img width="1536" height="1024" src="https://the-decoder.com/wp-content/uploads/2025/10/openai_atlas-1.png" class="attachment-full size-full wp-post-imag [...]

Match Score: 72.31

venturebeat
This new, dead simple prompt technique boosts accuracy on LLMs by up to 76%

<p>In the chaotic world of Large Language Model (LLM) optimization, engineers have spent the last few years developing increasingly esoteric rituals to get better answers. </p><p>We [...]

Match Score: 72.00

venturebeat
TrueFoundry launches TrueFailover to automatically reroute enterprise AI tr

<p>When OpenAI went down in December, one of <a href="https://www.truefoundry.com/">TrueFoundry</a>’s customers faced a crisis that had nothing to do with chatbots or con [...]

Match Score: 67.37

venturebeat
The enterprise voice AI split: Why architecture — not model quality — d

<p>For the past year, enterprise decision-makers have faced a rigid architectural trade-off in voice AI: adopt a &quot;Native&quot; speech-to-speech (S2S) model for speed and emotional f [...]

Match Score: 61.32