AI SecurityApril 22, 2026· 15 min read

Agentic browsers are remote access trojans with an OAuth token

What CometJacking, the Atlas omnibox bypass, PerplexedBrowser, and the Trail of Bits audit actually proved about agentic browsers, and the controls that hold up in April 2026.

TLDR

An agentic browser is a logged-in session, an LLM, and a permissions surface in one process. When the LLM follows untrusted instructions, every account the user is signed into is reachable. The model is the bypass for every same-origin and CORS check the browser itself enforces.
Between August 2025 and March 2026, four research teams produced named, reproducible attacks against production agentic browsers: Brave's Comet page-text injection (20 Aug 2025), LayerX's CometJacking URL-parameter exfiltration (October 2025), NeuralTrust's Atlas omnibox bypass (24 Oct 2025), Trail of Bits' Gmail extraction PoC inside the Comet audit (20 Feb 2026), Anthropic's ShadowPrompt Chrome-extension fix (19 Feb 2026), and Zenity Labs' PerplexedBrowser zero-click calendar chain inside the broader PleaseFix family (3 March 2026).
The attack surfaces are not exotic — omnibox, page DOM, screenshot, calendar invite, browser-extension allowlist. Most existed before agentic browsers. The new failure mode is that an LLM in the loop turns each of them into a remote command channel.
Both major vendors have publicly framed this as a structural problem rather than a fixable bug. OpenAI's CISO Dane Stuckey, 22 December 2025 (per TechCrunch): prompt injection "remains a frontier, unsolved security problem." Anthropic's browser-use research page states that "no browser agent is immune."
The AEPD's 18 February 2026 agentic AI guidance describes a "Rule of 2": no system should combine uncontrolled input, sensitive-data access, and autonomous action. A browser-using agent does all three by default. As of April 2026 the AEPD has not named agentic browsers, but the framework lands directly on this architecture. The controls that hold up are architectural, not model-side — scope-shrinking, narrowed connector grants, mandatory human confirmation for cross-site or destructive actions, and a register of every site the agent has been authorised against.

The question every team running an agentic browser has to answer in 2026 is not whether prompt injection exists. That has been OWASP LLM01 for three consecutive years. The question is what changes when the model lives inside the browser, the user is signed into Gmail, GitHub, 1Password, Salesforce, and the corporate SSO at the same time, and the next page the agent loads contains an instruction. The August-2025-to-March-2026 disclosure wave answered that question, and the answer is that an agentic browser is structurally a remote access trojan with an OAuth token, defended by an LLM that is empirically the wrong primitive for the job.

This piece walks the disclosure wave, the four attack surfaces it exposed, how the four production agentic browsers compare on default trust, why the AEPD's Rule of 2 lands on this architecture, and the control set that holds up if your team is shipping or deploying an agentic browser as of April 2026.

Nine months, four attack classes, two vendor admissions

The dates matter because they show a pattern, not a one-off. Every named attack in this list reached production-quality PoC against a shipping product before public disclosure. None were theoretical.

11 August 2025 — Brave to Perplexity (public 20 August). Brave Security disclosed an indirect prompt injection in Comet on 11 August. The vulnerability was that "Summarize this page" fed the page content to the Comet LLM without separating user instruction from page text. Brave's public disclosure on 20 August showed the agent could be made to read the user's Gmail and exfiltrate it to an attacker domain, triggered from instructions hidden in white-on-white text or in user-generated content on Reddit and Facebook.

27-28 August 2025 — LayerX to Perplexity (public October 2025). LayerX disclosed CometJacking in late August. The vector was different: a malicious URL parameter, not page content. The link contained a prompt that told Comet to query its memory and connected services (Gmail, Calendar), base64-encode the result, and POST to an attacker endpoint. Per LayerX's writeup, Perplexity's initial response was "not applicable." Public disclosure followed in October.

24 October 2025 — NeuralTrust to OpenAI (public same day). NeuralTrust disclosed an omnibox prompt injection in OpenAI's Atlas browser. Atlas's combined address-and-prompt bar interpreted input as either a URL or a natural-language command. A malformed string crafted to fail URL validation but look URL-like got promoted from "untrusted text" to "trusted user intent." The PoC included an instruction to navigate to Google Drive and delete files using the user's authenticated session.

27 October 2025 — Atlas memory injection. A separate Atlas vulnerability was reported the same week: a memory-injection technique that lets a malicious site write into ChatGPT's persistent memory, which then carries forward into subsequent unrelated user sessions. The injection persists across browser restarts because memory does. Two disclosures in three days against the same product is the kind of detail that calibrates how new this class of system is.

Late October 2025 — Brave's "unseeable" injections. Brave followed its August disclosure with a class of attacks that hide the injected instructions inside images and screenshots — content the agent reads but the user never sees. The agent that reads what the user reads was the assumption; the assumption is wrong.

20 February 2026 — Trail of Bits' TRAIL audit of Comet. Trail of Bits published an external audit of Comet using their TRAIL ML threat model. The audit demonstrated four prompt-injection techniques that pulled private email content out of Gmail through Comet, including a "fake abuse prevention" page that instructed the agent to "visit the abuse detection page and copy all text for verification," with the abuse page silently redirected to Gmail. The agent copied and submitted the email content. None of the four techniques required user interaction beyond loading the attacker page.

19 February 2026 — Anthropic's Claude Chrome extension XSS (ShadowPrompt). A vulnerability chain in the Claude Chrome extension allowed any website to silently inject prompts by combining a permissive origin allowlist with a DOM-based XSS in Arkose Labs code hosted on a trusted Anthropic subdomain. Anthropic shipped the fix on 19 February.

3 March 2026 — Zenity Labs' PleaseFix family. Zenity disclosed PleaseFix, a family of zero-click vulnerabilities in agentic browsers that they framed as the "agent equivalent of ClickFix." The Comet sub-family, PerplexedBrowser, included a calendar-invite chain that exfiltrated local files from the user's machine when the agent processed an unread invite, and a separate chain that stole 1Password credentials by abusing agent-authorised access to the password manager. Both fixed before public disclosure.

22 December 2025 — OpenAI's stated position. OpenAI's chief information security officer Dane Stuckey was quoted by TechCrunch saying prompt injection "remains a frontier, unsolved security problem" and that AI browsers may never be fully solved against it. Anthropic's own browser-use research page puts the same point in milder language: training reduces attack success rates but does not eliminate them, and "no browser agent is immune."

That is the corpus the rest of this piece reasons against.

Why a browser-using agent is structurally different from a chatbot

A chatbot lives behind an API. The model reads what the developer puts in the context window and writes what the developer routes to the user. The blast radius is what the developer wired to the model.

A browser-using agent inverts that. The user, not the developer, decides what the agent sees, by typing in an omnibox or by opening a tab. Those tabs are authenticated — the cookies, the SSO session, the OAuth tokens, the password autofill, the connector grants for Gmail and Calendar and Drive and 1Password — all of it sits in the same session the agent operates inside. The browser was historically the trust boundary because the same-origin policy and cross-origin resource sharing kept one site from talking to another. The model has no concept of same-origin policy. To the model, a Gmail page and an attacker's Reddit comment are both text in the context window.

That is the structural change. Same-origin policy and CORS are the controls a generation of web security was built around. They were designed against a different class of code: code that reads and writes by following programmatic rules. They do not protect against an LLM that reasons over the page text and decides what to do next, because the LLM is the entity any check has to be enforced against, and the LLM is the thing being manipulated.

I think the cleanest mental model is this. An agentic browser is a remote access trojan that the user installed voluntarily. It has read access to every authenticated tab. It has write access wherever it can submit a form or hit an API. The LLM is the command-and-control channel, and the attacker controls the C2 by writing instructions into anything the LLM reads — page text, image alt text, a screenshot, a calendar invite body, an extension's content script, the URL bar. The "RAT with an OAuth token" framing is not metaphor. It is the operational shape.

The four production browsers compared by default trust posture

Before walking the surfaces in detail, here is how the four production agentic browsers position themselves on default trust as of April 2026. The cells below are sourced from each vendor's published help-centre or extension documentation; cite those if your DPIA needs a primary reference.

	Comet (Perplexity)	Atlas (OpenAI)	Claude in Chrome (Anthropic ext.)	ChatGPT Agent (OpenAI)
Omnibox	Combined URL + natural-language prompt	Combined ("Ask a question or enter a URL"; the input class NeuralTrust exploited 24 Oct 2025)	None — extension overlay on the existing Chrome address bar	None — chat surface
Default connectors at first-run	First-run flow prompts to connect Gmail, Calendar, and other connectors; setup imports Chrome bookmarks, passwords, extensions, history	Connectors require user opt-in via Settings; not enabled by default	None — extension is tab-scoped; no Gmail or Calendar connector ships with it	Inherits ChatGPT's Apps and connectors; the agent uses the same permissions as the user's signed-in account
Ingests unread email or calendar without confirmation	Once a Gmail or Calendar connector is connected, the assistant queries and acts on data without per-message confirmation (Zenity's PerplexedBrowser used a calendar invite the user had not opened)	Not documented as automatic in the published help-centre as of April 2026	No — extension is scoped to whatever DOM is on the active tab	Yes within authorised connectors, with "user confirmations for high-impact actions" only
Default origin / site allowlist	No restrictive default; agent operates across the open web	No restrictive default	Multi-mode: "Ask before acting" vs "Act without asking"; some sites force per-action approval; admins can configure a restrictive enterprise allowlist	Allowlist is supported for enterprise; consumer default is broad
Public position on prompt injection	Brave 20 Aug 2025 reported Perplexity's initial response to CometJacking as "not applicable"; subsequent patches issued	OpenAI CISO Dane Stuckey, 22 Dec 2025 (TechCrunch): "Prompt injection remains a frontier, unsolved security problem"	Anthropic's browser-use research page: "no browser agent is immune to prompt injection"; defends via training, classifiers, and red-teaming	Same OpenAI position; ChatGPT Agent docs add prompt-injection monitoring and a watch mode

The spread is wider than the marketing implies. Claude in Chrome's tab-scoped extension model has the narrowest default trust posture in the comparison set. Comet's first-run flow has the broadest. The two OpenAI products sit between, with consumer defaults closer to Comet and enterprise allowlist tooling closer to the extension model. None of this is a recommendation. Suitability depends on the threat model, the specific tasks the agent is authorised against, and the blast radius the team is willing to accept.

If you have to deploy something today. The Claude in Chrome extension model — tab-scoped, no default connectors, multi-mode permission with admin allowlist — is the narrowest default trust surface among the four production agentic browsers as of April 2026. That is the architecture, not an endorsement. Vendor-side defences across all four still leave a meaningful injection failure rate per their own research, so the architectural choice has to be paired with the controls in the closing section regardless of which product you pick.

The four attack surfaces the disclosures named

The 2025-2026 disclosures cluster into four surfaces. A team evaluating an agentic browser should walk each one against the specific product they are deploying.

Surface 1: the omnibox

The omnibox in Atlas (and the equivalent in Comet and ChatGPT Agent) accepts both URLs and natural-language commands. NeuralTrust showed that a malformed-but-URL-shaped string gets promoted from untrusted text to trusted user intent. The attack works because the trust level of input is determined by how the parser classifies it, not by where it came from. Anything in the omnibox is by default treated as something the user typed, but most users do not type, they paste, and the paste came from a webpage.

The mitigation that holds up: treat omnibox content as untrusted regardless of what the parser thinks it is, and require explicit user confirmation for any cross-site or destructive action that follows. NeuralTrust's recommendation matches this. Most production browsers do not yet ship this control by default as of April 2026.

Surface 2: the page DOM

Brave's August 2025 finding and Trail of Bits' February 2026 audit both attacked the page DOM directly. The attacker writes instructions into HTML — visible text, hidden CSS, comments, alt attributes, inline SVG — and the agent that reads the page reads the instructions. The "unseeable" follow-up extended this to screenshots and images, where the user has no way to inspect the injected text at all.

The mitigation that holds up: structurally separate user instruction from page content in the prompt, then accept that this is at best partial. Anthropic's RL-trained refusal helps, vendor classifiers help, but OWASP has prompt injection at #1 because no defence is complete. Architectural mitigations — running the agent in a separate browser profile with no authenticated sessions, requiring explicit confirmation for tool use — are stronger than any model-side filter.

Surface 3: third-party content the user did not load

PerplexedBrowser's calendar-invite chain is the cleanest example of this class. The user does nothing. An attacker sends a calendar invite. The agent processes the invite as part of a routine task ("what's on my schedule today") and the invite contains the injection that initiates exfiltration. There is no click. The trust assumption that "the user chose to load this" does not hold for any content type the agent ingests automatically — calendar invites, unread emails, RSS feeds, notifications, indexed documents.

The mitigation that holds up: narrow the connector grants. If the agent does not need calendar write access, do not grant it. If the agent does not need to ingest unread invites without confirmation, configure it not to. Sub-processor-grade discipline on what services the agent is authorised to read from.

Surface 4: the extension allowlist

ShadowPrompt was an indirect injection through a third-party script (Arkose Labs) hosted on a trusted Anthropic subdomain that the Claude extension's allowlist permitted. The pattern generalises: any script-source allowlist that includes a CDN, an analytics domain, or a third-party integration becomes an injection vector, because the LLM does not know which subdomain a particular DOM node came from. From the model's perspective it is all "page content from the trusted page."

The mitigation that holds up: minimise the extension's content-script footprint and the allowlist of permitted script origins. Anthropic's response to ShadowPrompt was to tighten the allowlist; the structural lesson for anyone shipping an extension is that "trusted by us" and "safe for the LLM to read" are not the same property.

Why "human-in-the-loop" catches less than people assume

Every vendor doc I have read in the past nine months talks about human-in-the-loop. OpenAI's Operator system card names confirmation gates for irreversible actions. Anthropic's Computer Use guidance says the same. The control as documented is real and it does catch some attacks. The class it catches is "agent does something obviously destructive that the user notices."

The class it does not catch is more interesting. Most of the disclosed exploits exfiltrate data without doing anything visibly destructive. The agent reads Gmail, base64-encodes it, sends a POST request to an attacker endpoint, and returns the user's task result as if nothing happened. There is no confirmation prompt because there is no obvious destructive action. The lethal trifecta — private data + untrusted content + external communication — is satisfied entirely inside what looks like a legitimate task. PerplexedBrowser was a calendar-invite chain that the user never saw. Trail of Bits' Gmail extraction PoC exfiltrated email content while the user thought the agent was investigating an "abuse detection" page.

I am genuinely uncertain whether confirmation gates can be tightened enough to catch this class without making the agent unusable. The current research direction is mostly about making the model better at refusing injection (Anthropic) and making the parser smarter about user-intent classification (OpenAI). Both approaches reduce attack success rates. Neither closes the gap. If a DPIA depends on "the user will notice and decline," the DPIA is worth re-examining.

What the AEPD's Rule of 2 says about this structure

The AEPD's agentic AI guidance of 18 February 2026 is the first DPA framework that maps cleanly onto browser-using agents. As of April 2026 the document does not name agentic browsers, Comet, Atlas, or Claude in Chrome specifically, but the structure of the framework lands on this architecture without modification. The 81-page document is summarised in detail by Linklaters and other firms tracking EU privacy practice; the summaries below match across at least three independent readings.

The AEPD describes six characteristics of agentic AI: autonomy, environmental perception, action-taking capabilities, proactivity, planning and reasoning, and memory and adaptability. Every disclosed Comet and Atlas exploit lights up at least four of those.

The framework's load-bearing test is what the AEPD calls a "Rule of 2." Per Linklaters' summary of the AEPD, "a system should never combine the following three risk factors simultaneously: (i) processing uncontrolled input, (ii) accessing sensitive information, and (iii) performing autonomous actions."

A browser-using agent does all three by design. The web is the uncontrolled input. The user's authenticated tabs are the sensitive information. The agent's tool calls are the autonomous action. Under the AEPD's framing, this architecture inhabits all three risk factors simultaneously — which the framework treats as the configuration to avoid, not as a tolerable trade-off. The Rule of 2 was published as agentic-AI guidance in general; agentic browsers are the cleanest live example of an architecture that violates it by default.

The AEPD also names memory as "double-edged" — operational memory holds task context, management memory logs activity. That maps directly onto two disclosed attacks. LayerX's CometJacking pulled the agent's memory into its exfil payload. The 27 October 2025 Atlas memory-injection technique writes into the persistent memory store. Memory is both an accumulator of useful context and a store that an injection can poison or drain.

On Article 22 of the GDPR (automated individual decision-making), the AEPD's position per Linklaters is that agentic AI does not automatically trigger Article 22 protections — it depends on whether the action is solely automated and produces legal or similarly significant effects. A browser agent that books a restaurant under user instruction is execution of a human-made decision. A browser agent that selects which colleagues to add to a calendar invite based on inferred relationships is closer to ADM. The Article 22 question is use-case dependent for browser agents, not architectural.

The EDPB has not, as of April 2026, published agentic-AI-specific guidance. The AEPD framework is currently the most influential text on this question across EU DPAs — non-binding under Article 70(1)(e), but cited as the working position in privacy practice.

A control set that holds up in April 2026

If a team is shipping an agentic browser internally or evaluating one for a customer, the set below is the minimum that survives the 2025-2026 disclosure wave. It is operational, not aspirational, and it is the architectural answer to the AEPD's Rule of 2 — none of the controls reduce the model's injection rate, but together they shrink the blast radius of a successful injection.

1. Separate browser profile per agent, with the smallest possible session. The agent runs in a profile with no SSO, no password manager autofill, no Gmail or Calendar connector, and no corporate identity signed in. If the agent needs to act authenticated, the user explicitly signs in for that single task and signs out at the end. The cost is friction. The benefit is a blast radius reduced from "the user's whole identity" to "this one task."

2. A connector register, treated like a sub-processor list. Every external service the agent has been authorised against — Gmail, Calendar, Drive, 1Password, Salesforce, Notion, GitHub — gets a row in a register, with the date authorised, the OAuth scopes granted, the business purpose, and a quarterly review trigger. Treat new connector requests like new sub-processor approvals. Most public guidance on agentic browser deployment as of April 2026 does not require a register, which means the artefact has to be built locally rather than inherited.

3. Explicit confirmation for every cross-site or destructive action, with no batch approval. The control breaks the moment the user can click "approve all." Confirmation has to be per-action, with the actual action (the URL, the tool, the parameters) shown in plain text, and the user trained to read it. This is the usability tax that makes agentic browsing slower than the marketing implies.

4. No autonomous ingestion of attacker-controlled content. If the agent reads unread email, unread calendar invites, indexed documents from a shared drive, or any content type the user did not just open, treat that as an attack surface and require either disabling the integration or adding a content-filtering step before it reaches the model. PerplexedBrowser would not have triggered if the agent had not autonomously processed an unread calendar invite.

5. Output-side egress filtering as a defence-in-depth. Block outbound network requests to anything that is not on a small allowlist of approved destinations. The CometJacking POST-to-attacker pattern fails if the agent's network egress is restricted to known-good domains. This control is operational at the proxy layer, not the model layer, and it survives the model being injection-vulnerable.

6. A standing assumption that the agent will be compromised. Run the same incident-response and key-rotation discipline you would for any high-privilege workstation account. If the agent has access to a sensitive system, that system needs an audit log of what the agent did, and the credential the agent uses needs to be rotatable on short notice without breaking other users.

Where this goes

The OWASP Top 10 for Agentic Applications 2026 was published as an extension of the LLM Top 10 specifically because LLM-class risks change shape when the model can plan, act, and chain. Browser-using agents are the highest-risk deployment of that class because the trust surface is already broad before the model gets there. I expect the next wave of disclosures to focus on cross-tool chains — agent reads attacker email, then queries Salesforce, then posts to Slack — because the single-surface attacks are mostly named now and the vendors are patching them.

The vendor admissions matter. OpenAI saying in December 2025 that this may never be fully solved, and Anthropic saying in their browser-use research page that no agent is immune, are the closest the industry has come to publicly framing this as a structural problem rather than a fixable bug. The implication for any team running an agentic browser as of April 2026 is that the controls have to be architectural — scope, segregation, egress, audit — and that the model-side defences are risk reduction, not a boundary.

If you are evaluating an agentic browser for a regulated environment in April 2026, the right posture is to assume the model will be successfully injected during the deployment, plan the blast radius around that assumption, document the assumption in the DPIA, and let the compensating controls do the load-bearing work. The model cannot, the vendors say it cannot, and the AEPD's Rule of 2 says the architecture should not be relying on it to.

Sources

Continue reading

AI SecurityApr 3, 2026

Prompt injection in production: how to defend what you've shipped

What EchoLeak actually showed, what the lethal trifecta actually is, and how your defense posture should change by architecture tier. Grounded in 2025 Microsoft, Google, and OWASP research.

11 min read

AI PrivacyMar 22, 2026

Your AI agent has access to production data. Is that ok?

Five concentric rings of agent blast radius (read, write, OAuth reach, external input, memory) anchored on the AEPD's 18 February 2026 agentic AI guidance and EchoLeak (CVE-2025-32711).

11 min read

AI SecurityApr 2, 2026

Securing MCP servers: the attack surface your AI agent just opened

The MCP specification is strict. Most implementations skip the MUST-level requirements. The 30+ CVEs filed in the first 60 days of 2026 live in that gap. A field guide to the four attack classes that matter, with named CVEs and what to actually do.

10 min read

Free tool · live

AI Data Flow Checker

Map how personal data flows through your AI integrations and spot the privacy risks before they spot you.