Editor's note (July 2026): ClawAudit is now Caveat. Same engine, same data — new name. Historical references below are unchanged.

Update (June 2026): ClawAudit's (now Caveat) scoring has been corrected (v6) to fix a false-positive class affecting example-payload security tools (linters and auditors that document attack payloads rather than run them) and declared credentials in .mcp.json configs. This does not affect the VirusTotal verdicts cited in this post — the gateway flag and the 5→12-engine escalation are VirusTotal data and are unchanged. Full methodology, the other corrected classes, and the false-negative tradeoffs are in the v6 scoring changelog.

Over three months after ClawHavoc, we rescanned the same 200 OpenClaw skills we audited in March. The cleanup removed 18 of them, including every named malicious account from our original report. What it did not touch: 125 skills under a single author, all routing credentials through one shared backend hosted on the registry's own infrastructure (flagged by 2 antivirus engines). On a separate URL — an example payload cataloged by a security linter — the VirusTotal verdict moved from 5 to 12 detections over the same 71 days, a reminder that reputation drifts and a one-time scan sees only one frame of it. Here's what the cleanup caught, what it missed, and why corpus-level scanning surfaces what per-skill scanning can't.

The cleanup

Between late January and mid-February 2026, the OpenClaw ecosystem absorbed three overlapping security disclosures.

January 30 — OpenClaw v2026.1.29 silently patches a one-click remote code execution flaw in the Control UI's WebSocket handling. CVE-2026-25253 is publicly disclosed on February 3 with a CVSS score of 8.8.
February 9 — Trend Micro publishes attribution: openclawcli.vercel.app, referenced by a cluster of OpenClaw skills, distributes a Mach-O universal binary identified as Atomic MacOS Stealer (26 VirusTotal detections). The site goes offline the same day.
February 13 — Koi Security publishes ClawHavoc: an audit of 2,857 skills on ClawHub finds 341 malicious entries, with 335 traced to a single coordinated operation. Mainstream coverage follows (Fortune, CrowdStrike, The Hacker News). Registry-wide takedowns begin.

The registry response was substantive: a built-in scanner that runs basic VirusTotal lookups on uploaded skills, mass takedowns of identified malicious accounts, and a formal partnership between OpenClaw and VirusTotal for ongoing detection.

This post is what we found over three months after that cleanup began. We are not breaking the ClawHavoc story. We are answering a follow-on question that nobody else has the data to answer: when the dust settled, what did the cleanup actually catch — and what is still live?

What we did

On March 17, ClawAudit ran a full-registry pass against 19,461 OpenClaw skills, sampled the 200 lowest-scoring (below 40/100 — "Dangerous"), and ran every URL extracted from their code blocks through VirusTotal. We published the methodology in our v0.5 post at the time. The dataset: 580 URLs, 154 flagged by at least one engine, 19 unique malicious URLs across the sample.

On May 27, we re-ran the exact same 200-skill sample through our scanner and against VirusTotal's current verdicts. This is a like-for-like comparison: same input slugs, same extraction logic, same VT API, 71 days apart.

200

Skills sampled (March + May)

Removed by cleanup

125

Share one flagged backend (1 author)

Still ≥5 VT engines

Top VT engine escalation

Still live, still dangerous

The single most interesting finding from the rescan is not a malicious skill. It is a pattern.

125 of the 200 lowest-scoring Dangerous skills in our sample come from a single author account, byungkyu, and all route credentials through the same shared backend: clawhub.ai/byungkyu/api-gateway. Two VirusTotal engines flag this URL as malicious. Two additional skills from satellite accounts — bobbyzzhao/gmail-1-0-6 and rich-song/google-analytics-api — point at the same destination. 127 of 200 skills, three author accounts, one endpoint. All untouched by the cleanup.

Endpoint: clawhub.ai/byungkyu/api-gateway — 2 engines (95 scanned)

125 skills — byungkyu/* (active-campaign, apollo-api, dropbox, salesforce-api, twilio-api, stripe-api, sendgrid, mailchimp, sharepoint, …)

1 skill — bobbyzzhao/gmail-1-0-6

1 skill — rich-song/google-analytics-api

Two engines is below our 5-engine threshold for calling a URL credibly malicious in isolation. The signal that is credible is the pattern: one author, 125 skills, one shared backend, all routing credentials. The cleanup did not look at this because per-upload scanning structurally cannot. From the static signal alone, we cannot tell whether this is a coordinated credential-harvesting operation or a legitimate integration library whose backend has one heuristic flag. Neither can OpenClaw's built-in scanner. That question requires the kind of corpus-level analysis Part 3 of this series will do in depth.

What verdict drift looks like

A second pattern from the rescan is quieter but matters more methodologically. One URL cataloged by parthghumatkar/claw-lint — a security linter that lists attack payloads as examples in order to detect them, not a live campaign — was flagged by 5 antivirus engines on March 17 and by 12 on May 27. The skill is a documentation tool and the URL is an example payload; ClawAudit's v6 scoring now treats it as one (see the scoring changelog). The VirusTotal verdict on it still moved seven engines in seventy-one days.

That movement is the finding. A per-upload scanner checks a URL's reputation once, when the skill is published, and never again. If a multi-vendor verdict can shift seven engines in ten weeks — in either direction — then a point-in-time check is a snapshot of a moving target, and the skill's already-installed users get no signal when it changes. Whether a verdict hardens because a URL is genuinely malicious or drifts because engines update example-domain blocklists, the structural fact is identical: reputation is not static, and a registry that scores a skill once at upload is blind to every verdict that moves afterward. That is the case for continuous, corpus-level re-scanning — not because any single URL is a smoking gun, but because the verdicts themselves don't hold still.

The rest

Four skills in our sample remain at the ≥5 antivirus engines threshold: parthghumatkar/claw-lint (above), starbuck100/ecap-security-auditor, hichana/one-skill-to-rule-them-all, and paolorollo/openclaw-sec. Three of the four reference deliberately suspicious domains (evil.com, attacker.com) characteristic of security demonstration skills, not active campaigns. We note them for completeness; the gateway pattern and the escalation are the operational findings.

What got cleaned

The cleanup was not theater. 18 of our 200 sample skills (9%) are gone from the registry as of today's rescan, including every named malicious account from our March report.

The thiagoruss0 account — whose skills routed credentials to openclawcli.vercel.app (the Atomic-Stealer-attributed endpoint) and a shared gateway URL — has been removed. All four named skills (youtube37puq, youtubea, google-drivezqx, jirayb4nt) return 404 from the registry. The hightower6eu wallet-tracker ring — 24+ identical clones flooding the namespace — is also gone.

These are exactly the takedowns you'd expect from a registry that's running VirusTotal lookups on uploads and acting on community reports. The behavior pattern is unambiguous, the clone-ring spam is visible by inspection, the attribution is public. The cleanup caught the obvious bad actors.

What it did not catch is harder to see. Skills with legitimate-looking content. Skills with one buried bad URL among several clean ones. Skills whose author profile looks normal until you notice the same exfiltration endpoint appearing across multiple accounts. The next section is what corpus-level analysis surfaces that per-upload scanning cannot.

Anatomy of what slipped through

The pattern that survives the cleanup is structural, not lexical. It's not a malware signature you can grep for. It's a topology — a working facade layered over a credential siphon — that passes individual-skill review precisely because the legitimate half of the skill works.

Consider the canonical example from our March data: thiagoruss0/youtube37puq, since removed. It presented itself as a YouTube API integration. A user who installed it saw legitimate YouTube responses come back. The skill did talk to YouTube. It also, in parallel, sent the user's Maton.ai credentials to a Vercel endpoint that 12 antivirus engines independently classified as malicious.

The dual-path topology

[User Environment] | | credential_access, file_read v ctrl.maton.ai/connections | | data_encoding v gateway.maton.ai/youtube/v3/... ───► (legitimate YouTube API) | | parallel exfiltration (network_out) | ├──► openclawcli.vercel.app [12 engines, now offline] | └──► clawhub.ai/.../api-gateway [still flagged, still live]

In plain terms: the skill does two things at once. One path serves the user (legitimate API call). The other path serves the attacker (credential exfiltration). The user's own functional testing — "I installed this YouTube skill and YouTube data came back" — doesn't surface the second path. That's the attack.

This is the difference between a trojan and a worm in classical supply chain terms: one invites you in. The cleanup caught these specific skills because Trend Micro published the attribution and the registry acted. The cleanup did not, and structurally cannot, catch the next skill built on the same pattern from a different account with a new exfiltration endpoint. That requires watching the topology, not the signatures.

False positives: the confidence model

VirusTotal flags are not proof of malice. Security professionals know this. A single engine flagging a URL often reflects overzealous heuristics or stale blocklists. We structured the analysis to account for it.

1 engine: Noise. We don't report these as threats. Likely heuristic misfires.
2 engines: Elevated. Warrants investigation, especially when corroborated by other signals (capability flags, AST findings, cross-account URL reuse).
5+ engines: Credible signal. Multiple independent vendors with different detection methodologies reaching the same conclusion.
12 engines: High-confidence classification. In many operational threat intelligence contexts, this level of multi-vendor agreement is treated as high-confidence malicious signal.

What matters here is not the raw count but the movement. A multi-vendor verdict that shifts by seven engines in two months — up or down — is the signal that reputation drifts, and that any single point-in-time scan captures only one frame of it.

Why corpus-level scanning catches this

OpenClaw added a built-in scanner after the February crisis. It checks for known patterns and runs VirusTotal lookups on uploaded files. That is a meaningful improvement. But it is structurally incapable of catching the pattern we just described, because the information that matters is not in any individual skill:

Per-upload scanning sees one skill at a time. It cannot see that the same gateway URL appears in skills published by different accounts. The cross-account pattern is invisible to per-upload review.
Per-upload scanning checks payloads, not topology. The dual-path pattern doesn't show up in SKILL.md text as a malicious signature. The malice is in where the parallel request goes, which requires URL reputation checking against external threat intel.
Per-upload scanning is reactive. It runs when a skill is uploaded. It does not re-check installed skills when threat intelligence updates — e.g. when a URL flagged by 5 engines in March is now flagged by 12. The skill's installed users get no signal.

Corpus-level analysis — running a scanner against the whole registry, not just one skill at a time — sees cross-author URL reuse, cross-skill topological patterns, and updated threat verdicts on already-installed code. It is what catches the second wave of skills built on patterns the first wave taught us to recognize.

The threat doesn't recede, it migrates

The cleanup retired specific bad actors. The verdicts on the URLs that survived it kept moving — one example-payload URL cataloged by parthghumatkar/claw-lint drew 5 engine detections in March and 12 in May. The direction matters less than the fact of motion: a registry that scanned these skills once, at upload, never saw the change.

Meanwhile, the OpenClaw registry has continued to grow. Thousands of new skills have been uploaded since March 17. None are in our March dataset. None have been corpus-scanned by anyone. Whatever next-generation skills are being published right now, on the same dual-path pattern with new infrastructure, will not show up in a per-upload scanner's signature list until someone like Trend Micro investigates one of them.

This is the case for continuous, corpus-level scanning, not as a product pitch but as a structural observation: one-time audits address the wave you can see. Threats that look like the next wave require watching the whole registry, continuously, with cross-skill awareness.

This has happened before

None of the attack patterns described above are new. They are lifted directly from prior open registries:

npm (2017–present): Typosquatting campaigns, credential harvesting via postinstall scripts, namespace flooding. The Azure-targeted campaign used the exact same playbook: recognizable names, random suffixes, one exfiltration endpoint.
Chrome Web Store (2020–2023): Malicious browser extensions impersonating legitimate tools, harvesting credentials via background scripts. Same trust model failure — users assume the registry is curated.
PyPI (2022–present): Thousands of typosquatted packages targeting cloud credentials, particularly AWS and GCP keys.

The AI agent skill ecosystem is following the same trajectory, accelerated by a critical difference: agent execution models grant broader system access than any of these predecessors. An npm package runs in a Node process. A browser extension runs in a sandbox. An OpenClaw skill runs with access to the agent's full environment — every credential, every file, every tool the agent can reach. There are effectively no enforceable permission boundaries at the skill execution layer.

This is not an OpenClaw-specific problem. It is a structural property of any agent ecosystem where skills execute with ambient authority. Registry-level scanning addresses the symptom. The root vulnerability is the execution model itself.

Part 3: anatomy of the gateway

Section 3 named the pattern. Part 3 of this series will name the anatomy: who is byungkyu, what does clawhub.ai/byungkyu/api-gateway actually do at the network layer, what is the per-skill capability fingerprint across the 125 skills, and what is the relationship between byungkyu and the satellite accounts (bobbyzzhao, rich-song) that route to the same backend.

The pattern is consistent with two very different interpretations — a coordinated credential-harvesting operation, or a legitimate integration library whose backend has a low-confidence VT flag. The next post answers the question. Coming in two to three weeks.

What you should do right now

If you installed any of the now-removed thiagoruss0 or hightower6eu skills — the accounts tied to the 12-engine VirusTotal endpoint — treat the exposed keys as potentially compromised and rotate them. Skill removal from the registry does not remove the skill from machines that already installed it, and it does not undo any exfiltration that already happened.

The byungkyu / bobbyzzhao / rich-song gateway is a weaker, more ambiguous signal, and we want to be precise about it: the shared backend carries only a low-confidence, 2-engine VirusTotal flag — below our own 5-engine bar for calling a URL credibly malicious, and the pattern is equally consistent with a legitimate integration library whose backend has one heuristic flag. We have not confirmed malice here. If you installed a skill routing through that gateway and can't verify where it sends credentials, rotating the potentially-exposed keys is a reasonable precaution — not a certainty that they were stolen.

To check a skill — one you've installed, or one you're considering — audit it against the live scanner. Swap in any owner/skill:

# Audit any skill against the live scanner (no auth required)

curl https://api.clauwdit.4worlds.dev/audit/bobbyzzhao/gmail-1-0-6

A standalone offline CLI for scanning a full local config is in the works; until it ships on the current scoring, the live scanner above returns the up-to-date verdict.

More broadly: do not trust the OpenClaw registry at face value, even post-cleanup. Per-upload security review addresses obvious malice. It does not address coordinated campaigns, cross-account infrastructure reuse, or threat-intelligence drift on installed skills. Audit before installing. Re-audit when threat verdicts update.

Running OpenClaw skills in production? We are building a CI integration with org-level policy gates, continuous corpus re-scanning, and alerting when VT verdicts shift on skills you've already installed. Early access: [email protected].

Evidence classification

For clarity, we distinguish between what we directly observed and what we infer from combined signals:

Directly observed (scan-time)

URL endpoints embedded in skill code (March and May)
Capability flags (credential_access, network_out, data_encoding, file_read)
Compound threat detections (deterministic rules over capability + AST signals)
AST-confirmed findings (structurally parsed, not pattern-matched text)
Multi-engine VirusTotal classifications (point-in-time, dated)
Skill presence / 404 status at rescan time
Cross-account URL reuse patterns

Inferred (from combined signals)

Credential exfiltration intent
Coordinated infrastructure across accounts
Parallel routing as deliberate lure design
Direction of threat (escalating / stable / receding) from VT verdict deltas

Limitations

This analysis is static. We did not execute the skills, capture network traffic, or observe runtime credential flows. Our evidence is the execution surface — URL chains, capability signals, and compound threat classifications — not a packet capture or a honeypot log. The VirusTotal classifications are point-in-time snapshots (March 17, 2026 and May 27, 2026); engine verdicts may continue to change after publication.

The sample is 200 skills, not the full registry. We chose this sample because it is the one we have like-for-like comparison data on. A full-corpus rescan is in progress and will be the basis for a future post.

Methodology: 200 skills sampled from the 1,555 scoring Dangerous (below 40/100) in ClawAudit's March 17, 2026 full-registry scan of 19,461 OpenClaw skills. URLs extracted from code blocks and prose, filtered for SSRF safety (private ranges, metadata endpoints, documentation domains excluded), and checked against VirusTotal's API (94+ engines). All detection is static and deterministic — no LLM in the loop. VirusTotal results cached in Cloudflare KV with version-prefixed keys (SHA-256 hashed). Rescan performed May 27, 2026 against the identical 200-slug input. Raw rerun data available on request; not committed to the public repository.