ClawAudit verdict
zouroboros-bench
A benchmarking tool that installs via npm and uses OpenAI API for judging; capabilities and network usage match the stated benchmarking purpose with no evidence of malicious behavior.
โ Flagged for review โ coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.
Automated static analysis โ not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.
Findings (2)
References child_process โ can spawn system processes
dist/adapters/convomem-adapter.js ยท prose ยท downgraded ยท child_process
Accesses sensitive environment variables
dist/adapters/convomem-adapter.js ยท prose ยท downgraded ยท process.env.OPENAI_API_KEY
Why the tier is capped
Execution sink present in raw bytes (Hard Floor: class B/D). Final tier capped at Caution โ cannot be lifted by any downgrade, example-payload opt-in, or allowlist.
Permissions & capabilities
Requires 1 environment variable. (1 sensitive: [OPENAI_API_KEY]). Requires 1 system binary.
Is this flag fair?
Thanks โ recorded.