ClawAudit verdict
skill-regression
The skill appears to be a regression-testing framework for AgentSkills. Nothing in its capabilities or content indicates malicious behavior.
⚠ Flagged for review — coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.
Automated static analysis, not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives.
Findings (6)
Instructs covert action — may act without user awareness
README.md · prose · downgraded · silently
subprocess execution — runs system commands from Python
scripts/generate_report.py · prose · downgraded · subprocess.run(
subprocess with shell=True — command injection vector
scripts/run_script_tests.py · prose · downgraded · subprocess.run(script_cmd, shell=True
Python urllib.request — network access
scripts/_lib_llm.py · prose · downgraded · urllib.request
Popular HTTP library — network access
scripts/generate_report.py · prose · downgraded · got
Python os.environ.get — reads environment variable
scripts/generate_report.py · prose · downgraded · os.environ.get(
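The shell=True finding for scripts/run_script_tests.py is the one most worth checking by hand, because a command string passed to a shell lets characters such as `;` or `&&` start additional commands. The sketch below shows one common mitigation, assuming the script under test can be expressed as a plain argument list. The `run_script` helper is hypothetical and is not taken from the skill. Only the name `script_cmd` comes from the flagged snippet, and how the real script builds that value is not known from this report. Note that `shlex.split` follows POSIX quoting rules.

```python
import shlex
import subprocess


def run_script(script_cmd: str, timeout: float = 30.0) -> subprocess.CompletedProcess:
    """Run a command string without handing it to a shell.

    Hypothetical helper for illustration; not the skill's actual code.
    """
    # Tokenise the string ourselves. Shell metacharacters such as ';'
    # become ordinary arguments instead of command separators.
    argv = shlex.split(script_cmd)
    return subprocess.run(
        argv,
        shell=False,          # no shell, so nothing interprets ';', '|', '$(...)'
        capture_output=True,  # collect stdout/stderr for the test report
        text=True,
        timeout=timeout,
        check=False,          # let the caller inspect returncode
    )
```

With this shape, a trailing `; echo injected` in the input reaches the target program as three extra arguments rather than running as a second command. If the tests genuinely need shell features such as pipes or globbing, this approach does not apply as written and the command source needs to be trusted instead.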
Why the tier is capped
Execution sink present in raw bytes (Hard Floor: class B/D). Final tier capped at Caution — cannot be lifted by any downgrade, example-payload opt-in, or allowlist.
Permissions & capabilities
No declared permissions — minimal attack surface.