ClawAudit verdict

improvement-evaluator

45
🟠 Risky
Significant concerns — only install if you understand the risks

The skill uses dynamic evaluation to measure the effectiveness of skill improvements, which may pose a risk if not used properly.

⚠ Flagged for review — coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.

Automated static analysis — not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.

45
security
80
transparency
70
maintenance

Findings (5)

Pattern match high

Dynamic __import__('os') — Python OS command execution

interfaces/judges.py · prose · downgraded · __import__("os")

Pattern match high

Accesses sensitive system files

tests/test_judges.py · prose · downgraded · /etc/passwd

Pattern match medium

subprocess execution — runs system commands from Python

interfaces/judges.py · prose · downgraded · subprocess.run(

Pattern match low

Popular HTTP library — network access

interfaces/judges.py · prose · downgraded · got

Pattern match low

Python shutil file operation — copies/moves/deletes files

interfaces/judges.py · prose · downgraded · shutil.rmtree(

Why the tier is capped

Execution sink present in raw bytes (Hard Floor: class D). Final tier capped at Caution — cannot be lifted by any downgrade, example-payload opt-in, or allowlist.

Permissions & capabilities

No declared permissions — minimal attack surface.

dynamic_eval

Is this flag fair?

Check another skill Browse the registry Auditing your own skills or configs? Use the API