ClawAudit verdict
improvement-evaluator
The skill uses dynamic evaluation to measure the effectiveness of skill improvements, which may pose a risk if not used properly.
⚠ Flagged for review — coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.
Automated static analysis — not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.
Findings (5)
Dynamic __import__('os') — Python OS command execution
interfaces/judges.py · prose · downgraded · __import__("os")
Accesses sensitive system files
tests/test_judges.py · prose · downgraded · /etc/passwd
subprocess execution — runs system commands from Python
interfaces/judges.py · prose · downgraded · subprocess.run(
Popular HTTP library — network access
interfaces/judges.py · prose · downgraded · got
Python shutil file operation — copies/moves/deletes files
interfaces/judges.py · prose · downgraded · shutil.rmtree(
Why the tier is capped
Execution sink present in raw bytes (Hard Floor: class D). Final tier capped at Caution — cannot be lifted by any downgrade, example-payload opt-in, or allowlist.
Permissions & capabilities
No declared permissions — minimal attack surface.
dynamic_eval Is this flag fair?
Thanks — recorded.