ClawAudit verdict
model-tester
The skill appears to be a testing tool for validating model routing, performance, and output quality, with no suspicious behavior.
โ Flagged for review โ coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.
Automated static analysis โ not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.
Findings (2)
Popular HTTP library โ network access
SKILL.md ยท frontmatter ยท got
subprocess execution โ runs system commands from Python
scripts/model_tester.py ยท prose ยท downgraded ยท subprocess.run(
Why the tier is capped
Execution sink present in raw bytes (Hard Floor: class D). Final tier capped at Caution โ cannot be lifted by any downgrade, example-payload opt-in, or allowlist.
Permissions & capabilities
No declared permissions โ minimal attack surface.
Is this flag fair?
Thanks โ recorded.