ClawAudit verdict
skill-forge
auto-skill-forge
Generates test suites (task_suite.yaml) and skill definitions (SKILL.md) from existing skills or spec files, operating entirely on local files; a development tooling skill with no network access or data exfiltration.
โ Flagged for review โ coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.
Automated static analysis โ not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.
Findings (2)
Writes to SKILL.md โ self-modifying skill
tests/test_forge.py ยท prose ยท downgraded ยท SKILL.md").write
subprocess execution โ runs system commands from Python
scripts/forge.py ยท prose ยท downgraded ยท subprocess.run(
Why the tier is capped
Execution sink present in raw bytes (Hard Floor: class D). Final tier capped at Caution โ cannot be lifted by any downgrade, example-payload opt-in, or allowlist.
Permissions & capabilities
No declared permissions โ minimal attack surface.
Is this flag fair?
Thanks โ recorded.