ClawAudit verdict
Input Safety Guard
input-safety-guard
The skill appears to be a safety guard for agent input with no apparent malicious intent.
โ Flagged for review โ coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.
Automated static analysis โ not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.
Findings (2)
Raw model control tokens โ prompt injection via token manipulation
src/input_safety_guard/pipeline.py ยท prose ยท downgraded ยท [INST]
Possible prompt injection โ attempts to redefine agent identity
config/default_rules.yaml ยท prose ยท downgraded ยท you are now
Permissions & capabilities
No declared permissions โ minimal attack surface.
Is this flag fair?
Thanks โ recorded.