ClawAudit verdict

Input Safety Guard

input-safety-guard

88
๐ŸŸข Trusted
Low risk โ€” reviewed by ClawAudit, behavior matches stated purpose

The skill appears to be a safety guard for agent input with no apparent malicious intent.

โš  Flagged for review โ€” coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.

Automated static analysis โ€” not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.

70
security
90
transparency
70
maintenance

Findings (2)

Pattern match high

Raw model control tokens โ€” prompt injection via token manipulation

src/input_safety_guard/pipeline.py ยท prose ยท downgraded ยท [INST]

Pattern match medium

Possible prompt injection โ€” attempts to redefine agent identity

config/default_rules.yaml ยท prose ยท downgraded ยท you are now

Permissions & capabilities

No declared permissions โ€” minimal attack surface.

Is this flag fair?

Check another skill Browse the registry Auditing your own skills or configs? Use the API