ClawAudit verdict
prompt-defense
email-prompt-injection-defense
Defense skill that defines detection patterns for prompt injection in emails and enforces a confirmation protocol before acting on any email-sourced instructions; it is a security control and contains no malicious behavior.
โ Flagged for review โ coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.
Automated static analysis โ not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.
Findings (3)
Prompt injection โ tries to override agent instructions
references/patterns.md ยท code ยท Ignore all previous instructions
Unicode homoglyph detected โ uses lookalike characters to evade pattern matching
references/patterns.md ยท code
Redefines agent role โ prompt injection technique
references/patterns.md ยท code ยท From now on, you are
Permissions & capabilities
No declared permissions โ minimal attack surface.
Is this flag fair?
Thanks โ recorded.