ClawAudit verdict
llm-tester
88
🟢 Trusted
Low risk — reviewed by ClawAudit, behavior matches stated purpose
The skill appears to be a legitimate tool for testing LLM models, with no clear evidence of malicious behavior.
⚠ Flagged for review — coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.
Automated static analysis — not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives. Disagree with this verdict? Use Dispute below.
93
security
70
transparency
70
maintenance
Permission integrity
Installs packages at runtime — transitive dependencies are not auditable
package_install
Findings (1)
Pattern match low
Python os.environ.get — reads environment variable
scripts/llm_benchmark.py · prose · downgraded · os.environ.get(
Permissions & capabilities
No declared permissions — minimal attack surface.
package_install Is this flag fair?
Thanks — recorded.