ClawAudit verdict

llm-tester

🟢 Trusted

Low risk — reviewed by ClawAudit, behavior matches stated purpose

The skill appears to be a legitimate tool for testing LLMs, with no clear evidence of malicious behavior.

⚠ Flagged for review — coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.

Automated static analysis — not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives.

security

transparency

maintenance

Permission integrity

Installs packages at runtime — transitive dependencies are not auditable

package_install

Findings (1)

Pattern match · low

Python os.environ.get — reads environment variable

scripts/llm_benchmark.py · prose · downgraded · os.environ.get(
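
For context, the flagged pattern is a plain standard-library call. The snippet below is a minimal sketch of the kind of code that would trigger this match, not the contents of scripts/llm_benchmark.py. The helper name read_setting and the variable name LLM_API_KEY are hypothetical, since the report does not say which variable the script reads.

```python
import os


def read_setting(name: str, default: str = "") -> str:
    """Return the value of an environment variable, or the default if it is unset."""
    # os.environ.get never raises on a missing key; it returns the default instead.
    return os.environ.get(name, default)


# Hypothetical variable name, used here for illustration only.
api_key = read_setting("LLM_API_KEY")
has_key = bool(api_key)
```

Reading an environment variable is common for configuration such as API keys and endpoints, which is consistent with the low rating. When reviewing the script yourself, it may be worth checking which variables are read and where their values are sent.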

Permissions & capabilities

No declared permissions — minimal attack surface.

package_install
