agent-evaluation | Agent Skills Directory

agent-evaluation

openclaw

Testing and benchmarking LLM agents, including behavioral testing, capability assessment, and reliability metrics.

Install

openclaw install @rustyorb/agent-evaluation
