agent-evaluation

openclaw

Testing and benchmarking for LLM agents, including behavioral testing, capability assessment, and reliability metrics.

Install

openclaw install @rustyorb/agent-evaluation
