Skip to content
agent-evaluation
openclaw

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics.

Install

openclaw install @rustyorb/agent-evaluation

About

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics.