Agentic CLEAR: Automated Multi-Level Evaluation Framework for LLM Agents

ai-technology · 2026-05-23

Agentic CLEAR is an automatic evaluation framework for agentic systems that addresses the difficulty of monitoring autonomous agent actions. It generates textual insights at three levels of granularity: system, trace, and node. It operates above the observability layer, is designed for easy integration, and includes an intuitive UI. Experiments on four benchmarks and seven agentic settings, involving tens of thousands of LLM calls, indicate that it provides high-quality, data-driven feedback.
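To make the three levels concrete, here is a minimal sketch of how node-level findings could roll up into trace-level and system-level views. It is an illustration only and not Agentic CLEAR's API: the names `Node`, `Trace`, and `system_summary` are hypothetical, the issue strings are hand-written rather than LLM-generated, and plain counting stands in for whatever aggregation the framework actually performs.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Node:
    """One agent step (hypothetical), e.g. an LLM or tool call, with free-text issues."""
    name: str
    issues: list[str] = field(default_factory=list)


@dataclass
class Trace:
    """One end-to-end agent run (hypothetical), made of ordered nodes."""
    trace_id: str
    nodes: list[Node]

    def issues(self) -> list[str]:
        # Trace-level view: every issue raised by any node in this run.
        return [issue for node in self.nodes for issue in node.issues]


def system_summary(traces: list[Trace]) -> list[tuple[str, int]]:
    """System-level view: number of traces each issue appears in, most common first."""
    counts = Counter()
    for trace in traces:
        counts.update(set(trace.issues()))  # count each issue once per trace
    return counts.most_common()


traces = [
    Trace("t1", [Node("plan", ["ignored user constraint"]), Node("search")]),
    Trace("t2", [Node("plan", ["ignored user constraint"]),
                 Node("search", ["malformed tool arguments"])]),
]
print(system_summary(traces))
# → [('ignored user constraint', 2), ('malformed tool arguments', 1)]
```

Because issues are free text rather than entries in a fixed taxonomy, new failure types show up in the system summary without any schema change. In this sketch that works only when the strings match exactly.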

Key facts

Agentic CLEAR is an automatic evaluation framework for LLM agents.
It provides insights at system, trace, and node levels.
It operates above the observability layer.
It features an intuitive UI for accessibility.
It was tested on four benchmarks and seven agentic settings.
The experiments involved tens of thousands of LLM calls.
It produces high-quality, data-driven feedback.
It addresses limitations of current tools that rely on static error taxonomies.

Entities

—

Sources

arXiv cs.AI — 2026-05-23