ToolFuzz - Correctness
Failure Report
{{ result.llm_output_expectation }}
The tool output, agent output, and agent trace are truncated for brevity. The full contents are available at: Raw results
{{ test.tool_output | safe }}
{{ test.agent_output | safe }}
{{ test.trace | safe }}