Native tracing, evals, and audit trail
Every span, model call, and tool invocation is logged with user attribution, cost, latency, and policy enforcement. Score outputs against datasets, run evals on a schedule, and export GDPR-aligned audit logs without standing up a separate Langfuse or Helicone deployment.
- Span-level tracing across agent runs
- Auto-scoring and custom evals
- Searchable, exportable audit logs

Example trace spans: classify, retrieve, generate, validate
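To make span-level tracing concrete, here is a minimal sketch in plain Python of what one traced agent run could record. It is not the product's SDK: the `Trace` and `Span` classes, the `user-123` ID, and the cost figure are all hypothetical, chosen only to show user attribution, latency, and cost attached to each of the four spans named above.

```python
import json
import time
from contextlib import contextmanager
from dataclasses import asdict, dataclass, field


@dataclass
class Span:
    """One traced step in an agent run (hypothetical schema)."""
    name: str
    user_id: str
    cost_usd: float = 0.0
    latency_ms: float = 0.0


@dataclass
class Trace:
    """Collects spans for a single agent run, attributed to one user."""
    user_id: str
    spans: list = field(default_factory=list)

    @contextmanager
    def span(self, name):
        record = Span(name=name, user_id=self.user_id)
        start = time.perf_counter()
        try:
            yield record
        finally:
            # Latency is recorded even if the step raises.
            record.latency_ms = (time.perf_counter() - start) * 1000
            self.spans.append(record)

    def total_cost(self):
        return sum(s.cost_usd for s in self.spans)

    def export_jsonl(self):
        """Serialize spans as JSON Lines, one audit record per span."""
        return "\n".join(json.dumps(asdict(s)) for s in self.spans)


trace = Trace(user_id="user-123")  # hypothetical user ID
for step in ("classify", "retrieve", "generate", "validate"):
    with trace.span(step) as s:
        # Placeholder cost: assume only the generate step calls a paid model.
        s.cost_usd = 0.002 if step == "generate" else 0.0

print(trace.export_jsonl())
```

Each exported line carries the span name, the user it is attributed to, its cost, and its latency, which is the kind of record a searchable audit log would be built from.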


