The hierarchy
Concepts
Trace
One end-to-end unit of work — typically one user request or one agent turn.
Span
A step within a trace — an LLM call, retrieval, tool call, or custom code block.
Session
A series of traces that belong to the same user conversation or workflow.
Eval
An automated score for a trace — correctness, hallucination, safety, etc.
Feedback
A human label attached to a trace — thumbs up/down, score, comment.
Metric
An aggregated view across traces — error rate, p95 latency, cost.
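The relationships between these concepts can be sketched as a small data model. This is an illustration only, not the TruLayer SDK or its schema: every class, field, and function name below is hypothetical, and a session is assumed to be nothing more than the set of traces that share a `session_id`.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Span:
    """One step within a trace, e.g. an LLM call or a tool call."""
    name: str
    kind: str              # "llm", "retrieval", "tool", or "custom"
    duration_ms: float
    error: bool = False


@dataclass
class EvalResult:
    """An automated score for a trace."""
    name: str              # e.g. "correctness"
    score: float


@dataclass
class Feedback:
    """A human label attached to a trace."""
    thumbs_up: Optional[bool] = None
    comment: str = ""


@dataclass
class Trace:
    """One end-to-end unit of work, made up of spans."""
    trace_id: str
    session_id: str        # assumption: sessions are linked by a shared id
    spans: list[Span] = field(default_factory=list)
    evals: list[EvalResult] = field(default_factory=list)
    feedback: list[Feedback] = field(default_factory=list)

    @property
    def failed(self) -> bool:
        """A trace counts as failed if any of its spans errored."""
        return any(s.error for s in self.spans)


def group_by_session(traces: list[Trace]) -> dict[str, list[Trace]]:
    """Group traces into sessions by their session_id."""
    sessions: dict[str, list[Trace]] = {}
    for t in traces:
        sessions.setdefault(t.session_id, []).append(t)
    return sessions


def error_rate(traces: list[Trace]) -> float:
    """A metric: the fraction of traces with at least one failed span."""
    if not traces:
        return 0.0
    return sum(t.failed for t in traces) / len(traces)
```

Evals and feedback hang off the trace rather than the span here because the definitions above attach both to traces; a metric such as `error_rate` is then just a function over many traces.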
How it all fits together
You instrument your app with an SDK. Every LLM call, retrieval, or tool call becomes a span inside a trace. Traces are batched and shipped to the TruLayer ingest API. In the background:
- Evals run against traces and produce scores (stored as eval results).
- Feedback is attached to traces directly from your app or through the dashboard.
- Metrics are computed on a rolling window and surfaced in the dashboard and via the API.
- Failures are clustered and routed to alerting channels you configure.
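The instrumentation flow described above (spans recorded inside a trace, traces buffered and shipped in batches) can be sketched in a few lines. This is a minimal stand-in, not the TruLayer SDK: the `Tracer` class, its methods, and the `send` callback are all hypothetical, and the call to the ingest API is left out because its endpoint and payload format are not given here.

```python
import time
import uuid
from contextlib import contextmanager


class Tracer:
    """Hypothetical tracer for illustration; not the TruLayer SDK."""

    def __init__(self, send, batch_size=2):
        self.send = send              # callable that ships a list of traces
        self.batch_size = batch_size
        self._buffer = []             # finished traces waiting to be shipped
        self._current = None          # the trace currently being recorded

    @contextmanager
    def trace(self, name):
        """Open a trace; spans recorded inside the block attach to it."""
        self._current = {"id": uuid.uuid4().hex, "name": name, "spans": []}
        try:
            yield self._current
        finally:
            # Buffer the trace even if the block raised, then ship a full batch.
            self._buffer.append(self._current)
            self._current = None
            if len(self._buffer) >= self.batch_size:
                self.flush()

    @contextmanager
    def span(self, name, kind):
        """Time one step and record it on the current trace."""
        if self._current is None:
            raise RuntimeError("span() must be used inside trace()")
        record = {"name": name, "kind": kind, "error": False}
        start = time.perf_counter()
        try:
            yield record
        except Exception:
            record["error"] = True    # mark the span as failed, then re-raise
            raise
        finally:
            record["duration_ms"] = (time.perf_counter() - start) * 1000
            self._current["spans"].append(record)

    def flush(self):
        """Ship all buffered traces as one batch."""
        if self._buffer:
            batch, self._buffer = self._buffer, []
            self.send(batch)
```

In use, an application would wrap each request in `with tracer.trace("answer"):` and each step in `with tracer.span("generate", "llm"):`, passing as `send` whatever function posts a batch to the ingest endpoint. Evals, metrics, and failure clustering then run on the receiving side, so none of them appear in this client-side sketch.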