List scoring jobs
List confidence scoring jobs, optionally filtered by identity context fields (agent, workflow, and tool). Filters are combined with AND logic, so a job is returned only if it matches every filter supplied.
Authorizations
Query Parameters
Filter by agent name.
Filter by agent version.
Filter by agent session ID.
Filter by workflow name.
Filter by workflow version.
Filter by workflow session ID.
Filter by tool identifier.
Filter by tool instance identifier.
Filter by tool execution identifier.
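As a rough illustration of how the filters combine, the sketch below builds a request URL from whichever filters are set. The base URL and the parameter names (`agent_name`, `workflow_name`, `tool_id`) are hypothetical placeholders, not taken from this reference; substitute the real endpoint path and query parameter names.

```python
from urllib.parse import urlencode

# Hypothetical endpoint; replace with the real base URL and path.
BASE_URL = "https://api.example.com/scoring/jobs"


def build_list_url(**filters):
    """Return the list URL containing only the filters that were set.

    Filters left as None are dropped, so they do not narrow the result.
    Every filter that remains must match (AND logic on the server side).
    """
    params = {name: value for name, value in filters.items() if value is not None}
    # Sort for a stable, reproducible query string.
    query = urlencode(sorted(params.items()))
    return f"{BASE_URL}?{query}" if query else BASE_URL


# Hypothetical parameter names, for illustration only.
url = build_list_url(agent_name="support-bot", workflow_name=None, tool_id="search")
```

Calling `build_list_url()` with no arguments returns the bare URL, which corresponds to listing jobs without any filter.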
Response
Successful Response
Unique identifier for this scoring job.
The agent, workflow, and tool context that produced the scored output.
The scoring module used to evaluate the output. Judge-based modules (e.g. correctness, coherence, faithfulness) produce scores on a 0–10 scale. Statistical modules (e.g. observed_consistency, data_grounding) produce scores on a 0.0–1.0 scale.
Current status of the scoring job: submitted, in_progress, completed, failed, or not_run.
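A client may want to validate the status string before acting on it. The following is a minimal sketch, assuming the five values listed above are the complete set. Treating only submitted and in_progress as states that can still change is an assumption made here, not something this reference states.

```python
from enum import Enum


class JobStatus(str, Enum):
    """Status values listed in this reference."""
    SUBMITTED = "submitted"
    IN_PROGRESS = "in_progress"
    COMPLETED = "completed"
    FAILED = "failed"
    NOT_RUN = "not_run"


# Assumption: only these two states can still change on a later poll.
PENDING = {JobStatus.SUBMITTED, JobStatus.IN_PROGRESS}


def is_pending(raw_status):
    """Parse a raw status string and report whether the job may still change.

    Raises ValueError if the string is not one of the listed statuses.
    """
    return JobStatus(raw_status) in PENDING
```

A polling loop could use `is_pending` to decide whether to request the job list again.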
The computed confidence score, or null if the job has not completed. Range depends on the module: 0–10 (integer) for judge-based modules, 0.0–1.0 for statistical modules.
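Because the two module families use different ranges, comparing scores across modules requires putting them on a common scale first. The sketch below rescales judge-based scores to 0.0–1.0. The module sets contain only the examples named in this reference and may be incomplete; the helper name `normalize_score` is illustrative.

```python
# Module names taken from the examples in this reference; the real lists may be longer.
JUDGE_MODULES = {"correctness", "coherence", "faithfulness"}
STATISTICAL_MODULES = {"observed_consistency", "data_grounding"}


def normalize_score(module, score):
    """Map a score onto the 0.0-1.0 range, or return None if there is no score yet."""
    if score is None:
        # The job has not completed, so there is nothing to rescale.
        return None
    if module in JUDGE_MODULES:
        # Judge-based modules report integers on a 0-10 scale.
        return score / 10
    if module in STATISTICAL_MODULES:
        # Statistical modules already report values in 0.0-1.0.
        return float(score)
    raise ValueError(f"unknown scoring module: {module!r}")
```

Raising on an unknown module, rather than guessing its range, avoids silently mixing scales if new modules are introduced.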
Human-readable explanation of the score.