Evaluate multiple outputs efficiently with 27 built-in metrics
Run evaluation on multiple outputs in a single request. Ideal for batch processing, dataset evaluation, and regression testing. Uses the same 27 built-in metrics as the single evaluation endpoint.Documentation Index
Fetch the complete documentation index at: https://playgent.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
playval RAG: answer_relevancy, faithfulness, contextual_precision,
contextual_recall, contextual_relevancy Safety: bias, toxicity,
non_advice, misuse, pii_leakage, role_violation Agentic:
task_completion, tool_correctness, argument_correctness,
step_efficiency, plan_adherence, plan_quality Multi-Turn:
turn_relevancy, role_adherence, knowledge_retention,
conversation_completeness, goal_accuracy, tool_use, topic_adherence,
turn_faithfulness, turn_contextual_precision, turn_contextual_recall Or
use custom scorer IDs from Create Custom
Scorerrunning, completed, failed