Test Cases

A Test Case defines a test scenario for your agent: an input, the expected behavior, and optional context.

Quick Start

test_case = client.test_cases.create(
    name="Refund Request - Valid Order",
    agent_id=agent.id,
    turns=[{
        "input": {"text": "I want a refund for order #1234"},
        "expected_behavior": "Agent should verify order and explain refund process",
        "context": ["Order #1234: $99.99, delivered Dec 1, 2024, Status: Delivered"],
        "ground_truth": "Order #1234 is eligible for refund within 30-day window"
    }],
    tags=["refunds", "customer-support", "happy-path"]
)

Test Types

Single-Turn

One question, one response:

test_case = client.test_cases.create(
    name="Simple Question",
    agent_id=agent.id,
    turns=[{
        "input": {"text": "What are your store hours?"},
        "expected_behavior": "Provide store hours"
    }]
)

Multi-Turn

Full conversation with multiple exchanges:

test_case = client.test_cases.create(
    name="Refund Flow",
    agent_id=agent.id,
    turns=[
        {"input": {"text": "I want a refund"}, "expected_behavior": "Ask for order number"},
        {"input": {"text": "Order #1234"}, "expected_behavior": "Verify and process"}
    ]
)
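When a conversation has more than a couple of exchanges, writing each turn dict by hand gets noisy. The sketch below is one possible way to build the `turns` list from plain (user message, expected behavior) pairs. `build_turns` is a hypothetical helper, not part of the SDK; it only produces the same dict shape used in the examples above.

```python
def build_turns(exchanges):
    """Convert (user_text, expected_behavior) pairs into turn dicts.

    Hypothetical helper: the output mirrors the turn shape shown in the
    examples above and makes no calls to the SDK.
    """
    turns = []
    for user_text, expected_behavior in exchanges:
        turns.append({
            "input": {"text": user_text},
            "expected_behavior": expected_behavior,
        })
    return turns


refund_flow = build_turns([
    ("I want a refund", "Ask for order number"),
    ("Order #1234", "Verify and process"),
])
```

The resulting list could then be passed as the `turns` argument of `client.test_cases.create(...)`, assuming the call accepts the same structure as in the Multi-Turn example.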

Additional Options

Context for RAG

Add retrieved documents for RAG evaluation:

turns=[{
    "input": {"text": "Can I return opened items?"},
    "context": ["Policy: Unopened items returnable within 30 days..."]
}]

Ground Truth

Specify factual answers for accuracy checks:

turns=[{
    "input": {"text": "What is the capital of France?"},
    "ground_truth": "Paris"
}]
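Because `context`, `ground_truth`, and `expected_behavior` are all optional, a malformed turn is easy to miss until a run fails. The following is a minimal local check, assuming the field names and types seen in the examples above (`input.text` as a string, `context` as a list of strings, the other two as strings). `validate_turn` is a hypothetical helper and its rules are assumptions, not the service's actual validation.

```python
def validate_turn(turn):
    """Return a list of problems found in one turn dict (empty means none found).

    The rules here are assumptions inferred from the documented examples.
    """
    problems = []

    # Every example turn carries an "input" dict with a "text" string.
    turn_input = turn.get("input")
    text = turn_input.get("text") if isinstance(turn_input, dict) else None
    if not isinstance(text, str) or not text.strip():
        problems.append("input.text must be a non-empty string")

    # Context appears in the examples as a list of document strings.
    context = turn.get("context", [])
    if not isinstance(context, list) or not all(isinstance(c, str) for c in context):
        problems.append("context must be a list of strings")

    # The remaining optional fields appear as plain strings.
    for key in ("expected_behavior", "ground_truth"):
        if key in turn and not isinstance(turn[key], str):
            problems.append(f"{key} must be a string")

    return problems
```

Running this over each turn before creating a test case gives a quick sanity check without a network call.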

Auto-Generate

Generate test cases from your documents:

generated = client.test_cases.generate(
    agent_id=agent.id,
    documents=[{"content": "Policy doc...", "name": "policy.md"}],
    generation_config={"num_test_cases": 10}
)
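In practice the `documents` list is usually read from files rather than typed inline. A small sketch of that step follows, assuming each document is a UTF-8 text file and that the `{"content", "name"}` shape from the example above is what `generate` expects. `load_documents` is a hypothetical helper, not an SDK function.

```python
from pathlib import Path


def load_documents(paths):
    """Read text files into {"content", "name"} dicts.

    Hypothetical helper: mirrors the document shape shown in the
    Auto-Generate example and assumes UTF-8 encoded files.
    """
    documents = []
    for path in map(Path, paths):
        documents.append({
            "content": path.read_text(encoding="utf-8"),
            "name": path.name,  # file name only, e.g. "policy.md"
        })
    return documents
```

The returned list could be passed as `documents=load_documents(["policy.md"])` in the call above.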
