POST /v1/experiments
curl -X POST https://api.playgent.com/v1/experiments \
  -H "Authorization: Bearer your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "System Prompt v2 Test",
    "agent_id": "agent_abc123",
    "variants": [
      {
        "name": "control",
        "config": { "system_prompt": "You are a helpful agent..." }
      },
      {
        "name": "treatment",
        "config": { "system_prompt": "You are a customer support agent for Acme Corp..." }
      }
    ],
    "test_case_ids": ["tc_1", "tc_2", "tc_3"],
    "metrics": ["relevance", "faithfulness", "latency_ms"],
    "config": {
      "runs_per_variant": 10,
      "randomize": true
    }
  }'
{
  "experiment_id": "exp_yza567",
  "status": "completed",
  "results": {
    "control": {
      "runs": 10,
      "pass_rate": 0.70,
      "avg_scores": { "relevance": 0.85, "faithfulness": 0.82 },
      "avg_latency_ms": 2340
    },
    "treatment": {
      "runs": 10,
      "pass_rate": 0.90,
      "avg_scores": { "relevance": 0.91, "faithfulness": 0.94 },
      "avg_latency_ms": 2180
    }
  },
  "statistical_analysis": {
    "winner": "treatment",
    "confidence": 0.95,
    "p_value": 0.023,
    "effect_size": {
      "pass_rate": "+28.6%",
      "faithfulness": "+14.6%"
    }
  },
  "recommendation": "Treatment variant shows statistically significant improvement. Recommend deploying."
}

Creates an A/B experiment that compares two or more agent configurations (variants) on the same test cases and returns per-variant results, a statistical analysis, and a recommendation.
Request body

name (string, required): Experiment name
agent_id (string, required): Base agent for the experiment
variants (array, required): Experiment variants (minimum 2)
test_case_ids (array, required): Test cases to run in the experiment
metrics (array, required): Metrics to measure (e.g., "relevance", "faithfulness", "latency_ms")
config (object): Experiment configuration
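
The required fields and the two-variant minimum listed above can be checked on the client before the request is sent. The following is a minimal sketch in Python using only the standard library; `validate_experiment` and `build_request` are hypothetical helper names, not part of any Playgent SDK, and the URL and headers are taken from the curl example. The request is built but not sent.

```python
import json
import urllib.request

# Endpoint and headers mirror the curl example; nothing else about the API is assumed.
API_URL = "https://api.playgent.com/v1/experiments"

# Fields the parameter list marks as required.
REQUIRED_FIELDS = ("name", "agent_id", "variants", "test_case_ids", "metrics")


def validate_experiment(payload: dict) -> list:
    """Return a list of problems found in the payload (empty if none)."""
    problems = [
        f"missing required field: {field}"
        for field in REQUIRED_FIELDS
        if field not in payload
    ]
    # The docs state that an experiment needs at least two variants.
    if "variants" in payload and len(payload["variants"]) < 2:
        problems.append("variants must contain at least 2 entries")
    return problems


def build_request(payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request for a new experiment."""
    problems = validate_experiment(payload)
    if problems:
        raise ValueError("; ".join(problems))
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


payload = {
    "name": "System Prompt v2 Test",
    "agent_id": "agent_abc123",
    "variants": [
        {"name": "control", "config": {"system_prompt": "You are a helpful agent..."}},
        {"name": "treatment", "config": {"system_prompt": "You are a customer support agent for Acme Corp..."}},
    ],
    "test_case_ids": ["tc_1", "tc_2", "tc_3"],
    "metrics": ["relevance", "faithfulness", "latency_ms"],
    "config": {"runs_per_variant": 10, "randomize": True},
}

request = build_request(payload, "your-api-key")
# To send it for real: urllib.request.urlopen(request)
```

Validating locally only catches the constraints the page documents; the server may enforce others, so its error responses should still be handled.
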
Response

experiment_id (string, required): Experiment identifier
status (string, required): One of pending, running, completed
results (object): Results per variant (when completed)
statistical_analysis (object): Statistical analysis (when completed)
recommendation (string): Human-readable recommendation
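
Because `results` and `statistical_analysis` are only present once `status` is `completed`, client code should check the status before reading them. The sketch below does that and also recomputes the `effect_size` figures from the example response. It assumes those figures are relative changes over the control variant, which is inferred from the example numbers (0.70 to 0.90 gives +28.6%) and is not stated by the API; `relative_change` and `summarize` are hypothetical helper names.

```python
def relative_change(control: float, treatment: float) -> str:
    """Format the change from control to treatment as a signed percentage."""
    return f"{(treatment - control) / control * 100:+.1f}%"


def summarize(response: dict, baseline: str = "control"):
    """Return per-variant relative changes versus the baseline.

    Returns None while the experiment is still pending or running,
    since results are only populated when status is "completed".
    """
    if response.get("status") != "completed":
        return None
    results = response["results"]
    base = results[baseline]
    summary = {}
    for name, variant in results.items():
        if name == baseline:
            continue
        changes = {"pass_rate": relative_change(base["pass_rate"], variant["pass_rate"])}
        for metric, score in variant["avg_scores"].items():
            changes[metric] = relative_change(base["avg_scores"][metric], score)
        summary[name] = changes
    return summary


# Trimmed copy of the example response from this page.
example = {
    "experiment_id": "exp_yza567",
    "status": "completed",
    "results": {
        "control": {
            "runs": 10,
            "pass_rate": 0.70,
            "avg_scores": {"relevance": 0.85, "faithfulness": 0.82},
        },
        "treatment": {
            "runs": 10,
            "pass_rate": 0.90,
            "avg_scores": {"relevance": 0.91, "faithfulness": 0.94},
        },
    },
}

summary = summarize(example)
print(summary["treatment"]["pass_rate"])     # +28.6%
print(summary["treatment"]["faithfulness"])  # +14.6%
```

Both printed values match the `effect_size` entries in the example response, which supports the relative-change reading but does not confirm it.
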