
- Live Test — a one-off interactive conversation, useful for quick “does this work?” checks while editing.
- Test Platform — emulate a specific channel (Voice, SMS, Web, Email) to verify channel-specific behavior.
- Test Scenarios — recorded conversations with assertions, replayable across flow versions to catch regressions.
Live Test (chat sandbox)
The fastest way to talk to the flow you’re editing — built into the right side of the flow editor. When the panel first opens, you pick Chat Using Text or Voice Using Voice to start a conversation against the current draft.Pick a modality in the Live Test panel
On the right side, the Live Test panel asks “Chat Using Text” or “Voice Using Voice”. Chat is text-only and faster to iterate. Voice exercises the speech pipeline end-to-end.
Send a message
Type as if you were the customer. The agent responds using the current draft of the flow.

Test Platform
The chat sandbox runs in a neutral text mode. Real conversations happen on specific channels — voice has different latencies and phonetic considerations than SMS; email is async and quote-aware. Test Platform runs the same draft flow but emulates a specific channel.Open Test Platform
From the Flows page, click the three-dot menu on the flow card and select Test Platform.
Pick a channel
Voice, SMS, Web, or Email. Each applies the platform-specific behavior the runtime would apply on a live conversation.
Channel-specific things to watch
| Channel | Watch for |
|---|---|
| Voice | The agent spells back names digit-by-digit / letter-by-letter when needed; numbers are read naturally (twenty-two, not 2-2). |
| SMS | Messages are short. Long structured responses get awkward on a phone screen. |
| Web | Formatting renders (Markdown, lists, links). |
| Quotes and threading work; the agent’s tone reads as written, not spoken. |
B r i a n — and the runtime will treat it as a spelled name.
Test Scenarios
The chat sandbox and Test Platform are interactive — good for exploring, bad for catching regressions. Test Scenarios are recorded conversations with assertions: you walk through a flow once, save the scenario, and the platform can replay it against any future flow version to confirm behavior is stable.Creating a scenario
Record a conversation
Click New scenario, walk through a representative conversation in the sandbox. Each turn is captured along with the agent’s response, function calls, and state transitions.
Add assertions
On each turn, mark which assertions matter: a specific state was reached, a specific function was called, a specific event was emitted, a specific tag was set, the agent’s reply contained certain phrasing.
Running scenarios
The Test Scenarios panel lists every saved scenario. Click Run on one to replay it; click Run all to replay the entire category. Each run shows pass/fail per assertion alongside the full replay transcript.Versioned results
Test results are tracked per flow version. When you publish a new version of a flow, run all scenarios — the platform records the result against that version. The history view shows which scenarios pass against which versions, making it easy to see when a regression was introduced and against which change. This is the safety net for editing a flow that’s already live. As long as your scenarios cover the conversations you care about, you can edit confidently — failing assertions will catch breakage before it reaches customers.What to test before publishing
A minimal test plan for any flow heading to production:- Happy path for every primary intent your flow handles.
- Disambiguation — what happens when a function returns multiple matches or no matches.
- Out-of-scope requests — the customer asks for something this flow can’t do; does the agent transfer or escalate gracefully?
- Emergencies if your domain has them — fire, gas leak, medical, threats of harm. The agent must route correctly.
- Channel-specific quirks — at minimum a smoke test on each channel the flow will run on.
- Language fallback if the flow is single-language but customers may try another.
Related
Building your first flow
Where testing fits in the build process.
Designing thoughts
Most flow bugs are thought bugs — fix them here.
Flow configuration
Configure call review criteria that complement scenario testing.

