Protocol ready
The test scenario exists and can be run during a demo or pilot.
Benchmark results
Track which vendors have public evidence, which tests are ready to run, and where buyers should ask for proof before trusting a polished demo.
Current matrix
Use this as a buyer checklist and vendor submission target. It is intentionally conservative until repeatable benchmark packets are reviewed.
| Vendor | Latency | Handoff | Booking | Escalation | Noisy caller | Evidence state |
|---|---|---|---|---|---|---|
| Retell AI Developer voice agent platform Benchmark evidence page | Public claims to verify | Test pending | Scenario ready | Needs evidence packet | Test pending | Profiled |
| Vapi Developer voice agent API Benchmark evidence page | Public claims to verify | Implementation dependent | Scenario ready | Needs evidence packet | Test pending | Profiled |
| Telnyx Voice infrastructure | Infrastructure evidence | Implementation dependent | Implementation dependent | Needs workflow proof | Test pending | Architecture mapped |
| Bland AI Enterprise voice AI Benchmark evidence page | Public evidence check | Test pending | Scenario ready | Needs evidence packet | Test pending | Profiled |
| Synthflow No-code enterprise voice AI Benchmark evidence page | Public evidence check | Test pending | Scenario ready | Needs evidence packet | Test pending | Profiled |
| Goodcall AI receptionist Benchmark evidence page | Test pending | Needs evidence packet | Scenario ready | Needs policy proof | Test pending | Profiled |
| Smith.ai Hybrid receptionist | Service dependent | Hybrid model noted | Scenario ready | Needs packet | Service dependent | Profiled |
| Slang AI Restaurant voice AI | Test pending | Needs evidence packet | Restaurant scenario ready | Needs escalation proof | Restaurant audio test pending | Profiled |
Status definitions
The matrix separates public claims, test readiness, missing evidence, and implementation-dependent workflows.
The test scenario exists and can be run during a demo or pilot.
Public claims or docs are available, but standardized evidence is not complete yet.
Voice Agent Index has not published a repeatable benchmark result for that vendor and scenario.
The vendor should provide recordings, transcripts, logs, routing proof, or workflow artifacts.
The result depends heavily on how the buyer or implementation partner configures the workflow.
Vendor evidence
Vendors can submit benchmark evidence packets for the same latency, handoff, booking, escalation, and noisy-caller protocols buyers use during demos.
No. This matrix tracks evidence status and test readiness. Numeric vendor scores should only be added when the same benchmark scenario and evidence packet are applied across vendors.
A vendor can submit demo access, call recordings, transcripts, tool logs, transfer evidence, pricing details, and policy documentation for editorial review.
Buyers need to know which evidence is public, missing, implementation-dependent, or ready to test. The matrix makes gaps visible instead of pretending every profile is equally proven.