AI voice infrastructure

AI Voice Agents Need Real Phone Numbers: A DID Strategy Guide

Vapi, Retell AI, ElevenLabs Conversational AI, Bland, Synthflow, LiveKit Agents, Pipecat, Cartesia, Deepgram Voice Agent, Vocode — the AI voice platforms have one thing in common: they need real phone numbers, with real STIR/SHAKEN attestation, low-latency regional interconnects, and the ability to scale without per-DID lock-in. The DID layer is now the bottleneck for most AI voice deployments.

2026-04-26 · 7 min read

1. Why AI agents need real DIDs (not just SIP endpoints)

The first mistake teams make: they assume an AI agent only needs a SIP endpoint — some software phone the agent dials into, voice flows through, done. That's true for internal agent testing. The moment you put an AI agent in front of real customers, you need a real phone number.

Customers dial numbers, not URIs. “Call our support agent at sip:[email protected]” isn't going to happen. They dial +1 415 555 0123.
Number portability matters. If your AI vendor changes (Vapi to Retell, Retell to ElevenLabs), you don't want to issue your customers a new phone number.
Regulator and STIR/SHAKEN compliance attaches to the number, not the AI service. You need a DID with proper provenance.
Local presence matters for outbound. An AI agent calling US customers from a +1 415 (San Francisco) number gets answered. From a +44 or +972 number it goes to voicemail.

2. STIR/SHAKEN: the AI-specific spam problem

This is the issue most AI voice teams hit and don't understand. Outbound calls in the US are signed with a STIR/SHAKEN attestation (A, B, or C) by the originating carrier. US mobile carriers (T-Mobile, Verizon, AT&T) increasingly drop or label calls with weak attestation as “Spam Likely” or “Scam Likely.”

Attestation A (full): the carrier verified the caller has the right to use the Caller ID number. This is what you need for sustained delivery.
Attestation B (partial): the caller is known to the carrier but the Caller ID isn't fully verified. Often used in BYOC. Increasingly degraded by mobile carriers.
Attestation C (gateway): origin unverified. Mostly dropped or heavily flagged.

Why this hits AI agents specifically: AI voice platforms typically don't directly own the underlying carrier — they resell or interconnect through one. Many AI platform's bundled DIDs ship with B-attestation by default, which gets your AI agent's calls flagged as spam in production. The user thinks the AI is bad. It's actually fine; the call just isn't getting through.

The fix: use a carrier with verified A-attestation on outbound. DIDHub provisions DIDs with A-attestation and signs every outbound call accordingly — including calls placed via your AI platform's BYOC integration.

3. Latency: the AI-specific killer

Realtime AI voice (sub-200ms agent response) is brittle to network latency. An extra 100ms of SIP egress latency can break the natural-conversation feel that makes AI agents work. The two latency sources you control:

SIP signaling latency — the round-trip between your AI platform and the carrier's SIP infrastructure. Pick a carrier with a Point of Presence in the same region as your AI inference.
RTP media latency — the path the actual audio takes. Same regional rule: ingress / egress as close to the AI inference region as possible.

DIDHub operates SIP/RTP edges in Ashburn (US-East), San Jose (US-West), Dallas, Frankfurt (EU), Amsterdam, Singapore, Tokyo, and Dubai. Your AI platform connects to the closest one; voice traffic stays regional from end-user PSTN gateway to AI inference and back. Typical sub-50ms regional ingress.

4. Scaling: per-DID vs pooled allocations

AI voice apps tend to fall into two scaling patterns:

Per-customer DID

Each customer business gets their own dedicated DID. Their customers see “Acme Plumbing” on caller ID. The DID is a 1:1 with their account. Better trust, better deliverability, more management overhead.

Pooled outbound

A small set of DIDs handles outbound for many customer accounts at high volume. Lower cost per call but more spam-flagging risk — carrier analytics will flag a number that calls 5,000 different mobile numbers in 24 hours.

For most AI voice products, per-customer DID is the right pattern. DIDHub's per-DID pricing ($1.99/mo for US, $0.50/mo for some markets) keeps it economical at scale.

The bulk-allocation API and webhook events (call started / answered / ended) make programmatic provisioning trivial — one POST request per new customer onboarding gives you a DID, attaches it to your AI platform, and pipes the events to your webhook.

5. Three integration patterns

Pattern A: BYOC SIP trunk to your AI platform

You buy DIDs from DIDHub. Your AI platform (Vapi, Retell, ElevenLabs, Bland, Synthflow, LiveKit Agents, Pipecat, Vocode, Cartesia, Deepgram Voice Agent) supports “Bring Your Own Carrier.” You configure DIDHub's SIP credentials in their dashboard. Inbound calls to your DID hit DIDHub's edge, route to your AI platform's SIP endpoint over secure SIP/TLS + SRTP. Outbound the AI platform places goes back through DIDHub.

Pattern B: Webhook-driven, custom voice bot

You use DIDHub's REST API directly. Inbound call hits DIDHub, fires a webhook to your application. You answer with a SIP INVITE response back to your media server (LiveKit, Pipecat, custom). Useful when you've built your own voice agent stack on top of OpenAI Realtime API or Google Vertex Live.

Pattern C: SIP trunk + ATA bridge to legacy

You're integrating an AI agent into an existing PBX (3CX, FreePBX, Asterisk). The AI agent registers as a SIP extension. DIDHub's trunk feeds the PBX. The PBX routes specific call patterns to the AI extension.

6. How to wire DIDHub to your AI platform (concrete steps)

For Pattern A, the process is similar across most AI voice platforms:

Buy DIDs. POST to /api/dids or pick from the DIDHub dashboard. Specify country, area code, and SMS capability if needed.
Get DIDHub SIP credentials. A SIP URI (typically sip.didhub.io) plus authentication (digest or IP allowlist).
Configure BYOC in your AI platform's dashboard. Most platforms have a “Carrier Settings” or “BYO SIP Trunk” page. Paste the DIDHub URI and credentials.
Bind the DID to an AI agent. Tell the AI platform that calls to +1 415 555 0123 should route to your specific assistant configuration.
Test inbound. Dial the DID from a mobile. The AI agent should answer.
Test outbound (if applicable). Trigger the AI agent to make an outbound call. Verify Caller ID, STIR/SHAKEN attestation, and call quality.

For specific platform docs, see DIDHub integrations, or talk to [email protected] for an integration spec specific to your AI stack.

Why this matters

The AI voice space moves fast and the platforms compete on agent quality, latency, and prompt fidelity. The DID layer is becoming the operational quality differentiator: you can have a perfect AI agent and still lose if your DIDs are flagged as spam, your calls have 300ms of SIP latency, or your provisioning loop takes 4 days. Pick the DID layer like you pick the inference: with care, with regional presence, and with attestation that holds up in production.

Ready to get a number?

Pick a DID in 130+ countries from $1.99/month. Activates instantly on most numbers.

Browse numbers Talk to sales