I run a small Ukrainian-granite distribution business called OD Granite in the US. 90 days ago I replaced my planned SDR hires with a voice agent. This is the uncensored operator log. What worked. What broke. What I changed. What I still cannot automate.
Written for the person deciding whether to pay a platform $200-$500 a month or hire a $60K SDR. I made the same decision 90 days ago. Here is what the data says on the other side.
The setup
Platform: Retell for call orchestration. Vapi evaluated, lost on cost + voice quality at the time (Jan 2026).
Voice synthesis: ElevenLabs for Ukrainian and Russian. Retell native for English.
Reasoning: Claude Sonnet via API for conversation logic. Real-time inference, sub-second latency.
Target list: 2,400 US commercial contractors and property managers scraped from public permit data, enriched with firmographics.
Total monthly cost: ~$250/month all in (platform, voice, reasoning, and calling minutes).
What I was going to spend on 2 SDRs: $120K/yr base + ~$30K overhead = $150K fully loaded.
The actual 90-day numbers
Metric
Day 0-30
Day 31-60
Day 61-90
Dials per day
80-120
140-180
160-200
Connect rate
32%
38%
41%
Conversations over 30 seconds
14%
19%
22%
Meetings booked / week
3-5
5-8
6-10
Meetings that actually happened
2-4
4-6
5-8
Qualified pipeline created
$8K
$24K
$42K
Breaks requiring fix
7
5
3
The learning curve is real. First 30 days the agent was mediocre. Day 60 onwards it's consistent enough that I can plan around it.
What worked from day one
Language switching
The biggest unexpected win. Ukrainian granite buyers in the US are often Ukrainian or Russian-speaking contractors themselves. When the agent opens in English, detects a Ukrainian accent, and switches mid-sentence, the conversation depth triples. Close rate on Ukrainian-language calls is about 4x English-language calls.
Budget qualification
Second unexpected win. The agent asks "what's your typical material budget on a project like this" early. Most SDR scripts avoid this because it feels rude. The agent doesn't care. Rude is fine at scale. We disqualify roughly 40% of connects in the first minute on budget mismatch, which saves everybody time.
Voicemail strategy
Agent leaves voicemails on unanswered calls. Short, specific, name-dropped a likely recent permit. Voicemail-triggered callbacks account for about 12% of our booked meetings. Nobody tells you about this in voice agent marketing.
What broke
Breaks requiring fixes, by type (90 days total)
Model update shifted voice persona: 5 times. Sonnet minor version rolls, conversation style drifts, prospects notice. Fix: re-tune opening prompt, takes 20 min.
Platform API token revoked: 3 times. Retell rotates creds on schedule, my cron doesn't catch immediately. Fix: re-auth script, takes 5 min, but calls lose ~2 hours before I notice.
ElevenLabs voice model deprecated: 1 time. The Ukrainian voice I was using got replaced with a newer one. Fix: swap voice ID, re-record IVR intro, 30 min.
Phone number flagged as spam likely: 2 times. Carrier-level spam score changed. Fix: rotate number, takes 10 min via Twilio.
Cron missed because VPS rebooted: 4 times. My $8/month Hetzner box gets security patches, reboots, cron jobs don't auto-restart. Fix: systemd + monitoring, one-time 2-hour setup.
Total time fixing breaks across 90 days: about 14 hours. That's ~9 min per day average. For context, one US-based SDR would cost $60K/yr plus the time I'd spend managing them, coaching them, replacing them when they quit.
What I cannot automate (and do not want to)
Real closes. The agent books meetings. I show up to meetings. Voice agents close 5-10% of demos. Humans close 30-40%. Not a model problem, a trust problem. Humans buy from humans.
Technical product questions. Contractors ask detailed stone-spec questions. The agent doesn't know our inventory deep enough yet. When asked, it hands off to me via SMS. I respond in 2-3 min. Working great.
Referral pitches. When a contractor says "I know another contractor who'd be interested," the agent should ask for the intro. It doesn't. I rewrite this prompt monthly and it still isn't natural. I suspect this requires human flex.
Complaint handling. One contractor called back upset about a misrouted call. The agent tried to apologize and escalate. It was fine, but a human would've been better. Routing all returned / angry calls to me manually now.
The platform comparison (Retell vs Vapi vs Bland)
I tested all three in evaluation. Retell won for my use case in Jan 2026. Your mileage may vary.
Retell: Best conversation latency. Easy Claude integration. Cost at my volume: ~$120/mo.
Vapi: Better phone number handling. Slightly more complex for cross-language. Cost similar.
Bland: Fastest setup, best docs. Cost was higher at my call volume.
All three will work for most B2B use cases. Pick based on your existing stack integrations.
Setup time, honestly
The "one weekend" setup timeline is a lie. Real timeline:
Day 1-2: Platform signup, phone number provisioning, basic English conversation flow. ~8 hours.
Day 3-4: Ukrainian / Russian voice synthesis, accent detection, language switching. ~6 hours.
Day 5-7: Prompt engineering for the actual sales conversation. This is where 70% of the real work is. ~20 hours.
Day 8-14: Watching the first 100 calls, rewriting prompts, fixing edge cases. ~15 hours.