What You'll Build

An AI copilot for live sales calls. While you're on the phone with a prospect, the AI listens to both sides of the conversation, analyzes the tone and content in real time, and displays suggestions on your screen: what to say next, how to handle objections, when to close, and when to shut up and listen.

Why This Works

Cold calling is hard. Even experienced salespeople freeze when a prospect throws an unexpected objection. "We already have a vendor for that." "Send me an email." "We don't have budget." In those moments, having the perfect response ready isn't about being manipulative. It's about being prepared.

Sales coaches charge $200-500/hour to listen to your calls and give feedback. That feedback usually comes after the call, when it's too late. This system gives you coaching while the call is still happening.

Prerequisites

Architecture

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  Live Call   │────▢│  Real-Time   │────▢│  AI Coach    β”‚
β”‚  Audio       β”‚     β”‚  Transcriber β”‚     β”‚              β”‚
β”‚              β”‚     β”‚              β”‚     β”‚  - Analyze   β”‚
β”‚  Both sides  β”‚     β”‚  - Deepgram  β”‚     β”‚    sentiment β”‚
β”‚  of convo    β”‚     β”‚  - <500ms    β”‚     β”‚  - Detect    β”‚
β”‚              β”‚     β”‚    latency   β”‚     β”‚    objection β”‚
β”‚              β”‚     β”‚              β”‚     β”‚  - Suggest   β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”˜
                                                  β”‚
                                         β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”€β”€β”
                                         β”‚  Display       β”‚
                                         β”‚                β”‚
                                         β”‚  Live panel:   β”‚
                                         β”‚  "They said    β”‚
                                         β”‚  budget is     β”‚
                                         β”‚  tight. Try:   β”‚
                                         β”‚  [suggestion]" β”‚
                                         β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

Step 1: Capture Call Audio

You need both sides of the conversation in real time. Options:

Option A: Softphone + audio routing If you're using a VoIP system (Aircall, RingCentral, Dialpad), route the audio output to a local capture tool. On Mac, use BlackHole or Loopback to create a virtual audio device that captures system audio.

Option B: Speakerphone + mic The simplest approach: put the call on speaker and use your computer's microphone to capture both sides. Not ideal audio quality, but it works and takes 5 minutes to set up.

Option C: Phone integration Some phone systems offer real-time audio streams via API. Twilio's Media Streams, for example, can pipe live call audio to a WebSocket endpoint.

Step 2: Real-Time Transcription

Speed matters here. A suggestion that arrives 10 seconds late is useless. You need transcription with sub-second latency:

Deepgram is the go-to for this. Their streaming API transcribes audio in real time with ~300ms latency. It also handles speaker diarization (knowing who said what) which is critical for understanding the conversation flow.

Set up a WebSocket connection that sends audio chunks and receives transcript fragments as they come in.

Step 3: Build the Coaching Engine

This is where OpenClaw shines. Feed the rolling transcript to your agent with instructions like:

## Sales Coach Rules
You are a real-time sales coach. You see a live transcript of a cold call.

Your job:
- Detect objections and suggest responses immediately
- Notice buying signals ("that's interesting", "how does pricing work?")
- Flag when the salesperson is talking too much (suggest shutting up)
- Recommend when to go for the close
- Provide specific phrases, not vague advice

Common objections and responses:
- "Send me an email" β†’ "Totally, I'll send something over. Quick question before I do..."
- "We already have a vendor" β†’ "Makes sense. Most of our clients did too. Curious, what would need to change for you to consider an alternative?"
- "No budget" β†’ "I hear you. If budget wasn't a factor, is this something you'd want to explore?"
- "Not interested" β†’ "Fair enough. Just so I know, is it the timing or the concept that doesn't fit?"

Keep suggestions SHORT. The salesperson is on a live call. 1-2 sentences max.

Step 4: The Display

The suggestions need to appear somewhere you can see during the call without it being awkward. Options:

The display should show:

Step 5: Post-Call Analysis

After the call ends, the agent generates a summary:

Over time, this builds a dataset of your calls. You can see patterns: which objection responses actually work, what your average talk-time ratio is on successful calls, and where you consistently lose prospects.

Privacy and Ethics

This is important: always comply with call recording laws.

The AI isn't recording the call for someone else. It's helping you be better at your job in real time. But make sure you're on the right side of local laws.

Real Results

The builder who created this (Jonah) uses it for his agency's cold calling. The real-time suggestions help with:

The biggest win? Confidence. When you know you have backup, you make calls you'd otherwise skip.