Article
How real-time voice AI handles interruptions
Interruption handling is where voice systems either start to feel human or immediately feel fake.
Independent educational article.
Core idea
When a user interrupts a voice assistant, the system has to detect the interruption, decide whether it is meant for the model, stop its own speech, keep listening, and re-plan the response. Doing that gracefully is one of the hardest parts of real-time voice AI.
Step 1: detect the interruption
The system has to notice that incoming audio matters and is not just noise.
- It must separate target speech from background sound.
- It must recognize overlap fast enough to react.
- Delay makes the interruption feel ignored.
Step 2: decide whether to stop
Not all incoming sound should force a stop. The system has to judge whether the user is redirecting the interaction.
- A false stop makes the assistant feel fragile.
- A false continuation makes it feel rude.
- This is partly a timing problem and partly a relevance problem.
Step 3: adapt the response
Once interrupted, the assistant has to keep listening, reinterpret the user's intent, and continue in a way that feels coherent.
- It may need to discard its previous answer path.
- It may need to acknowledge the interruption naturally.
- The best systems make this feel seamless rather than brittle.
Related pages
Compare interruption stories across products
Use the compare pages to see which systems publicly foreground interruption handling and which focus more on overall assistant experience.