REAL-TIME

Live Transcription API
with Real-Time Captions

A live transcription API delivers real-time speech-to-text conversion during active video meetings -- not after they end. V100's live transcription provides interim results within 300 milliseconds, final results with word-level timestamps, and speaker diarization across 40+ languages. Powered by Deepgram for low-latency streaming and Whisper for enhanced accuracy, V100's transcription runs during the meeting as participants speak, enabling live captions overlay, real-time AI analysis, accessibility compliance, and instant searchable meeting records. This is fundamentally different from post-processing transcription, which only generates text after a recording is complete.

Sub-300ms latency
40+ languages supported
Speaker diarization included