SPEAKER DIARIZATION

Video API with Speaker Diarization

A transcript without speaker labels is a wall of text: you know what was said, but not who said it. V100's speaker diarization API identifies each speaker in a video session and tags every word in the transcript with that speaker's label. The result is a color-coded, searchable, per-speaker transcript that powers meeting analytics, accountability tracking, and intelligent editing. It is powered by Deepgram and Whisper and labels speakers at the word level in sessions with up to 20 speakers.

Up to 20 speakers per session
Word-level speaker labels
Per-speaker analytics
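
To make word-level labels and per-speaker analytics concrete, here is a minimal Python sketch of how a client might post-process diarized output. The record shape (`word`, `start`, `end`, `speaker`) and the helper names `group_into_turns` and `per_speaker_stats` are assumptions made for illustration, not V100's documented response schema or SDK.

```python
from collections import defaultdict

# Hypothetical word-level records; the field names are assumed for this
# sketch and are not taken from V100's documentation.
SAMPLE_WORDS = [
    {"word": "Let's", "start": 0.0, "end": 0.5, "speaker": 0},
    {"word": "begin", "start": 0.5, "end": 1.0, "speaker": 0},
    {"word": "Sounds", "start": 1.5, "end": 2.0, "speaker": 1},
    {"word": "good", "start": 2.0, "end": 2.5, "speaker": 1},
    {"word": "Great", "start": 3.0, "end": 3.5, "speaker": 0},
]


def group_into_turns(words):
    """Merge consecutive words from the same speaker into one turn."""
    turns = []
    for w in words:
        if turns and turns[-1]["speaker"] == w["speaker"]:
            # Same speaker is still talking: extend the current turn.
            turns[-1]["text"] += " " + w["word"]
            turns[-1]["end"] = w["end"]
        else:
            # Speaker changed (or first word): start a new turn.
            turns.append({
                "speaker": w["speaker"],
                "start": w["start"],
                "end": w["end"],
                "text": w["word"],
            })
    return turns


def per_speaker_stats(words):
    """Count words and sum spoken seconds for each speaker."""
    stats = defaultdict(lambda: {"words": 0, "seconds": 0.0})
    for w in words:
        entry = stats[w["speaker"]]
        entry["words"] += 1
        entry["seconds"] += w["end"] - w["start"]
    return dict(stats)


if __name__ == "__main__":
    for turn in group_into_turns(SAMPLE_WORDS):
        print(f"Speaker {turn['speaker']} "
              f"[{turn['start']:.1f}s-{turn['end']:.1f}s]: {turn['text']}")
    print(per_speaker_stats(SAMPLE_WORDS))
```

Because every word carries its own speaker label, the turn view and the talk-time totals both come from the same flat list, and neither needs a second pass over the audio.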