All About Transcription for Real-Time (Live) Audio Streaming

Company

Deepgram

Date Published

Aug. 22, 2022

Author

Keith Lam

Word count

1171

Language

English

Hacker News points

None

URL

deepgram.com/learn/all-about-transcription-for-real-time-audio-streaming

Summary

Real-time streaming transcription involves transcribing live audio into text, with applications such as live captioning for the hearing impaired or enabling machines to understand human speech. The process is similar to pre-recorded transcription but requires different input and output configurations. Key metrics for evaluating real-time streaming transcriptions include Word Error Rate (WER), Word Recall Rate/Word Recognition Rate (WRR), and ASR latency. Deepgram offers several benefits over other ASR providers, including high accuracy, low ASR latency, support for all languages and use-case models, and the choice of cloud or on-premise deployment.