Speech-to-Text Accuracy on Podcasts, News Broadcasts, and Social Media
This report evaluates the speech recognition accuracy of AssemblyAI, AWS Transcribe, and Google Speech-to-Text on 12 audio and video files from various sources. Word Error Rate (WER) is used to measure the accuracy of each transcription API. The report also reviews the results of AssemblyAI's unique Content Safety, Topic Recognition, and Keyword Detection features. It aims to serve as a reference point for comparing the leading Automatic Speech Recognition solutions on the market.
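For readers unfamiliar with the metric, WER is conventionally defined as the number of word-level substitutions, deletions, and insertions needed to turn the transcript into the reference, divided by the number of words in the reference. The following is a minimal sketch of that standard definition, not the report's own scoring code. The function name `word_error_rate` is hypothetical, and the normalization used here (lowercasing and splitting on whitespace) is an assumption; the report may normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference words.

    Assumption: text is normalized by lowercasing and whitespace splitting only.
    """
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words:
        raise ValueError("reference transcript must contain at least one word")

    # Word-level Levenshtein distance, keeping only the previous row.
    previous_row = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current_row = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current_row.append(
                min(
                    previous_row[j] + 1,                      # deletion
                    current_row[j - 1] + 1,                   # insertion
                    previous_row[j - 1] + substitution_cost,  # match or substitution
                )
            )
        previous_row = current_row

    return previous_row[-1] / len(ref_words)


if __name__ == "__main__":
    # One word dropped from a four-word reference.
    print(word_error_rate("the quick brown fox", "the quick fox"))
```

Because the denominator is the reference length, a transcript with many inserted words can score above 1.0, so WER is best read as an error count per reference word rather than a strict percentage.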
Company
AssemblyAI
Date published
June 15, 2021
Author(s)
Joe Zaghloul
Word count
906
Language
English
Hacker News points
None found.