New Punctuation and Casing Model Released
AssemblyAI has significantly improved its speech-to-text features, including punctuation and casing restoration. The company's new model is a multi-class classifier that predicts actions such as adding punctuation or changing casing for each word in the transcription. This transformer-based model architecture yields an accuracy of over 92% for punctuation and casing restoration, trained on over 1 billion tokens. The model performs exceptionally well even with industry-specific language. Punctuation and casing are applied by default to all API requests, making it easy for users to utilize this new feature.
Company
AssemblyAI
Date published
March 2, 2021
Author(s)
Andrew Galyan-Mann
Word count
709
Language
English
Hacker News points
None found.