Newsletter #30: 🚀Universal-1 Model Launch
This week, AssemblyAI launched Universal-1, their most powerful and accurate Speech-to-Text model to date. Trained on 12.5 million hours of multilingual audio data, it offers improved speaker count estimation, word timestamp estimation, fewer hallucinations, and more accuracy compared to competitors' speech-to-text APIs. The new model can transcribe multiple languages within a single audio file and processes an hour of audio in just 38 seconds. Additionally, AssemblyAI released tutorials on using Universal-1 with Go and Python for transcription tasks and building AI voice bots from scratch.
Company
AssemblyAI
Date published
April 12, 2024
Author(s)
Smitha Kolan
Word count
313
Language
English
Hacker News points
None found.