Kaldi Speech Recognition for Beginners - A Simple Tutorial
In this tutorial, we learn how to use the open-source speech recognition toolkit Kaldi in conjunction with Python to automatically transcribe audio files. The process involves several steps including installing Kaldi and its dependencies, creating necessary input files for Kaldi, modifying MFCC configuration file, feature extraction, pre-trained model download and extraction, decoding graph construction, transcription retrieval, and rescoring with LSTM-based model. The tutorial also provides information on how to use the AssemblyAI Speech-to-Text API for easy transcription if Kaldi seems too complex or time-consuming.
Company
AssemblyAI
Date published
Jan. 20, 2022
Author(s)
Ryan O'Connor
Word count
4046
Hacker News points
6
Language
English