/plushcap/analysis/mux/mux-how-to-leverage-ai-models-without-breaking-the-bank

AI models on CPUs: accurate audio transcriptions without breaking the bank

What's this blog post about?

The text discusses how Mux was able to offer auto-generated captions on all videos uploaded for free by leveraging AI models like Whisper and OpenCLIP, which were previously considered expensive. They discovered GGML, an open source tensor library that optimizes CPU utilization, making it cost-effective for deployment. By using smaller model sizes, they achieved acceptable results while keeping costs low. Running inference on CPUs was found to be cheaper than using GPUs or third-party APIs. Mux's existing infrastructure for transcode jobs allowed them to efficiently schedule and run Whisper jobs alongside video transcodes, resulting in free auto captions for all their customers.

Company
Mux

Date published
Jan. 31, 2024

Author(s)
Colton Karoses

Word count
1215

Hacker News points
7

Language
English


By Matt Makai. 2021-2024.