Supervised Machine Learning For Beginners |
Patrick Loeber |
Jan. 24, 2022 |
41 |
- |
What is BERT and How Does It Work? |
Misra Turp |
Jan. 24, 2022 |
46 |
- |
DeltaHacks - AssemblyAI at McMaster University Hackathon |
Britney Xiu |
Jan. 24, 2022 |
608 |
- |
Kaldi Speech Recognition for Beginners - A Simple Tutorial |
Ryan O'Connor |
Jan. 20, 2022 |
4046 |
6 |
Recurrent Neural Networks (RNNs) Explained |
Patrick Loeber |
Jan. 18, 2022 |
69 |
- |
Bias and Variance for Machine Learning |
Misra Turp |
Jan. 10, 2022 |
110 |
- |
Best Speech-to-Text Software |
Patrick Loeber |
Jan. 06, 2022 |
83 |
- |
Built with AssemblyAI - Wordcab |
Kelsey Foster |
Jan. 05, 2022 |
314 |
- |
Backpropagation For Neural Networks Explained |
Patrick Loeber |
Jan. 04, 2022 |
67 |
- |
Variational Autoencoders for Dummies |
Ryan O'Connor |
Jan. 03, 2022 |
5608 |
7 |
How to Evaluate Machine Learning Models |
Misra Turp |
Jan. 25, 2022 |
55 |
- |
Unsupervised Machine Learning For Beginners |
Patrick Loeber |
Jan. 26, 2022 |
21 |
2 |
Review - data2vec: A General Framework for Self-supervised Learning in Speech, Vision, and Language |
Guru Rao |
Jan. 26, 2022 |
480 |
- |
Transcribe Twilio Phone Calls in Real-Time with AssemblyAI |
Ryan O'Connor |
Jan. 31, 2022 |
685 |
- |
What is Weight Initialization for Neural Networks? |
Misra Turp |
Jan. 31, 2022 |
85 |
- |
Machine Learning Concepts for Beginners |
Kelsey Foster |
Feb. 01, 2022 |
724 |
- |
Built with AssemblyAI - Rhetoric |
Kelsey Foster |
Feb. 02, 2022 |
204 |
- |
Reinforcement Learning With (Deep) Q-Learning Explained |
Patrick Loeber |
Feb. 02, 2022 |
29 |
- |
Review - Perceiver: General Perception with Iterative Attention |
Dillon Pulliam |
Feb. 04, 2022 |
461 |
- |
Boosting Transcript Readability with Automatic Punctuation and Casing and ITN |
Kelsey Foster |
Feb. 07, 2022 |
1471 |
- |
Top Speaker Diarization Libraries and APIs in 2022 |
Kelsey Foster |
Feb. 08, 2022 |
1893 |
- |
What is Layer Normalization? |
Misra Turp |
Feb. 08, 2022 |
66 |
- |
How to Build a Python Project that Summarizes Your Lectures |
Patrick Loeber |
Feb. 14, 2022 |
140 |
- |
Hyperparameters of Neural Networks |
Misra Turp |
Feb. 14, 2022 |
54 |
- |
Why You Should (or Shouldn't) Be Using JAX in 2022 |
Ryan O'Connor |
Feb. 15, 2022 |
4927 |
66 |
Best APIs for Sentiment Analysis in 2022 |
Kelsey Foster |
Feb. 17, 2022 |
1292 |
- |
What is Gradient Clipping for Neural Networks? |
Misra Turp |
Feb. 21, 2022 |
50 |
- |
Learn How To Get Started with OpenAI API and GPT-3 |
Patrick Loeber |
Feb. 24, 2022 |
85 |
- |
What is Audio Intelligence? |
Kelsey Foster |
Feb. 24, 2022 |
1480 |
- |
BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models |
Taufiquzzaman Peyash |
Feb. 25, 2022 |
311 |
- |
Differentiable Programming - A Simple Introduction |
Ryan O'Connor |
Mar. 02, 2022 |
2445 |
159 |
Announcing Our $28M Series A Led by Accel |
Dylan Fox |
Mar. 04, 2022 |
573 |
- |
How To Convert Voice To Text Using JavaScript |
Patrick Loeber |
Mar. 08, 2022 |
1023 |
- |
Auto-Tweet Your Words Using Speech Recognition in Python |
Mısra Turp |
Mar. 09, 2022 |
1004 |
- |
ASR, NLP, and NLU Tools for Smart Media Monitoring |
Kelsey Foster |
Mar. 11, 2022 |
1131 |
- |
Kaldi Install for Dummies |
Ryan O'Connor |
Mar. 11, 2022 |
1240 |
- |
Built with AssemblyAI - YouTube Transcripts |
Kelsey Foster |
Mar. 15, 2022 |
193 |
- |
Transcribe Audio Files in an S3 Bucket with AssemblyAI |
Ryan O'Connor |
Mar. 15, 2022 |
982 |
- |
Review - ALBERT: A Lite BERT for Self-supervised Learning of Language Representations |
Sergio Ramirez Martin |
Mar. 16, 2022 |
425 |
- |
React Speech Recognition with React Hooks |
Stefan Rosanitsch |
Mar. 16, 2022 |
3047 |
- |
Automate Meeting Notes with Python |
Mısra Turp |
Mar. 17, 2022 |
1317 |
- |
A Beginner's Guide to TorchStudio, The PyTorch IDE |
Ryan O'Connor |
Mar. 28, 2022 |
3387 |
16 |
AssemblyAI Recognized as G2 High Performer, Momentum Leader in Voice Recognition Software for Spring 2022 |
Kelsey Foster |
Mar. 29, 2022 |
230 |
- |
React Text to Speech - Simplified! |
Stefan Rosanitsch |
Mar. 30, 2022 |
867 |
- |
JavaScript Text-to-Speech - The Easy Way |
Stefan Rosanitsch |
Apr. 04, 2022 |
1209 |
1 |
MediaPipe for Dummies |
Ryan O'Connor |
Apr. 07, 2022 |
3559 |
11 |
Top ASR, NLP, and NLU Tools that Power Conversation Intelligence Platforms |
Kelsey Foster |
Apr. 08, 2022 |
1503 |
- |
How to Build a JavaScript Audio Transcript Application |
Stefan Rosanitsch |
Apr. 12, 2022 |
1428 |
- |
Built with AssemblyAI - Real-time Speech-to-Image Generation |
Kelsey Foster |
Apr. 18, 2022 |
481 |
- |
6 Best Named Entity Recognition APIs for Entity Detection |
Kelsey Foster |
Apr. 21, 2022 |
1027 |
- |
How to Optimize Video Editing Platforms with ASR, NLP, and NLU Tools |
Kelsey Foster |
May. 05, 2022 |
1389 |
- |
Building an Intelligent Cloud-based Contact Center? How ASR, NLP, and NLU Tools Can Help |
Kelsey Foster |
May. 11, 2022 |
1581 |
- |
Introduction to Diffusion Models for Machine Learning |
Ryan O'Connor |
May. 12, 2022 |
3048 |
98 |
Building Standout Hybrid Event Solutions with ASR, NLP, and NLU Technology |
Kelsey Foster |
Jun. 03, 2022 |
1754 |
- |
Getting Started with ESPnet |
Ryan O'Connor |
Jun. 06, 2022 |
1714 |
- |
How Leading Customer Research Platforms Leverage ASR, NLP, and NLU Tools |
Kelsey Foster |
Jun. 07, 2022 |
1835 |
- |
Review - Decision Transformer & SPIRAL |
Kevin Zhang, Francis McCann |
Jun. 08, 2022 |
406 |
- |
Hack with AssemblyAI: HawkHacks 2022 |
Kelsey Foster |
Jun. 15, 2022 |
380 |
- |
Review – TOXIGEN & Knowledge Distillation Meets Open-Set Semi-Supervised Learning |
Domenic Donato, Dillon Pulliam |
Jun. 16, 2022 |
437 |
- |
Deep Learning Paper Recap - Streaming ASR and Summarization |
Guru Rao, Sergio Ramirez Martin |
Jun. 22, 2022 |
387 |
- |
How Imagen Actually Works |
Ryan O'Connor |
Jun. 23, 2022 |
6060 |
142 |
Content Moderation: What It Is, How It Works, and the Best APIs |
Kelsey Foster |
Jun. 27, 2022 |
1212 |
- |
Topic Detection in NLP: The Top APIs in 2022 |
Kelsey Foster |
Jun. 28, 2022 |
1268 |
- |
Deep Learning Paper Recaps - Modality Matching and Masked Autoencoders |
Luka Chkhetiani, Ruben Bousbib |
Jul. 27, 2022 |
332 |
- |
Deep Learning Paper Recap - Language Models |
Taufiquzzaman Peyash |
Jul. 07, 2022 |
273 |
- |
Our $30M Series B |
Dylan Fox |
Jul. 14, 2022 |
654 |
- |
AssemblyAI Named a G2 High Performer and Momentum Leader for Summer 2022 |
Kelsey Foster |
Jul. 20, 2022 |
383 |
- |
Creating Top Hiring Intelligence Platforms with ASR, NLP, and NLU Tools |
Kelsey Foster |
Jul. 19, 2022 |
1871 |
- |
Deep Learning Paper Recap - Automatic Speech Recognition |
Gabriel Oexle, Yash Khare |
Aug. 03, 2022 |
396 |
- |
Deep Learning Paper Recap - Transfer Learning |
Michael Liang |
Aug. 10, 2022 |
273 |
- |
Build Your Own Imagen Text-to-Image Model |
Ryan O'Connor |
Aug. 17, 2022 |
6700 |
111 |
Deep Learning Paper Recap - Redundancy Reduction and Sparse MoEs |
Domenic Donato, Kevin Zhang |
Aug. 17, 2022 |
470 |
- |
How to Run Stable Diffusion Locally to Generate Images |
Ryan O'Connor |
Aug. 23, 2022 |
2020 |
8 |
Deep Learning Paper Recap - Diffusion and Transformer Models |
Dillon Pulliam, Sergio Ramirez Martin |
Aug. 24, 2022 |
373 |
- |
How to Automatically Transcribe Zoom Calls in Real-Time |
Ryan O'Connor |
Aug. 31, 2022 |
1151 |
- |
What are the Top PII Redaction APIs and AI Models for 2023? |
Kelsey Foster |
Aug. 30, 2022 |
1321 |
- |
Coming Soon in Fall 2022 at AssemblyAI! |
Kelsey Foster |
Sep. 01, 2022 |
676 |
- |
2022 Benchmark Report |
Lee Vaughn |
Sep. 02, 2022 |
1386 |
- |
AssemblyAI Obtains SOC 2 Type 2 Compliance for 2022/2023 |
Mike Groves |
Sep. 06, 2022 |
462 |
- |
New for Enterprise: Improved Accuracy, Always-on Support, and SOC 2 Type 2 |
Micky Teng |
Sep. 06, 2022 |
982 |
- |
AI Research Review - Spelling and ASR |
Taufiquzzaman Peyash |
Sep. 08, 2022 |
215 |
- |
AI Research Review - Multistream CNN |
Luka Chkhetiani |
Sep. 21, 2022 |
126 |
- |
Introducing the AssemblyAI Creators Program |
Patrick Loeber |
Sep. 14, 2022 |
431 |
- |
AssemblyAI Recognized as G2 High Performer, Momentum Leader for Fall 2022 |
Kelsey Foster |
Sep. 28, 2022 |
251 |
- |
How Aloware Shipped AI-powered Smart Transcription and QA in 6 Weeks |
Kelsey Foster |
Sep. 29, 2022 |
1274 |
- |
Getting Started with Hugging Face's Gradio |
Ryan O'Connor |
Sep. 21, 2022 |
7897 |
1 |
How to Run OpenAI’s Whisper Speech Recognition Model |
Ryan O'Connor |
Sep. 22, 2022 |
3409 |
- |
Transcribe audio or video files right from your terminal |
Francisco Castillo |
Oct. 19, 2022 |
464 |
- |
An Introduction to Poisson Flow Generative Models |
Ryan O'Connor |
Oct. 26, 2022 |
6860 |
53 |
New AI Models to summarize audio and video for any use case |
Ryan O'Connor |
Dec. 02, 2022 |
2244 |
- |
Stable Diffusion 1 vs 2 - What you need to know |
Ryan O'Connor |
Dec. 06, 2022 |
2527 |
3 |
DeepMind's AlphaTensor Explained |
Marco Ramponi |
Nov. 22, 2022 |
3520 |
5 |
Stable Diffusion in Keras - A Simple Tutorial |
Ryan O'Connor |
Nov. 30, 2022 |
1913 |
- |
AI for product managers: Today’s top terms to stay in the know |
Kelsey Foster |
Nov. 07, 2022 |
1242 |
- |
AI research review - Merging Models Modulo Permutation Symmetries |
Yash Khare |
Nov. 16, 2022 |
245 |
- |
Build standout call coaching features with AI Summarization |
Kelsey Foster |
Dec. 12, 2022 |
1385 |
- |
Releasing our new v9 transcription model - 11% better accuracy |
Ryan O'Connor |
Dec. 14, 2022 |
1336 |
- |
7 best practices for building better products with AI |
Kelsey Foster |
Dec. 22, 2022 |
244 |
- |
How ChatGPT actually works |
Marco Ramponi |
Dec. 23, 2022 |
3262 |
4 |
Build a free Stable Diffusion app with a GPU backend |
Ryan O'Connor |
Jan. 19, 2023 |
1820 |
- |
AI research review – Locating and Editing Factual Associations in GPT |
Gabriel Oexle |
Jan. 18, 2023 |
681 |
- |
What is Conversational Intelligence AI? |
Kelsey Foster |
Jan. 23, 2023 |
1059 |
- |
How CallRail doubled its Conversation Intelligence customers by building with a trusted AI partner |
Kelsey Foster |
Feb. 14, 2023 |
1184 |
- |
Why every Fortune 500 business needs a chief AI officer |
Dylan Fox |
Feb. 21, 2023 |
724 |
- |
Emergent Abilities of Large Language Models |
Ryan O'Connor |
Mar. 07, 2023 |
4055 |
7 |
6 Best AI playgrounds in 2023 |
Kelsey Foster |
Mar. 08, 2023 |
822 |
- |
3 easy ways to add AI Summarization to Conversation Intelligence tools |
Kelsey Foster |
Mar. 13, 2023 |
1516 |
- |
Conformer-1’s architecture |
- |
Mar. 16, 2023 |
1958 |
3 |
How AI-powered transcription helped a hiring intelligence platform cut time spent on manual tasks by 90% for its customers |
Kelsey Foster |
Apr. 11, 2023 |
1096 |
- |
How physics advanced Generative AI |
Ryan O'Connor |
Apr. 19, 2023 |
2781 |
49 |
How Grain builds with AI to generate powerful insights from customer meetings |
Kelsey Foster |
Apr. 26, 2023 |
773 |
- |
Everything you need to know about Generative AI |
Ryan O'Connor |
May. 02, 2023 |
442 |
- |
Introduction to Generative AI |
Ryan O'Connor |
May. 02, 2023 |
2527 |
3 |
The Full Story of Large Language Models and RLHF |
Marco Ramponi |
May. 03, 2023 |
5719 |
108 |
How AI helps Marvin's users spend 60% less time analyzing research data |
Kelsey Foster |
May. 04, 2023 |
619 |
- |
Introducing LeMUR, our new framework for applying powerful LLMs to transcribed speech |
- |
May. 09, 2023 |
963 |
- |
Modern Generative AI for images |
Ryan O'Connor |
May. 10, 2023 |
2584 |
3 |
3 ways to build and deploy AI tools and features faster |
Kelsey Foster |
May. 16, 2023 |
912 |
- |
Introduction to Large Language Models for Generative AI |
Ryan O'Connor |
May. 17, 2023 |
2832 |
2 |
Large Language Models for Product Managers: 5 Things to Know |
Marco Ramponi |
May. 23, 2023 |
2199 |
2 |
Python Speech-to-Text with Punctuation, Casing, and Formatting |
Matt Makai |
May. 25, 2023 |
928 |
- |
How Jiminny builds with AI models to secure 15% higher win rates for customers |
Kelsey Foster |
Jun. 01, 2023 |
796 |
- |
How to evaluate Speech Recognition models |
Ryan O'Connor |
Jun. 15, 2023 |
3816 |
5 |
Recent developments in Generative AI for Audio |
Marco Ramponi |
Jun. 27, 2023 |
4075 |
7 |
Conformer-2 |
- |
Jul. 20, 2023 |
2156 |
5 |
LeMUR |
- |
Jul. 27, 2023 |
1049 |
129 |
8 best AI subtitle generators for 2023 |
Kelsey Foster |
Jul. 24, 2023 |
1413 |
- |
How Reinforcement Learning from AI Feedback works |
Ryan O'Connor |
Aug. 01, 2023 |
5218 |
2 |
How RLHF Preference Model Tuning Works (And How Things May Go Wrong) |
Marco Ramponi |
Aug. 03, 2023 |
2160 |
95 |
Transcribe audio to text on Cloudflare Workers with AssemblyAI, Node.js, and TypeScript |
Niels Swimberghe |
Aug. 03, 2023 |
2651 |
5 |
Customer Stories: Conformer-2 in Action |
Kelsey Foster |
Aug. 14, 2023 |
721 |
- |
Automatic summarization with LLMs in Python |
Ryan O'Connor |
Aug. 15, 2023 |
1548 |
- |
How to integrate spoken audio into LangChain.js using AssemblyAI |
Niels Swimberghe |
Aug. 15, 2023 |
981 |
- |
Introducing the AssemblyAI integration for LangChain.js |
Niels Swimberghe |
Aug. 15, 2023 |
372 |
- |
Why Language Models Became Large Language Models And The Hurdles In Developing LLM-based Applications |
Marco Ramponi |
Aug. 18, 2023 |
1519 |
- |
RLHF vs RLAIF for language model alignment |
Ryan O'Connor |
Aug. 22, 2023 |
2635 |
2 |
Conversation AI: What it is and top use cases |
Kelsey Foster |
Aug. 29, 2023 |
1005 |
- |
How to build an interactive lecture summarization app |
Ryan O'Connor |
Aug. 31, 2023 |
1234 |
- |
How to use audio data in LangChain with Python |
Patrick Loeber |
Aug. 31, 2023 |
816 |
- |
What is Residual Vector Quantization? |
Marco Ramponi |
Sep. 04, 2023 |
1243 |
45 |
Convert Speech to Text in Python in 5 Minutes |
Ryan O'Connor |
Sep. 06, 2023 |
1005 |
- |
What is ASR? A Comprehensive Overview of Automatic Speech Recognition Technology |
Kelsey Foster |
Sep. 12, 2023 |
1816 |
- |
What Is Media Monitoring? (Definition, Benefits, and AI) |
Jesse Sumrak |
Sep. 12, 2023 |
1211 |
- |
How to use AI to automatically summarize meeting transcripts |
Kelsey Foster |
Sep. 13, 2023 |
1201 |
- |
How to get Zoom Transcripts with the Zoom API |
Ryan O'Connor |
Sep. 14, 2023 |
1996 |
- |
Best Large Language Models & Frameworks of 2023 |
Jesse Sumrak |
Sep. 18, 2023 |
1530 |
- |
Build a podcast question & answer application using Rivet and AssemblyAI |
Niels Swimberghe |
Sep. 20, 2023 |
1878 |
- |
Announcing the AssemblyAI plugin for Rivet |
Niels Swimberghe |
Sep. 20, 2023 |
148 |
- |
What AI Music Generators Can Do (And How They Do It) |
Marco Ramponi |
Sep. 22, 2023 |
2202 |
6 |
How to Choose the Best Speech-to-Text API |
Kelsey Foster |
Sep. 20, 2023 |
1431 |
- |
Retrieval Augmented Generation on audio data with LangChain and Chroma |
Ryan O'Connor |
Sep. 26, 2023 |
1886 |
1 |
How to use Speech AI systems for podcast hosting, editing, and monetization |
Kelsey Foster |
Sep. 27, 2023 |
1025 |
- |
How DALL-E 2 Actually Works |
Ryan O'Connor |
Sep. 29, 2023 |
3937 |
252 |
8 Ways Automatic Speech Recognition Can Increase Efficiency For Your Business |
Julie Griffin |
Sep. 29, 2023 |
1185 |
- |
Is Word Error Rate Useful? |
Dylan Fox |
Sep. 05, 2023 |
1405 |
- |
New Usage Dashboard + Mistral 7B First Look |
Smitha Kolan |
Oct. 02, 2023 |
474 |
- |
Real-time transcription in Python |
Ryan O'Connor |
Oct. 06, 2023 |
1260 |
- |
Speech AI for Sales Intelligence Platforms: How to Use AI in 2023 |
Jesse Sumrak |
Oct. 06, 2023 |
1028 |
- |
🚀LlamaIndex Integration + Model-Specific Usage Dashboards |
Smitha Kolan |
Oct. 09, 2023 |
427 |
- |
Building with Automatic Speech Recognition (ASR) models: Why accuracy matters |
Kelsey Foster |
Oct. 10, 2023 |
1180 |
- |
Announcing the AssemblyAI Node SDK 2.0 |
Niels Swimberghe |
Oct. 12, 2023 |
245 |
- |
How to use audio data in LlamaIndex with Python |
Patrick Loeber |
Oct. 16, 2023 |
650 |
- |
New Multilingual Capabilities and TypeScript/JavaScript SDK |
Smitha Kolan |
Oct. 19, 2023 |
371 |
- |
Speaker Diarization: Adding Speaker Labels for Enterprise Speech-to-Text |
Kelsey Foster |
Oct. 23, 2023 |
1798 |
- |
New Code Cookbooks & AssemblyAI's Q4 Product Enhancements |
Smitha Kolan |
Oct. 26, 2023 |
380 |
- |
Combining Speech Recognition and Diarization in one model |
Marco Ramponi |
Oct. 27, 2023 |
915 |
- |
How to Use Speech to Text AI for Ad Targeting & Brand Protection |
Jesse Sumrak |
Oct. 27, 2023 |
1365 |
- |
Faster Audio File Handling and Improved Error Messages |
Smitha Kolan |
Nov. 01, 2023 |
374 |
- |
Key phrase detection in audio files using Python |
Patrick Loeber |
Nov. 02, 2023 |
1037 |
- |
Automatically summarize audio and video files at scale with AI |
Kelsey Foster |
Nov. 03, 2023 |
1397 |
- |
Speech-to-Text AI for Product Managers: How It Works and Key Considerations |
Julie Griffin |
Nov. 03, 2023 |
1093 |
- |
Automatically determine video sections with AI using Python |
Ryan O'Connor |
Nov. 07, 2023 |
1579 |
4 |
Improved Punctuation Restoration & Truecasing Models |
Smitha Kolan |
Nov. 07, 2023 |
462 |
- |
Introducing Our New Punctuation Restoration and Truecasing Models |
Marco Ramponi |
Nov. 08, 2023 |
1759 |
- |
Text Summarization for NLP: 5 Best APIs, AI Models, and AI Summarizers in 2024 |
Kelsey Foster |
Nov. 09, 2023 |
2447 |
- |
2022 at AssemblyAI - A Year in Review |
Kelsey Foster, Ryan O'Connor |
Dec. 29, 2022 |
1115 |
- |
Why You Should (or Shouldn't) be Using Google's JAX in 2023 |
Ryan O'Connor |
Feb. 15, 2022 |
4992 |
3 |
Jupyter Notebooks Tips and Tricks |
Mısra Turp |
Jan. 04, 2022 |
76 |
- |
Winners and Honorable Mentions - AssemblyAI $50k Winter Hackathon |
Ryan O'Connor |
Dec. 15, 2022 |
1187 |
- |
MinImagen - Build Your Own Imagen Text-to-Image Model |
Ryan O'Connor |
Aug. 17, 2022 |
6698 |
3 |
Top Speaker Diarization Libraries and APIs in 2023 |
Kelsey Foster |
Feb. 08, 2022 |
1936 |
- |
Introduction to Variational Autoencoders Using Keras |
Ryan O'Connor |
Jan. 03, 2022 |
5654 |
- |
7 best practices for product teams to consider when building with AI |
Kelsey Foster |
Nov. 15, 2023 |
2115 |
- |
Enhancing Our Speech-to-Text Models with Google v5e TPUs and 🎉100K on YouTube |
Smitha Kolan |
Nov. 16, 2023 |
440 |
- |
AssemblyAI is now on the Amazon Web Services (AWS) Marketplace |
Kelsey Foster |
Nov. 20, 2023 |
308 |
- |
🚀LeMUR's Custom Text Input + Revamped Playground |
Smitha Kolan |
Nov. 24, 2023 |
375 |
- |
Should I build or buy an AI speech recognition system? |
Kelsey Foster |
Nov. 27, 2023 |
1397 |
- |
Extract phone call insights with LLMs in Python |
Ryan O'Connor |
Nov. 30, 2023 |
938 |
- |
5 Benefits of Speech AI for Video Editing Platforms |
Amanda Smith |
Dec. 01, 2023 |
950 |
- |
Improved Hold Music Detection + Build LLM Audio Apps with LeMUR |
Smitha Kolan |
Dec. 01, 2023 |
296 |
- |
AI-powered call analytics: How to extract insights from customer conversations |
Kelsey Foster |
Nov. 30, 2023 |
885 |
- |
6 Ways Telehealth Platforms Can Leverage Speech-to-Text AI |
Julie Griffin |
Nov. 30, 2023 |
1225 |
- |
Announcing our $50M Series C to build superhuman Speech AI models |
Dylan Fox |
Dec. 03, 2023 |
754 |
2 |
AI for Universal Audio Understanding: Qwen-Audio Explained |
Marco Ramponi |
Dec. 07, 2023 |
1513 |
1 |
Announcing the AssemblyAI integration for LlamaIndex.TS |
Niels Swimberghe |
Dec. 06, 2023 |
377 |
- |
How to integrate spoken audio into LlamaIndex.TS using AssemblyAI |
Niels Swimberghe |
Dec. 06, 2023 |
940 |
- |
🎉 Announcing our $50M Series C to build superhuman Speech AI models |
Smitha Kolan |
Dec. 08, 2023 |
353 |
- |
How to Create SRT Files for Videos in Python |
Patrick Loeber |
Dec. 14, 2023 |
812 |
- |
Speech AI use cases for Learning Management Systems |
Amanda Smith |
Dec. 18, 2023 |
1395 |
- |
How to Create VTT Files for Videos in Python |
Patrick Loeber |
Dec. 15, 2023 |
832 |
- |
2023 at AssemblyAI - A Year in Review |
Smitha Kolan |
Dec. 20, 2023 |
623 |
- |
🚀 New Punctuation & Casing Model For Real-Time Transcription |
Smitha Kolan |
Dec. 15, 2023 |
378 |
- |
Why Virtual Meeting Companies Should Use Speech AI |
Julie Griffin |
Jan. 02, 2024 |
1066 |
- |
AssemblyAI's New Integrations & Latest Tutorials |
Smitha Kolan |
Jan. 05, 2024 |
488 |
- |
Ask .NET Rocks! questions with Semantic Kernel, GPT, and Chroma DB |
Niels Swimberghe |
Jan. 08, 2024 |
3851 |
- |
Announcing the AssemblyAI integration for Semantic Kernel .NET |
Niels Swimberghe |
Jan. 08, 2024 |
312 |
- |
Introducing the AssemblyAI Java SDK |
Smitha Kolan |
Jan. 08, 2024 |
277 |
- |
Lower latency, lower cost, more possibilities |
Ryan O'Connor |
Jan. 10, 2024 |
1008 |
1 |
9 no-code and low-code ways to build AI-powered Speech-to-Text tools |
Kelsey Foster |
Jan. 12, 2024 |
1000 |
- |
Lower latency, reduced prices, and our Java SDK release |
Smitha Kolan |
Jan. 12, 2024 |
510 |
- |
Announcing the AssemblyAI Integration for Haystack |
Mısra Turp |
Jan. 16, 2024 |
354 |
- |
Claude 2.1 Now Available with LeMUR + New Integrations |
Smitha Kolan |
Jan. 19, 2024 |
422 |
- |
Announcing the AssemblyAI Go SDK |
Marcus Olsson |
Jan. 19, 2024 |
302 |
- |
Top 6 benefits of integrating LLMs for Conversation Intelligence platforms |
Kelsey Foster |
Jan. 23, 2024 |
956 |
- |
How to do Speech-To-Text with Go |
Patrick Loeber |
Jan. 23, 2024 |
387 |
- |
Top 3 benefits of Speech AI for Revenue Intelligence |
Kelsey Foster |
Jan. 25, 2024 |
1156 |
- |
🚀New AssemblyAI Go SDK & Speech-to-Text Tutorials |
Smitha Kolan |
Jan. 26, 2024 |
307 |
- |
Ask questions about your audio with LLMs |
Smitha Kolan |
Feb. 01, 2024 |
397 |
- |
Top AI models for conversation intelligence |
Kelsey Foster |
Feb. 07, 2024 |
1619 |
- |
Improvements to Real-Time Transcription |
Smitha Kolan |
Feb. 09, 2024 |
432 |
- |
Transcribe phone calls in real-time in Go with Twilio and AssemblyAI |
Marcus Olsson |
Feb. 12, 2024 |
1797 |
- |
Transcribe a phone call in real-time using Python with AssemblyAI and Twilio |
Ryan O'Connor |
Feb. 15, 2024 |
2918 |
- |
PII Redaction and Entity Detection In 13 New Languages 🇫🇷🇩🇪🇮🇳 |
Smitha Kolan |
Feb. 17, 2024 |
525 |
- |
AI trends in 2024: Graph Neural Networks |
Marco Ramponi |
Feb. 20, 2024 |
2934 |
3 |
The top free Speech-to-Text APIs, AI Models, and Open Source Engines |
Kelsey Foster |
Feb. 21, 2024 |
1581 |
1 |
Why product teams at top call tracking solutions are turning to AI |
Kelsey Foster |
Feb. 23, 2024 |
1543 |
- |
New Utterance Controls for Real-Time Transcription |
Smitha Kolan |
Feb. 23, 2024 |
326 |
- |
Detect scam calls using Go with LeMUR and Twilio |
Marcus Olsson |
Feb. 27, 2024 |
2065 |
- |
Top 3 ways to enhance AI video editing tools with Speech AI |
Kelsey Foster |
Feb. 29, 2024 |
1313 |
- |
AssemblyAI Java SDK New Features & Improvements |
Smitha Kolan |
Mar. 01, 2024 |
320 |
- |
How to use AI to build powerful market research tools |
Kelsey Foster |
Mar. 04, 2024 |
1576 |
- |
AI tools for business: Top 6 considerations before building with AI models and LLMs |
Kelsey Foster |
Mar. 07, 2024 |
1140 |
- |
Improved Audio LLM Docs & AssemblyAI Go SDK |
Smitha Kolan |
Mar. 08, 2024 |
396 |
- |
7 LLM use cases and applications in 2024 |
Jesse Sumrak |
Mar. 11, 2024 |
1311 |
- |
AssemblyAI Go SDK v1.3.0: Utterance Detection and Word Search |
Marcus Olsson |
Mar. 13, 2024 |
451 |
- |
A New Free Python Course to Build Real-World Audio AI Apps |
Patrick Loeber |
Mar. 13, 2024 |
205 |
- |
🔥 New Free Video Course from Talk Python: Build an Audio AI App |
Smitha Kolan |
Mar. 14, 2024 |
331 |
- |
Business use cases for Generative AI |
Amanda Smith |
Mar. 18, 2024 |
1318 |
- |
Automatically redact PII from audio and video with Python |
Ryan O'Connor |
Mar. 18, 2024 |
1092 |
- |
Real-Time is now Streaming Speech-to-Text, with added customization and control for users |
Kelsey Foster |
Mar. 19, 2024 |
502 |
- |
Improved Streaming Speech-to-Text Pricing and Features |
Mısra Turp |
Mar. 21, 2024 |
346 |
- |
Introducing Universal-1 |
Kelly Moon |
Apr. 04, 2024 |
2232 |
- |
Transcribe an audio file with Universal-1 using Go |
Marcus Olsson |
Apr. 04, 2024 |
486 |
- |
Build Audio LLM Apps with AssemblyAI |
Smitha Kolan |
Apr. 05, 2024 |
281 |
- |
Transcribe an audio file with Universal-1 in Node.js |
Marcus Olsson |
Apr. 08, 2024 |
436 |
- |
Transcribe audio and video files with Python and Universal-1 |
Matt Makai |
Apr. 09, 2024 |
709 |
- |
Introducing the AssemblyAI integration for LangChain Go |
Marcus Olsson |
Apr. 10, 2024 |
245 |
- |
9 ways to transform contact center results with AI-powered speech analytics |
Jesse Sumrak |
Apr. 08, 2024 |
1251 |
- |
Newsletter #30: 🚀Universal-1 Model Launch |
Smitha Kolan |
Apr. 12, 2024 |
313 |
- |
How Speech AI technology can improve transcription services |
Jesse Sumrak |
Apr. 15, 2024 |
1388 |
- |
Transcribe audio with Java using Universal-1 |
Niels Swimberghe |
Apr. 15, 2024 |
462 |
- |
Transcribe audio with Ruby using Universal-1 |
Niels Swimberghe |
Apr. 18, 2024 |
468 |
- |
AssemblyAI + 🔗LangChain Go, Universal-1 Recap |
Smitha Kolan |
Apr. 19, 2024 |
432 |
- |
Newsletter 32:⚡️Upgrades To Streaming Speech-to-Text |
Smitha Kolan |
Apr. 19, 2024 |
360 |
- |
Redact PII in Audio with Make and AssemblyAI |
Niels Swimberghe |
Apr. 23, 2024 |
824 |
- |
Introducing the AssemblyAI app for Make (Integromat) |
Niels Swimberghe |
Apr. 23, 2024 |
239 |
- |
Newsletter #33: Make.com Speech AI Integration and Streaming STT Updates |
Smitha Kolan |
Apr. 26, 2024 |
427 |
- |
Newsletter #34: AssemblyAI API Reference & Latest Tutorials |
Smitha Kolan |
May. 03, 2024 |
362 |
- |
Best and Nano Tiers: More Speech-to-Text and Pricing Options |
Kelly Moon |
May. 08, 2024 |
576 |
- |
Newsletter #35: Nano & Best: New Speech-to-text Pricing Options |
Marcus Olsson |
May. 10, 2024 |
331 |
- |
18 Ways Businesses are Launching New Products with Speech AI |
Jesse Sumrak |
May. 14, 2024 |
1453 |
- |
Newsletter #36: Latest Speech-to-text Model Benchmarks |
Smitha Kolan |
May. 17, 2024 |
354 |
- |
Filter profanity from audio files using Python |
Ryan O'Connor |
May. 22, 2024 |
589 |
- |
How to Use Speech AI for Healthcare Market Research |
Jesse Sumrak |
May. 24, 2024 |
965 |
- |
Newsletter #37: Speaker Diarization Now in 5 New Languages 🇨🇳🇮🇳🇯🇵🇰🇷🇻🇳 & Latest Speech AI tutorials |
Smitha Kolan |
May. 24, 2024 |
341 |
- |
Content moderation on audio files with Python |
Ryan O'Connor |
May. 27, 2024 |
1620 |
- |
Filter profanity from audio files using Node.js |
Niels Swimberghe |
May. 28, 2024 |
557 |
- |
Node.js Speech-to-Text with Punctuation, Casing, and Formatting |
Niels Swimberghe |
May. 30, 2024 |
576 |
- |
Newsletter #38: Apply LLMs To Voice Data |
Smitha Kolan |
May. 31, 2024 |
339 |
- |
How to Transcribe Audio to Text Accurately at Scale |
Jesse Sumrak |
May. 31, 2024 |
1047 |
- |
Summarize audio with LLMs in Node.js |
Niels Swimberghe |
Jun. 03, 2024 |
1168 |
- |
Newsletter #39: Build With AssemblyAI's Integrations |
Smitha Kolan |
Jun. 07, 2024 |
334 |
- |
Redact Personal Identifiable Information (PII) from audio with Node.js |
Niels Swimberghe |
Jun. 12, 2024 |
1271 |
- |
🇩🇪 New German STT & Improved PII Detection Models |
Smitha Kolan |
Jun. 14, 2024 |
414 |
- |
10 Ways Streaming Speech-to-Text (Live Transcription) is Being Used Today |
Jesse Sumrak |
Jun. 14, 2024 |
1406 |
- |
How to Create WebVTT Files for Videos in Node.js |
Niels Swimberghe |
Jun. 17, 2024 |
685 |
- |
How to Create SRT Files for Videos in Node.js |
Niels Swimberghe |
Jun. 17, 2024 |
685 |
- |
Announcing the AssemblyAI Starter App for Encore |
Marcus Olsson |
Jun. 19, 2024 |
805 |
- |
Speaker diarization improvements: new languages, increased accuracy |
Ryan O'Connor |
Jun. 20, 2024 |
1361 |
- |
🎙️ Speaker Diarization Now More Accurate & 🔔 Introducing Billing Alerts |
Mısra Turp |
Jun. 21, 2024 |
569 |
- |
Transcribe and generate subtitles for YouTube videos with Node.js |
Niels Swimberghe |
Jun. 24, 2024 |
1245 |
- |
How to Do Hotword Detection with Streaming Speech-to-Text and Go |
Yasoob Khalid |
Jun. 25, 2024 |
2407 |
- |
Get More from Audio Data with Conversational Intelligence |
Mısra Turp |
Jun. 28, 2024 |
543 |
- |
Keep up with the latest in AI |
Kelly Moon |
Apr. 03, 2024 |
117 |
- |
Keep up with the latest in AI |
Kelly Moon |
Apr. 03, 2024 |
117 |
- |
Build an AI-powered video conferencing app with Next.js and Stream |
Patrick Loeber, Stefan Blos |
Jul. 02, 2024 |
2866 |
- |
Create Multi-Lingual Subtitles with AssemblyAI and DeepL |
Aniket Bhattacharyea |
Jul. 08, 2024 |
2967 |
- |
Build Powerful Speech AI Apps with AssemblyAI and LLM Integrations |
Smitha Kolan |
Jul. 08, 2024 |
414 |
- |
Claude 3 Models now available with LeMUR |
JD Prater |
Jul. 10, 2024 |
565 |
- |
How to Create a Real-Time Language Translation Service with AssemblyAI and DeepL in JavaScript |
Aniket Bhattacharyea |
Jul. 12, 2024 |
2643 |
- |
Florence-2: How it works and how to use it |
Ryan O'Connor |
Jul. 15, 2024 |
2524 |
1 |
Use Claude 3.5 Sonnet With Audio Data & Latest Speech-to-Text Tutorials |
Smitha Kolan |
Jul. 15, 2024 |
206 |
- |
Announcing New Language Support for PII Text Redaction and Expanding Entity Detection |
JD Prater |
Jul. 18, 2024 |
899 |
- |
Speech-to-Text security: Top foundational security questions to consider for your next project using speech |
Miki Fukushima, Jesse Sumrak |
Jul. 18, 2024 |
2059 |
- |
Get started using Claude 3.5 Sonnet with audio data |
Patrick Loeber |
Jul. 19, 2024 |
1015 |
- |
🔥 New PII Redaction and Entity Detection Features |
Smitha Kolan |
Jul. 25, 2024 |
354 |
- |
🎉 AssemblyAI's Python SDK Crosses 100K Monthly Downloads & Latest Tutorials |
Smitha Kolan |
Aug. 02, 2024 |
258 |
- |
Introducing the enhanced AssemblyAI app for Zapier |
Niels Swimberghe |
Aug. 06, 2024 |
299 |
- |
Generate subtitles with AssemblyAI and Zapier |
Niels Swimberghe |
Aug. 06, 2024 |
730 |
- |
How to evaluate AI models and systems: Why objective benchmarks are important |
Kelly Moon |
Aug. 05, 2024 |
2028 |
- |
The Best Audio File Formats for Speech-to-Text: A Guide |
Patrick Loeber |
Aug. 09, 2024 |
1744 |
- |
New LeMUR Claude 3 Endpoints & Latest Zapier Integration |
Smitha Kolan |
Aug. 09, 2024 |
186 |
- |
The Best Audio File Formats for Speech-to-Text: A Guide |
Patrick Loeber |
Aug. 09, 2024 |
1744 |
- |
Introducing the AssemblyAI Ruby SDK |
Niels Swimberghe |
Aug. 12, 2024 |
227 |
- |
What is Customer Success? The key role of technical customer success and support teams in winning and retaining customers |
Jesse Sumrak, Lee Vaughn |
Aug. 12, 2024 |
1691 |
- |
Build with AssemblyAI's Speaker Diarization Model + Latest Tutorials |
Smitha Kolan |
Aug. 16, 2024 |
375 |
- |
Decoding Strategies: How LLMs Choose The Next Word |
Marco Ramponi |
Aug. 21, 2024 |
3810 |
8 |
Conversation intelligence: How to better understand the voice of the customer with Speech AI |
Joseph Rendeiro |
Aug. 22, 2024 |
1272 |
- |
Build with AssemblyAI's Streaming Speech-to-Text + Latest Tutorials |
Smitha Kolan |
Aug. 23, 2024 |
322 |
- |
Automatic language detection improvements: increased accuracy & expanded language support |
JD Prater |
Aug. 26, 2024 |
837 |
- |
Analyze Audio from Zoom Calls with AssemblyAI and Node.js |
David Ekete |
Aug. 28, 2024 |
2360 |
- |
What is speech to text? The complete guide |
Jesse Sumrak |
Aug. 29, 2024 |
2564 |
- |
🚀 Upgraded Automatic Language Detection + Latest Tutorials |
Smitha Kolan |
Aug. 30, 2024 |
219 |
- |
Introducing the AssemblyAI C# .NET SDK |
Niels Swimberghe |
Sep. 03, 2024 |
419 |
- |
What is speech recognition? A comprehensive guide |
Kelsey Foster |
Sep. 04, 2024 |
2066 |
- |
Build a Discord Voice Bot to Add ChatGPT to Your Voice Channel |
Michael Nyamande |
Sep. 05, 2024 |
2642 |
- |
AssemblyAI's C# .NET SDK + Latest Tutorials |
Smitha Kolan |
Sep. 06, 2024 |
377 |
- |
Speaker diarization vs speaker recognition - what's the difference? |
Ryan O'Connor |
Sep. 09, 2024 |
1015 |
- |
How to perform Speaker Diarization in Python |
Ryan O'Connor |
Sep. 10, 2024 |
1166 |
- |
Speech AI apps: 8 new Speech AI tools, releases, updates, and more |
Kelsey Foster |
Sep. 11, 2024 |
832 |
- |
How to identify languages in audio data using Python |
Patrick Loeber |
Sep. 12, 2024 |
834 |
- |
Build Powerful Speech AI Apps with AssemblyAI & Speaker Diarization Tutorials |
Smitha Kolan |
Sep. 13, 2024 |
307 |
- |
Introducing the AssemblyAI piece for Activepieces |
Niels Swimberghe |
Sep. 19, 2024 |
165 |
- |
Build a Real-Time AI Voice Bot Using Python, AssemblyAI, and ElevenLabs |
Smitha Kolan |
Sep. 19, 2024 |
1094 |
- |
Introducing the Postman collection for AssemblyAI |
Niels Swimberghe |
Sep. 26, 2024 |
143 |
- |
Speech-to-Text with Django |
Patrick Loeber |
Sep. 27, 2024 |
1597 |
- |
AI-powered meeting company Supernormal launches customizable Voice Agents |
Kelsey Foster |
Oct. 03, 2024 |
578 |
- |
Put Speech AI on the roadmap |
Jesse Sumrak |
Oct. 07, 2024 |
2016 |
- |
Introducing the AssemblyAI integration for Langflow |
Patrick Loeber |
Oct. 11, 2024 |
172 |
- |
How Delphi leverages AI to create digital clones of thought leaders |
Madison Boyd |
Oct. 18, 2024 |
908 |
- |
Speech recognition in the browser using Web Speech API |
Patrick Loeber |
Oct. 19, 2024 |
1294 |
- |
How to convert speech to text in Java |
Smitha Kolan |
Oct. 21, 2024 |
1088 |
- |
How to build a free Whisper API with GPU backend |
Ryan O'Connor |
Oct. 22, 2024 |
2502 |
- |
New 2024 Insights Report: How AI is shaping product strategy |
Chelsea Weber |
Oct. 23, 2024 |
325 |
- |
Beyond Word Error Rate: Universal-2 Delivers Accuracy Where It Matters |
JD Prater |
Oct. 31, 2024 |
1597 |
- |
7 best transcript summarizers powered by AI |
Kelsey Foster |
Oct. 31, 2024 |
1476 |
- |
The race to AI integration |
Chelsea Weber |
Nov. 07, 2024 |
1190 |
- |
Universal-2 vs OpenAI's Whisper: Comparing Speech-to-Text models in real-world use cases |
Patrick Loeber |
Nov. 07, 2024 |
2446 |
- |
Auto-generate subtitles with Python and AssemblyAI |
Marcus Olsson |
Nov. 05, 2024 |
692 |
- |
How to use Google's Speech-to-Text API to transcribe audio in Python |
Ryan O'Connor |
Nov. 12, 2024 |
2116 |
- |
Talk to ChatGPT on a Phone Call |
Artem Oppermann |
Nov. 15, 2024 |
3312 |
- |
Veed co-founders turn to Speech AI to democratize AI video editing |
Kelsey Foster |
Nov. 14, 2024 |
293 |
- |
How we built our AI Lakehouse |
Ahmed Etefy, Ryan O'Connor |
Nov. 19, 2024 |
3135 |
- |
How Dexa transforms podcasts into an interactive knowledge base |
Madison Boyd |
Nov. 19, 2024 |
778 |
- |
Build or buy? What industry leaders are choosing |
Chelsea Weber |
Nov. 21, 2024 |
965 |
- |
Voice content moderation with AI: Everything you need to know |
Jesse Sumrak |
Nov. 25, 2024 |
1641 |
- |
How to transcribe Zoom participant recordings (multichannel) |
Ryan O'Connor |
Nov. 25, 2024 |
2915 |
- |