16 blog posts published by month since the start of 2022. Start from a different year:

Posts year-to-date
2 (1 posts by this month last year.)
Average posts per month since 2022
0.3

Post details (2022 to today)

Title Author Date Word count HN points
Building a Real-Time Shopping Assistant: Turn Live Video into Instant Purchases Michael Louis Aug 14, 2024 2435 -
Using Codestral to Summarize, Correct and Auto-Approve Pull Requests Michael Louis Jun 15, 2024 1526 -
Creating a realtime RAG voice agent Michael Louis Jul 21, 2024 1857 -
Productionize your Comfy UI Workflow Apr 09, 2024 97 1
Installing Python Packages via UV leads to 3.75x increase in build performance Feb 15, 2024 28 -
Getting better price-performance, latency, and availability on AWS Trn1/Inf2 instances Michael Louis May 20, 2024 1546 -
Creating an Executive Assistant using LangChain, LangSmith, Cerebrium and Cal.com Michael Louis May 19, 2024 2482 -
Running Llama 3 8B with TensorRT-LLM on Serverless GPUs Michael Louis May 16, 2024 1410 -
How to Build a Real-Time AI Avatar for Training and Coaching Michael Louis Sep 17, 2024 2529 -
Cerebrium supports HIPAA compliance: A guide for health applications Kyle Gani Sep 30, 2024 1208 -
Benchmarking vLLM, SGLang and TensorRT for Llama 3.1 API Michael Louis Oct 10, 2024 643 -
An Alternative to OpenAI Realtime API for Voice Capabilities Michael Louis Oct 14, 2024 1359 7
ML apps at scale: ASGI support now available on Cerebrium Kyle Gani Oct 28, 2024 452 -
Overcoming Transcription Challenges for Multilingual AI voice agents Michael Louis Dec 19, 2024 1275 -
Building a Real-time Coding Assistant Kyle Gani Feb 20, 2025 3114 -
Creating a realtime AI Commentator with Cerebrium, LiveKit and Cartesia Michael Louis Feb 18, 2025 4243 -