Building a Real-Time Shopping Assistant: Turn Live Video into Instant Purchases |
Michael Louis |
Aug 14, 2024 |
2435 |
- |
Using Codestral to Summarize, Correct and Auto-Approve Pull Requests |
Michael Louis |
Jun 15, 2024 |
1526 |
- |
Creating a realtime RAG voice agent |
Michael Louis |
Jul 21, 2024 |
1857 |
- |
Productionize your Comfy UI Workflow |
|
Apr 09, 2024 |
97 |
1 |
Installing Python Packages via UV leads to 3.75x increase in build performance |
|
Feb 15, 2024 |
28 |
- |
Getting better price-performance, latency, and availability on AWS Trn1/Inf2 instances |
Michael Louis |
May 20, 2024 |
1546 |
- |
Creating an Executive Assistant using LangChain, LangSmith, Cerebrium and Cal.com |
Michael Louis |
May 19, 2024 |
2482 |
- |
Running Llama 3 8B with TensorRT-LLM on Serverless GPUs |
Michael Louis |
May 16, 2024 |
1410 |
- |
How to Build a Real-Time AI Avatar for Training and Coaching |
Michael Louis |
Sep 17, 2024 |
2529 |
- |
Cerebrium supports HIPAA compliance: A guide for health applications |
Kyle Gani |
Sep 30, 2024 |
1208 |
- |
Benchmarking vLLM, SGLang and TensorRT for Llama 3.1 API |
Michael Louis |
Oct 10, 2024 |
643 |
- |
An Alternative to OpenAI Realtime API for Voice Capabilities |
Michael Louis |
Oct 14, 2024 |
1359 |
7 |
ML apps at scale: ASGI support now available on Cerebrium |
Kyle Gani |
Oct 28, 2024 |
452 |
- |
Overcoming Transcription Challenges for Multilingual AI voice agents |
Michael Louis |
Dec 19, 2024 |
1275 |
- |
Building a Real-time Coding Assistant |
Kyle Gani |
Feb 20, 2025 |
3114 |
- |
Creating a realtime AI Commentator with Cerebrium, LiveKit and Cartesia |
Michael Louis |
Feb 18, 2025 |
4243 |
- |