48 blog posts published by month since the start of 2024.

Posts year-to-date: 5 (4 posts by this month last year)
Average posts per month since 2024: 2.0

Post details (2024 to today)

| Title | Author | Date | Word count | HN points |
|---|---|---|---|---|
| Processing 2 Billion Images for Stable Diffusion Model Training - Definitive Guides with Ray Series | Max Pumperla, Marwan Sarieddine | May 14, 2024 | 4209 | - |
| Improve Utilization and Simplify Cluster Management with Anyscale Job Queues | Dominic Catalano, Alexey Kudinkin | Jul 23, 2024 | 735 | - |
| Ray Summit 2024 Call for Proposals is now open | Anyscale team | Apr 19, 2024 | 264 | - |
| Deploy Ray Serve with up to 50% fewer nodes using Anyscale Replica Compaction | Matt Connor, Akshay Malik, Cindy Zhang | Jul 15, 2024 | 883 | - |
| We Pre-Trained Stable Diffusion Models on 2 billion Images and Didn't Break the Bank - Definitive Guides with Ray Series | Max Pumperla, Marwan Sarieddine | May 21, 2024 | 4553 | - |
| Now Available: The LLM Router Template | Amjad Almahairi | Jul 19, 2024 | 256 | - |
| Fine-tuning LLMs for longer context and better RAG systems | Artur Niederfahrenhorst, Kourosh Hakhamaneshi | Feb 13, 2024 | 2847 | 1 |
| Accelerating AI: Harnessing Intel(R) Gaudi(R) 3 with Ray 2.10 | Ramit Hora | Apr 09, 2024 | 596 | - |
| Introducing Anyscale’s Unified Log Viewer | Alan Guo, Gene Su | Jul 18, 2024 | 405 | - |
| Cross-modal Search for E-commerce: Building and Scaling a Cross-Modal Image Retrieval App | Marwan Sarieddine, Natalia Czerep, Mateusz Kwasniak, Artur Zygadło | Jun 04, 2024 | 3253 | - |
| Reinventing Multi-Modal Search with Anyscale and MongoDB | Marwan Sarieddine, Kamil Kaczmarek | Jul 25, 2024 | 5145 | - |
| New on Anyscale: Debug and Optimize Ray Applications Faster with Structured Logging | Jiajun Yao, Kai-Hsun Chen | Jul 16, 2024 | 449 | - |
| Easily Debug Ray Applications with Ray Distributed Debugger | Anyscale team | May 15, 2024 | 624 | - |
| End-to-end LLM Workflows Guide | Goku Mohandas | Jun 17, 2024 | 4910 | 1 |
| Building an LLM Router for High-Quality and Cost-Effective Responses | Amjad Almahairi | Jul 01, 2024 | 4430 | 1 |
| Don’t Miss: Hands-On Ray Training at Ray Summit 2024 | Kamil Kaczmarek | Aug 13, 2024 | 788 | - |
| Low-latency Generative AI Model Serving with Ray, NVIDIA Triton Inference Server, and NVIDIA TensorRT-LLM | Neelay Shah, Akshay Malik | Mar 13, 2024 | 642 | - |
| Reducing the Cost of Pre-training Stable Diffusion by 3.7x with Anyscale | Yunxuan Xiao, Hao Chen | May 09, 2024 | 2176 | 5 |
| RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone | Scott Lee, Kyle Huang, Cheng Su, Hao Chen | Jan 16, 2024 | 995 | 1 |
| Update on Ray CVE-2023-48022: New Verification Tooling Available | Anyscale team | Mar 27, 2024 | 606 | - |
| Ray Spotlight Series: Multitenant Serve Applications with Runtime Envs as Containers | Sam Chan, Cindy Zhang | Jun 13, 2024 | 800 | - |
| Direct Preference Optimization with Synthetic Data on Anyscale | Franklin Wang, Sumanth Hegde, Kourosh Hakhamaneshi | Aug 21, 2024 | 9249 | 1 |
| Welcome Keerti | Robert Nishihara | Jul 31, 2024 | 743 | 2 |
| Introducing Elastic Distributed Training on Anyscale | Matthew Deng, Justin Yu | Jul 22, 2024 | 478 | - |
| Ray Spotlight: How we delivered Ray weekly releases | Sam Chan | Jun 25, 2024 | 629 | - |
| Scaling Embedding Generation Pipelines From Pandas to Ray Data | Marwan Sarieddine | Sep 04, 2024 | 2154 | - |
| Fine-tuning Llama-3, Mistral and Mixtral with Anyscale | Marwan Sarieddine and Kamil Kaczmarek | Sep 11, 2024 | 2256 | - |
| Building a RAG Batch Inference Pipeline with Anyscale and Union | Kevin Su and Kai-Hsun Chen | Sep 12, 2024 | 1665 | - |
| Roblox Guest Blog: Fast and Efficient Online Model Serving | Younes Abouelnagah | Sep 19, 2024 | 2925 | - |
| Accelerated Metadata Fetching in Ray Data up to 4.5x Faster on Anyscale | Balaji Veeramani, Hao Chen, Richard Liaw, Matthew Connor and Praveen Gorthy | Oct 01, 2024 | 607 | - |
| Anyscale on Kubernetes: Simplifying AI Workloads on User-Managed Infrastructure | Dominic Catalano and Yifei Feng | Oct 01, 2024 | 792 | - |
| Anyscale Now Available on AWS Marketplace and Achieves Generative AI Competency | The Anyscale Team | Oct 01, 2024 | 510 | - |
| Batch LLM Inference on Anyscale slashes AWS Bedrock costs by up to 6x | Cody Yu, Scott Lee, Ricky Xu, William Lin, Praveen Gorthy and Richard Liaw | Oct 01, 2024 | 1180 | - |
| Ray Data GA | Hao Chen, Richard Liaw and Praveen Gorthy | Oct 01, 2024 | 1037 | - |
| Anyscale’s New User Experience: A Comprehensive Overview | The Anyscale Team | Oct 01, 2024 | 1161 | - |
| Anyscale Now on GCP Marketplace | The Anyscale Team | Oct 01, 2024 | 381 | - |
| Autoscaling Large AI Models up to 5.1x Faster on Anyscale | Christopher Chou, Austin Kuo, Richard Liaw, Edward Oakes and Chris Sivanich | Oct 01, 2024 | 1260 | - |
| Enterprise Governance and Observability on Anyscale | The Anyscale Team | Oct 01, 2024 | 479 | - |
| Announcing RayTurbo | Akshay Malik, Praveen Gorthy and Richard Liaw | Oct 01, 2024 | 1453 | - |
| Ray Summit 2024: Breaking Through the AI Complexity Wall | The Anyscale Team | Oct 03, 2024 | 1600 | - |
| Ray Compiled Graphs: Optimized AI Workloads with Native GPU Communication | Sang Cho, Sam Chan and Stephanie Wang | Oct 07, 2024 | 1910 | - |
| Unlocking the Power of Scalable Machine Learning with Anyscale and Astronomer | The Anyscale Team | Oct 29, 2024 | 1063 | - |
| Anyscale Named a Cool Vendor for AI Engineering by Gartner® | The Anyscale Team | Nov 13, 2024 | 399 | - |
| Deploying DeepSeek R1 on Anyscale | The Anyscale Team | Feb 12, 2025 | 995 | - |
| Introducing the Ray Kubectl Plugin: A Simpler Way to Manage Ray Clusters on Kubernetes | The KubeRay Team | Feb 20, 2025 | 1182 | - |
| KubeRay v1.3.0: Enhancing Observability, Reliability, and Usability | The KubeRay Team | Feb 20, 2025 | 2151 | - |
| uv + Ray: Pain-Free Python Dependencies in Clusters | Christina Zhu and Philipp Moritz | Feb 27, 2025 | 1718 | 1 |
| Announcing the Anyscale Technical Webinar Series: Learn Ray and Distributed AI | The Anyscale Team | Mar 06, 2025 | 413 | - |