Processing 2 Billion Images for Stable Diffusion Model Training - Definitive Guides with Ray Series |
Max Pumperla, Marwan Sarieddine |
May 14, 2024 |
4209 |
- |
Improve Utilization and Simplify Cluster Management with Anyscale Job Queues |
Dominic Catalano, Alexey Kudinkin |
Jul 23, 2024 |
735 |
- |
Ray Summit 2024 Call for Proposals is now open |
Anyscale team |
Apr 19, 2024 |
264 |
- |
Deploy Ray Serve with up to 50% fewer nodes using Anyscale Replica Compaction |
Matt Connor, Akshay Malik, Cindy Zhang |
Jul 15, 2024 |
883 |
- |
We Pre-Trained Stable Diffusion Models on 2 billion Images and Didn't Break the Bank - Definitive Guides with Ray Series |
Max Pumperla, Marwan Sarieddine |
May 21, 2024 |
4553 |
- |
Now Available: The LLM Router Template |
Amjad Almahairi |
Jul 19, 2024 |
256 |
- |
Fine-tuning LLMs for longer context and better RAG systems |
Artur Niederfahrenhorst, Kourosh Hakhamaneshi |
Feb 13, 2024 |
2847 |
1 |
Accelerating AI: Harnessing Intel® Gaudi® 3 with Ray 2.10 |
Ramit Hora |
Apr 09, 2024 |
596 |
- |
Introducing Anyscale’s Unified Log Viewer |
Alan Guo, Gene Su |
Jul 18, 2024 |
405 |
- |
Cross-modal Search for E-commerce: Building and Scaling a Cross-Modal Image Retrieval App |
Marwan Sarieddine, Natalia Czerep, Mateusz Kwasniak, Artur Zygadło |
Jun 04, 2024 |
3253 |
- |
Reinventing Multi-Modal Search with Anyscale and MongoDB |
Marwan Sarieddine, Kamil Kaczmarek |
Jul 25, 2024 |
5145 |
- |
New on Anyscale: Debug and Optimize Ray Applications Faster with Structured Logging |
Jiajun Yao, Kai-Hsun Chen |
Jul 16, 2024 |
449 |
- |
Easily Debug Ray Applications with Ray Distributed Debugger |
Anyscale team |
May 15, 2024 |
624 |
- |
End-to-end LLM Workflows Guide |
Goku Mohandas |
Jun 17, 2024 |
4910 |
1 |
Building an LLM Router for High-Quality and Cost-Effective Responses |
Amjad Almahairi |
Jul 01, 2024 |
4430 |
1 |
Don’t Miss: Hands-On Ray Training at Ray Summit 2024 |
Kamil Kaczmarek |
Aug 13, 2024 |
788 |
- |
Low-latency Generative AI Model Serving with Ray, NVIDIA Triton Inference Server, and NVIDIA TensorRT-LLM |
Neelay Shah, Akshay Malik |
Mar 13, 2024 |
642 |
- |
Reducing the Cost of Pre-training Stable Diffusion by 3.7x with Anyscale |
Yunxuan Xiao, Hao Chen |
May 09, 2024 |
2176 |
5 |
RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone |
Scott Lee, Kyle Huang, Cheng Su, Hao Chen |
Jan 16, 2024 |
995 |
1 |
Update on Ray CVE-2023-48022: New Verification Tooling Available |
Anyscale team |
Mar 27, 2024 |
606 |
- |
Ray Spotlight Series: Multitenant Serve Applications with Runtime Envs as Containers |
Sam Chan, Cindy Zhang |
Jun 13, 2024 |
800 |
- |
Direct Preference Optimization with Synthetic Data on Anyscale |
Franklin Wang, Sumanth Hegde, Kourosh Hakhamaneshi |
Aug 21, 2024 |
9249 |
1 |
Welcome Keerti |
Robert Nishihara |
Jul 31, 2024 |
743 |
2 |
Introducing Elastic Distributed Training on Anyscale |
Matthew Deng, Justin Yu |
Jul 22, 2024 |
478 |
- |
Ray Spotlight: How we delivered Ray weekly releases |
Sam Chan |
Jun 25, 2024 |
629 |
- |
Scaling Embedding Generation Pipelines From Pandas to Ray Data |
Marwan Sarieddine |
Sep 04, 2024 |
2154 |
- |
Fine-tuning Llama-3, Mistral and Mixtral with Anyscale |
Marwan Sarieddine and Kamil Kaczmarek |
Sep 11, 2024 |
2256 |
- |
Building a RAG Batch Inference Pipeline with Anyscale and Union |
Kevin Su and Kai-Hsun Chen |
Sep 12, 2024 |
1665 |
- |
Roblox Guest Blog: Fast and Efficient Online Model Serving |
Younes Abouelnagah |
Sep 19, 2024 |
2925 |
- |
Accelerated Metadata Fetching in Ray Data up to 4.5x Faster on Anyscale |
Balaji Veeramani, Hao Chen, Richard Liaw, Matthew Connor and Praveen Gorthy |
Oct 01, 2024 |
607 |
- |
Anyscale on Kubernetes: Simplifying AI Workloads on User-Managed Infrastructure |
Dominic Catalano and Yifei Feng |
Oct 01, 2024 |
792 |
- |
Anyscale Now Available on AWS Marketplace and Achieves Generative AI Competency |
The Anyscale Team |
Oct 01, 2024 |
510 |
- |
Batch LLM Inference on Anyscale slashes AWS Bedrock costs by up to 6x |
Cody Yu, Scott Lee, Ricky Xu, William Lin, Praveen Gorthy and Richard Liaw |
Oct 01, 2024 |
1180 |
- |
Ray Data GA |
Hao Chen, Richard Liaw and Praveen Gorthy |
Oct 01, 2024 |
1037 |
- |
Anyscale’s New User Experience: A Comprehensive Overview |
The Anyscale Team |
Oct 01, 2024 |
1161 |
- |
Anyscale Now on GCP Marketplace |
The Anyscale Team |
Oct 01, 2024 |
381 |
- |
Autoscaling Large AI Models up to 5.1x Faster on Anyscale |
Christopher Chou, Austin Kuo, Richard Liaw, Edward Oakes and Chris Sivanich |
Oct 01, 2024 |
1260 |
- |
Enterprise Governance and Observability on Anyscale |
The Anyscale Team |
Oct 01, 2024 |
479 |
- |
Announcing RayTurbo |
Akshay Malik, Praveen Gorthy and Richard Liaw |
Oct 01, 2024 |
1453 |
- |
Ray Summit 2024: Breaking Through the AI Complexity Wall |
The Anyscale Team |
Oct 03, 2024 |
1600 |
- |
Ray Compiled Graphs: Optimized AI Workloads with Native GPU Communication |
Sang Cho, Sam Chan and Stephanie Wang |
Oct 07, 2024 |
1910 |
- |
Unlocking the Power of Scalable Machine Learning with Anyscale and Astronomer |
The Anyscale Team |
Oct 29, 2024 |
1063 |
- |
Anyscale Named a Cool Vendor for AI Engineering by Gartner® |
The Anyscale Team |
Nov 13, 2024 |
399 |
- |
Deploying DeepSeek R1 on Anyscale |
The Anyscale Team |
Feb 12, 2025 |
995 |
- |
Introducing the Ray Kubectl Plugin: A Simpler Way to Manage Ray Clusters on Kubernetes |
The KubeRay Team |
Feb 20, 2025 |
1182 |
- |
KubeRay v1.3.0: Enhancing Observability, Reliability, and Usability |
The KubeRay Team |
Feb 20, 2025 |
2151 |
- |
uv + Ray: Pain-Free Python Dependencies in Clusters |
Christina Zhu and Philipp Moritz |
Feb 27, 2025 |
1718 |
1 |
Announcing the Anyscale Technical Webinar Series: Learn Ray and Distributed AI |
The Anyscale Team |
Mar 06, 2025 |
413 |
- |