Processing 2 Billion Images for Stable Diffusion Model Training - Definitive Guides with Ray Series |
Max Pumperla, Marwan Sarieddine |
May. 14, 2024 |
4209 |
- |
Improve Utilization and Simplify Cluster Management with Anyscale Job Queues |
Dominic Catalano, Alexey Kudinkin |
Jul. 23, 2024 |
735 |
- |
Ray Summit 2024 Call for Proposals is now open |
Anyscale team |
Apr. 19, 2024 |
264 |
- |
Deploy Ray Serve with up to 50% fewer nodes using Anyscale Replica Compaction |
Matt Connor, Akshay Malik, Cindy Zhang |
Jul. 15, 2024 |
883 |
- |
We Pre-Trained Stable Diffusion Models on 2 billion Images and Didn't Break the Bank - Definitive Guides with Ray Series |
Max Pumperla, Marwan Sarieddine |
May. 21, 2024 |
4553 |
- |
Now Available: The LLM Router Template |
Amjad Almahairi |
Jul. 19, 2024 |
256 |
- |
Fine-tuning LLMs for longer context and better RAG systems |
Artur Niederfahrenhorst, Kourosh Hakhamaneshi |
Feb. 13, 2024 |
2847 |
1 |
Accelerating AI: Harnessing Intel(R) Gaudi(R) 3 with Ray 2.10 |
Ramit Hora |
Apr. 09, 2024 |
596 |
- |
Introducing Anyscale’s Unified Log Viewer |
Alan Guo, Gene Su |
Jul. 18, 2024 |
405 |
- |
Cross-modal Search for E-commerce: Building and Scaling a Cross-Modal Image Retrieval App |
Marwan Sarieddine, Natalia Czerep, Mateusz Kwasniak, Artur Zygadło |
Jun. 04, 2024 |
3253 |
- |
Reinventing Multi-Modal Search with Anyscale and MongoDB |
Marwan Sarieddine, Kamil Kaczmarek |
Jul. 25, 2024 |
5145 |
- |
New on Anyscale: Debug and Optimize Ray Applications Faster with Structured Logging |
Jiajun Yao, Kai-Hsun Chen |
Jul. 16, 2024 |
449 |
- |
Easily Debug Ray Applications with Ray Distributed Debugger |
Anyscale team |
May. 15, 2024 |
624 |
- |
End-to-end LLM Workflows Guide |
Goku Mohandas |
Jun. 17, 2024 |
4910 |
1 |
Building an LLM Router for High-Quality and Cost-Effective Responses |
Amjad Almahairi |
Jul. 01, 2024 |
4430 |
1 |
Don’t Miss: Hands-On Ray Training at Ray Summit 2024 |
Kamil Kaczmarek |
Aug. 13, 2024 |
788 |
- |
Low-latency Generative AI Model Serving with Ray, NVIDIA Triton Inference Server, and NVIDIA TensorRT-LLM |
Neelay Shah, Akshay Malik |
Mar. 13, 2024 |
642 |
- |
Reducing the Cost of Pre-training Stable Diffusion by 3.7x with Anyscale |
Yunxuan Xiao, Hao Chen |
May. 09, 2024 |
2176 |
5 |
RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone |
Scott Lee, Kyle Huang, Cheng Su, Hao Chen |
Jan. 16, 2024 |
995 |
1 |
Update on Ray CVE-2023-48022: New Verification Tooling Available |
Anyscale team |
Mar. 27, 2024 |
606 |
- |
Ray Spotlight Series: Multitenant Serve Applications with Runtime Envs as Containers |
Sam Chan, Cindy Zhang |
Jun. 13, 2024 |
800 |
- |
Direct Preference Optimization with Synthetic Data on Anyscale |
Franklin Wang, Sumanth Hegde, Kourosh Hakhamaneshi |
Aug. 21, 2024 |
9249 |
1 |
Welcome Keerti |
Robert Nishihara |
Jul. 31, 2024 |
743 |
2 |
Introducing Elastic Distributed Training on Anyscale |
Matthew Deng, Justin Yu |
Jul. 22, 2024 |
478 |
- |
Ray Spotlight: How we delivered Ray weekly releases |
Sam Chan |
Jun. 25, 2024 |
629 |
- |
Scaling Embedding Generation Pipelines From Pandas to Ray Data |
Marwan Sarieddine |
Sep. 04, 2024 |
2154 |
- |
Fine-tuning Llama-3, Mistral and Mixtral with Anyscale |
Marwan Sarieddine and Kamil Kaczmarek |
Sep. 11, 2024 |
2256 |
- |
Building a RAG Batch Inference Pipeline with Anyscale and Union |
Kevin Su and Kai-Hsun Chen |
Sep. 12, 2024 |
1665 |
- |
Roblox Guest Blog: Fast and Efficient Online Model Serving |
Younes Abouelnagah |
Sep. 19, 2024 |
2925 |
- |
Accelerated Metadata Fetching in Ray Data up to 4.5x Faster on Anyscale |
Balaji Veeramani, Hao Chen, Richard Liaw, Matthew Connor and Praveen Gorthy |
Oct. 01, 2024 |
607 |
- |
Anyscale on Kubernetes: Simplifying AI Workloads on User-Managed Infrastructure |
Dominic Catalano and Yifei Feng |
Oct. 01, 2024 |
792 |
- |
Anyscale Now Available on AWS Marketplace and Achieves Generative AI Competency |
The Anyscale Team |
Oct. 01, 2024 |
510 |
- |
Batch LLM Inference on Anyscale slashes AWS Bedrock costs by up to 6x |
Cody Yu, Scott Lee, Ricky Xu, William Lin, Praveen Gorthy and Richard Liaw |
Oct. 01, 2024 |
1180 |
- |
Ray Data GA |
Hao Chen, Richard Liaw and Praveen Gorthy |
Oct. 01, 2024 |
1037 |
- |
Anyscale’s New User Experience: A Comprehensive Overview |
The Anyscale Team |
Oct. 01, 2024 |
1161 |
- |
Anyscale Now on GCP Marketplace |
The Anyscale Team |
Oct. 01, 2024 |
381 |
- |
Autoscaling Large AI Models up to 5.1x Faster on Anyscale |
Christopher Chou, Austin Kuo, Richard Liaw, Edward Oakes and Chris Sivanich |
Oct. 01, 2024 |
1260 |
- |
Enterprise Governance and Observability on Anyscale |
The Anyscale Team |
Oct. 01, 2024 |
479 |
- |
Announcing RayTurbo |
Akshay Malik, Praveen Gorthy and Richard Liaw |
Oct. 01, 2024 |
1453 |
- |
Ray Summit 2024: Breaking Through the AI Complexity Wall |
The Anyscale Team |
Oct. 03, 2024 |
1600 |
- |
Ray Compiled Graphs: Optimized AI Workloads with Native GPU Communication |
Sang Cho, Sam Chan and Stephanie Wang |
Oct. 07, 2024 |
1910 |
- |
Unlocking the Power of Scalable Machine Learning with Anyscale and Astronomer |
The Anyscale Team |
Oct. 29, 2024 |
1063 |
- |
Anyscale Named a Cool Vendor for AI Engineering by Gartner® |
The Anyscale Team |
Nov. 13, 2024 |
399 |
- |