Leveling up Workers AI: General Availability and more new capabilities
Cloudflare has announced the General Availability (GA) of its Workers AI inference platform, which is now more reliable and performant with improved pricing. The company also revealed updates on GPU hardware momentum, an expansion of its Hugging Face partnership, Bring Your Own LoRA fine-tuned inference, Python support in Workers, more providers in AI Gateway, and Vectorize metadata filtering. Additionally, Cloudflare plans to deploy GPUs to over 150 cities worldwide by the end of 2024, making it the most widely distributed cloud-AI inference platform.
Company
Cloudflare
Date published
April 2, 2024
Author(s)
Michelle Chen, Jesse Kipp, Syona Sarma, Brendan Irvine-Broque, Vy Ton
Word count
2011
Language
English
Hacker News points
None found.