Leveling up Workers AI: General Availability and more new capabilities

What's this blog post about?

Cloudflare has announced the General Availability (GA) of Workers AI, its inference platform, citing improved reliability, performance, and pricing. The post also covers GPU hardware updates, an expanded Hugging Face partnership, Bring Your Own LoRA fine-tuned inference, Python support in Workers, additional providers in AI Gateway, and metadata filtering in Vectorize. Cloudflare also plans to deploy GPUs to over 150 cities worldwide by the end of 2024, which it says would make Workers AI the most widely distributed cloud-AI inference platform.

Company
Cloudflare

Date published
April 2, 2024

Author(s)
Michelle Chen, Jesse Kipp, Syona Sarma, Brendan Irvine-Broque, Vy Ton

Word count
2011

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.