Company
Date Published
Author
Together AI
Word count
984
Language
English
Hacker News points
None

Summary

Together AI is expanding its infrastructure to support large-scale DeepSeek-R1 workloads with Together Reasoning Clusters, which provide dedicated GPU infrastructure for high-throughput, low-latency inference. This offering is designed for companies running large-scale reasoning models and provides benefits such as consistent, low-latency performance, cost-effective scaling, secure environments, and enterprise support. The company also offers a fast, secure serverless API for DeepSeek-R1, with features like instant scalability, flexible pricing, and higher rate limits compared to other providers. Additionally, Together AI provides a free endpoint for the 70B distilled model, allowing users to get started with no upfront cost.