Together AI and NVIDIA collaborate to power Llama 3.1 models for enterprises on NVIDIA DGX Cloud

Company

Together AI

Date Published

July 23, 2024

Author

Together AI

Word count

612

Language

English

Hacker News points

None

URL

www.together.ai/blog/nvidia-ai-foundry-partnership

Summary

Together AI and NVIDIA have collaborated to power Llama 3.1 models for enterprises on NVIDIA DGX Cloud, bringing industry-leading Together Inference Engine to NVIDIA AI Foundry customers. This collaboration empowers enterprises to leverage openly available models like Llama 3.1 running on the Together Inference Engine on NVIDIA DGX Cloud, enabling highly optimized inference capabilities with unmatched performance, accuracy, and cost-efficiency. The partnership introduces the highly optimized Together Inference Engine to DGX Cloud, offering companies efficient and scalable AI inference capabilities, while allowing them to fine-tune models with their proprietary data for higher accuracy and performance. The collaboration marks an inflection point for open source AI with the launch of Llama 3.1 405B, the largest openly available foundation model, which offers unmatched flexibility, control, and state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation. This partnership enables enterprises to deploy endpoints with the highest performance, scalability, and security on NVIDIA DGX Cloud, accelerating the adoption of open-source AI among developers and enterprises.