Together AI and NVIDIA have collaborated to power Llama 3.1 models for enterprises on NVIDIA DGX Cloud, bringing the Together Inference Engine to NVIDIA AI Foundry customers. Enterprises can run openly available models such as Llama 3.1 on the Together Inference Engine on DGX Cloud, gaining efficient, scalable inference optimized for performance, accuracy, and cost-efficiency, and can fine-tune those models on their proprietary data for higher accuracy and performance. The collaboration coincides with the launch of Llama 3.1 405B, the largest openly available foundation model, which the companies describe as an inflection point for open-source AI. The model offers flexibility, control, and state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation. With the partnership, enterprises can deploy high-performance, scalable, and secure endpoints on NVIDIA DGX Cloud, which is intended to accelerate the adoption of open-source AI among developers and enterprises.