This guide provides a step-by-step tutorial on setting up a small, distributed AI system using two Mac Minis, leveraging Ray for distributed computing and adapting vLLM concepts for Apple Silicon. The goal is to create an accessible entry point into distributed computing for AI workloads while maintaining control and privacy. By following the guide, developers can build a cost-effective environment for development, testing, and smaller production scenarios, offering a balance of performance, energy efficiency, and affordability. However, it's essential to understand the limitations of this setup, including memory constraints and potential bottlenecks in network performance.