LLM Reddit 4h ago 2 min read
A detailed LocalLLaMA post compared a $10K Mac Studio M3 Ultra 512GB with a similarly priced dual DGX Spark setup for running Qwen3.5 397B A17B locally. The Mac delivered 30 to 40 tok/s and easier setup, while the dual Spark build offered faster prefill and embedding performance at much higher operational complexity.