AMD Ryzen AI Max Pro 495 Leaks with 192GB Unified Memory — Local AI Gets a Big Upgrade
Original: AMD Strix Halo refresh with 192gb! View original →
The Leak
Specifications for AMD's Ryzen AI Max Pro 495, internally codenamed Gorgon Halo, have surfaced via videocardz.com. The headline number: 192GB of unified memory, a 50% jump over the 128GB found in the current Strix Halo lineup. The post scored 350 on r/LocalLLaMA, signaling strong interest from the local AI community.
Why It Matters
For local LLM inference, unified memory capacity is the primary constraint on which models can run and at what precision. With the current 128GB Strix Halo, fitting a 70B model at 4-bit quantization is already tight. At 192GB, users could run much larger models at higher precision, or keep multiple models loaded in memory simultaneously.
The chip also reportedly pairs the memory upgrade with a Radeon 8065S iGPU. Combined with the larger memory pool, inference speeds and batch processing throughput should see meaningful improvement over the current generation.
Context and Timeline
This is a leak, not an official announcement. Community speculation points to a late 2026 launch. Apple's M4 Max tops out at 128GB, making the potential 192GB Gorgon Halo the highest-memory consumer AI chip if it ships as leaked.
Related Articles
HN latched onto the RAM shortage because the uncomfortable link is physical: HBM demand for AI data centers is now shaping prices for phones, laptops, and handhelds.
Personal AI is shifting from single-session answers to durable context. OpenAI says the new ChatGPT memory starts with US Plus and Pro users and uses a more efficient architecture that reduced compute needs by about 5x.
DeepSeek has reportedly raised $7.4B at a valuation above $50B in its first external funding round. The unusual part is control: most investors are said to accept a five-year lock-up and no voting rights.