#rocm

AI Hacker News Apr 13, 2026 2 min read

Hacker News spotlights AMD's step-by-step ROCm strategy against CUDA's moat

A front-page Hacker News discussion resurfaced an EE Times interview outlining how AMD wants ROCm, Triton, OneROCm, and an open-source release model to chip away at CUDA dependence. The real test is not a headline compatibility claim, but whether stacks like vLLM and SGLang work in a boring, dependable way.

#rocm #cuda #amd

LLM Reddit Mar 23, 2026 2 min read

LocalLLaMA Shares Mi50 ROCm 7 vs Vulkan Benchmarks for llama.cpp

A benchmark thread on r/LocalLLaMA compared ROCm 7 nightlies and Vulkan on an AMD Mi50 for llama.cpp, arguing that Vulkan wins short dense workloads while ROCm pulls ahead on long context and some MoE scenarios.

#llama.cpp #rocm #vulkan

LLM Reddit Mar 8, 2026 2 min read

LocalLLaMA shares a llama.cpp tuning tip: smaller n_ubatch unlocked much faster Qwen 27B prompt processing

A LocalLLaMA thread reported a large prompt-processing speedup on Qwen3.5-27B by lowering llama.cpp `--ubatch-size` to 64 on an RX 9070 XT. The interesting part is not a universal magic number, but the reminder that prompt ingestion and token generation can respond very differently to `n_ubatch` tuning.

#llama.cpp #qwen #rocm