LLM X/Twitter 3h ago 2 min read NVIDIA TwoTower keeps 98.7% quality while generating 2.42x faster NVIDIA is testing a different route to faster LLM decoding. Nemotron-Labs-TwoTower adapts a 30B backbone into a two-tower diffusion model that keeps 98.7% of baseline quality while reaching 2.42x throughput. #nvidia#nemotron#diffusion-llm 1