DeepSeek V4-Pro makes its 75% API price cut permanent
Original: DeepSeek V4-Pro locks in 75% API discount as default pricing View original →
DeepSeek has moved a temporary promotion into its baseline API pricing, putting another hard number under the cost pressure facing large model providers. In a May 22, 2026 post on X, the company said V4-Pro’s discounted price will remain in place after the offer period, with the attached image stating: “DeepSeek-V4-Pro 75% OFF - Now Permanent.”
“We are making our discount permanent!”
The concrete change is a fourfold price cut. The image attached to the post lists cache-hit input at $0.003625, down from $0.0145; cache-miss input at $0.435, down from $1.74; and output at $0.87, down from $3.48. The quoted earlier post had extended the V4-Pro discount until May 31, 2026 at 15:59 UTC. The new post turns that deadline into a permanent pricing reset rather than another short promotion.
That matters because the next phase of LLM competition is not only about benchmark rankings. API economics decide whether developers can run coding agents, research assistants, and long-context workflows repeatedly enough to be useful. DeepSeek’s official account is typically used for model releases, API changes, open-weight availability, and pricing updates, so this post functions as a direct pricing signal to developers already testing V4-Pro against proprietary and open model alternatives.
The practical test now shifts to capacity. A low token price helps only if throughput, latency, and rate limits hold up under production load. If V4-Pro can sustain the new pricing without frequent overload errors, it may force buyers to reprice what they expect from higher-cost model APIs. Watch for responses from model routers, enterprise AI platforms, and rival labs that sell coding or agent workloads by token usage. The source post is available on DeepSeek’s X account.
Related Articles
Cache-hit pricing can decide whether long-context assistants are cheap enough to ship. DeepSeek said the entire API series now charges just one-tenth of the old rate for input cache hits, while keeping a 75% off V4-Pro promotion live.
LocalLLaMA reacted hard because DeepSeek's visual-primitives idea makes points and boxes part of reasoning itself, and the repo going private only made the thread hotter.
HN did not latch onto DeepSeek V4 because of a polished launch page. The thread took off when commenters realized the front-page link was just updated docs while the weights and base models were already live for inspection.
Comments (0)
No comments yet. Be the first to comment!