#deepseek

LLM Hacker News Jun 28, 2026 2 min read

DeepSeek DSpark shifts the LLM inference bottleneck to smarter verification

The useful detail is not just another speedup number: DSpark asks which drafted tokens deserve verification. DeepSeek reports 60-85% faster per-user generation on DeepSeek-V4 at matched throughput.

#deepseek #speculative-decoding #llm-inference

AI News Jun 17, 2026 1 min read

DeepSeek’s $7.4B round tests founder control at AI scale

DeepSeek has reportedly raised $7.4B at a valuation above $50B in its first external funding round. The unusual part is control: most investors are said to accept a five-year lock-up and no voting rights.

#deepseek #funding #china-ai

LLM X/Twitter May 24, 2026 1 min read

DeepSeek V4-Pro makes its 75% API price cut permanent

DeepSeek turned a temporary V4-Pro API discount into standard pricing, intensifying the cost race around frontier-class LLM access. The posted table cuts output pricing from $3.48 to $0.87 per million tokens.

#deepseek #v4-pro #api-pricing

AI Reddit May 22, 2026 1 min read

DeepSeek Advances $10.29B Financing Round, Founder Declares AGI Goal and Open-Source Commitment

Bloomberg reports DeepSeek is pushing forward with a $10.29 billion financing round. Founder Liang Wenfeng publicly reaffirmed commitment to open-source AI development and AGI over short-term commercialization.

#deepseek #funding #agi

LLM Reddit May 5, 2026 1 min read

DeepSeek V4 Pro Matches GPT-5.2 on Agentic Benchmark — 17x Cheaper, 10 Weeks Later

DeepSeek V4 Pro tied with GPT-5.2 on FoodTruck Bench, a 30-day agentic benchmark using 34 tools, arriving roughly 10 weeks after GPT-5.2 was tested at approximately 17x lower cost.

#deepseek #benchmark #llm

LLM Hacker News May 4, 2026 1 min read

DeepClaude: Run Claude Code's Agent Loop with DeepSeek V4 Pro at 17x Less Cost

DeepClaude keeps Claude Code's complete agent loop — file editing, bash, subagent spawning — while routing API calls to DeepSeek V4 Pro or other backends, cutting output token costs from $15/M to $0.87/M.

#claude-code #deepseek #developer-tools

LLM Hacker News May 2, 2026 1 min read

DeepSeek V4: Near-Frontier LLM Performance at a Fraction of the Cost

DeepSeek released DeepSeek-V4-Pro (1.6T total parameters, 49B active) and V4-Flash (284B total, 13B active), both Mixture-of-Experts models with MIT license and 1M token context. V4-Pro is the largest open-weights model released so far, and its pricing at $1.74/M input undercuts GPT-5.4 and Claude Sonnet 4.6 by more than half.

#deepseek #llm #open-weights

LLM Reddit May 1, 2026 2 min read

LocalLLaMA jumped on DeepSeek's visual-primitives idea, then watched the repo vanish

LocalLLaMA reacted hard because DeepSeek's visual-primitives idea makes points and boxes part of reasoning itself, and the repo going private only made the thread hotter.

#deepseek #multimodal #visual-reasoning

AI Apr 27, 2026 2 min read

Washington turns model distillation into diplomacy, with DeepSeek in the crosshairs

This matters because the fight over model copying is no longer staying inside lobbying letters and company blog posts. Reuters reported on April 26 that the U.S. State Department told diplomats worldwide to warn foreign governments about AI models allegedly distilled from U.S. systems, naming DeepSeek and also mentioning Moonshot AI and MiniMax.

#deepseek #distillation #policy

LLM Apr 26, 2026 2 min read

DeepSeek cuts input cache pricing to one-tenth across its full API line

Cache-hit pricing can decide whether long-context assistants are cheap enough to ship. DeepSeek said the entire API series now charges just one-tenth of the old rate for input cache hits, while keeping a 75% off V4-Pro promotion live.

#deepseek #api-pricing #caching

AI X/Twitter Apr 25, 2026 2 min read

LMSYS posts Day-0 DeepSeek-V4 speeds up to 266 tok/s on H200

Why it matters: model launches live or die on serving and training support, not just weights. LMSYS says its Day-0 stack reached 199 tok/s on B200 and 266 tok/s on H200, while staying strong out to 900K context.

#lmsys #deepseek #benchmarks

AI X/Twitter Apr 25, 2026 2 min read

DeepSeek-V4 opens 1M context with 1.6T/49B and 284B/13B split

Why it matters: open models rarely arrive with both giant context claims and deployable model splits. DeepSeek put hard numbers on the release with a 1M-context design, a 1.6T/49B Pro model, and a 284B/13B Flash variant.

#deepseek #open-weights #llm