r/LocalLLaMA did not just cheer the numbers. The moment 80 tps and a 218k context window appeared, the thread shifted to prompt length, quantization tradeoffs, and whether the vLLM setup really holds up in practice.
#long-context
HN did not latch onto DeepSeek V4 because of a polished launch page. The thread took off when commenters realized the front-page link was just updated docs while the weights and base models were already live for inspection.
A popular r/LocalLLaMA thread described using Gemma 4’s 256k context window to analyze a 100k+ token personal journal locally, turning privacy into a practical reason to run an LLM on-device.
An r/LocalLLaMA stress test claims Gemma 4 26B A4B remained coherent at roughly 94% of a 262,144-token context window in llama.cpp. The post is anecdotal, but it is valuable because it pairs the claim with concrete tuning details and failure modes.
A post on r/MachineLearning argues that LoCoMo’s leaderboard is being treated with more confidence than its evaluation setup deserves. The audit claims the benchmark has a 6.4% ground-truth error rate and that its judge accepts intentionally wrong but topically adjacent answers far too often, turning attention from raw scores to benchmark reliability.
Together Research said on March 27, 2026 that a smaller model using divide-and-conquer can match or outperform GPT-4o on long-context tasks, with the work accepted at ICLR 2026. Together's blog and the arXiv paper say the method uses a planner-worker-manager pipeline and explain long-context failures in terms of task, model, and aggregator noise.
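The planner-worker-manager framing can be sketched structurally. This is a toy illustration under loose assumptions, not the paper's implementation: the real workers and manager are LLM calls, while here the "worker" is a deterministic keyword counter and the chunking heuristic is invented for the example.

```python
# Structural sketch of a divide-and-conquer long-context pipeline:
# a planner splits the input, workers process chunks independently,
# and a manager aggregates partial results. All names and the
# chunk-size heuristic are illustrative assumptions.

def planner(document: str, chunk_size: int = 150) -> list[str]:
    """Split the long input into worker-sized chunks."""
    return [document[i:i + chunk_size]
            for i in range(0, len(document), chunk_size)]

def worker(chunk: str, query: str) -> int:
    """Toy worker: count query hits in one chunk.
    In the real pipeline this step is an LLM call per chunk."""
    return chunk.lower().count(query.lower())

def manager(partials: list[int]) -> int:
    """Aggregate worker outputs. Errors introduced at this step are
    what the paper's framing calls aggregator noise."""
    return sum(partials)

doc = "error " * 50 + "ok " * 50
hits = manager([worker(c, "error") for c in planner(doc)])
print(hits)  # 50
```

Note that a careless planner can itself inject error: if a chunk boundary splits a match across two chunks, no worker sees it, which is a concrete instance of task noise in this decomposition.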
Anthropic says 1M context is now generally available for Opus 4.6 and Sonnet 4.6 with standard pricing, no long-context premium, and media limits expanded to 600 images or PDF pages. Hacker News treated the announcement as a practical deployment story rather than a simple spec bump.
Azure posted on March 14, 2026 that Claude Opus 4.6 and Sonnet 4.6 now support 1M-token context in Microsoft Foundry with flat pricing and higher media limits. Microsoft and Anthropic documentation confirm the 1M window, 600 image/PDF-page cap, and standard pricing across the full context range.
A Hacker News discussion highlighted LoGeR, a Google DeepMind and UC Berkeley project that uses hybrid memory to scale dense 3D reconstruction across extremely long videos without post-hoc optimization.
A high-engagement r/LocalLLaMA post surfaced the Qwen3.5-35B-A3B model card on Hugging Face. The card emphasizes MoE efficiency, long context handling, and deployment paths across common open-source inference stacks.