Δ-Mem: Compact Online Memory State Boosts LLM Long-Term Recall
Original: Δ-Mem: Efficient Online Memory for Large Language Models
The Problem
LLMs struggle to accumulate and reuse historical information across long conversations and multi-step agent tasks. Expanding the context window is expensive and does not guarantee that the model actually uses distant context; it only makes the window bigger.
The Δ-Mem Approach
Δ-Mem adds a fixed-size state matrix to a frozen LLM backbone. This matrix is updated via delta-rule learning and generates low-rank corrections to the attention computation during generation. The result is effective long-term memory without full model fine-tuning or architectural replacement.
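To make the mechanism concrete, the core idea is a delta-rule write, S ← S + β (v − S·k) kᵀ, where a small state matrix S learns key-value associations online and is read back as a low-rank additive correction to the attention output. Below is a minimal NumPy sketch of that pattern under stated assumptions: the projections W_k, W_v, W_up, the learning rate beta, and all dimensions are illustrative placeholders, not the paper's actual parameterization.

```python
import numpy as np

D_MODEL = 64  # hidden size of the frozen backbone (assumed for illustration)
D_MEM = 8     # side length of the fixed-size memory state, as in the 8x8 state

rng = np.random.default_rng(0)

# Assumed frozen projections that map hidden states into memory keys/values
# and map memory read-outs back to model space.
W_k = rng.standard_normal((D_MODEL, D_MEM)) / np.sqrt(D_MODEL)
W_v = rng.standard_normal((D_MODEL, D_MEM)) / np.sqrt(D_MODEL)
W_up = rng.standard_normal((D_MEM, D_MODEL)) / np.sqrt(D_MEM)

S = np.zeros((D_MEM, D_MEM))  # the compact online memory state
beta = 0.1                    # delta-rule step size (assumed)

def delta_update(S, h):
    """Delta-rule write: S <- S + beta * (v - S k) k^T."""
    k = h @ W_k
    k = k / (np.linalg.norm(k) + 1e-8)  # normalize the key for stability
    v = h @ W_v
    pred = S @ k                        # what the state currently recalls for k
    return S + beta * np.outer(v - pred, k)

def corrected_output(S, h, attn_out):
    """Read the memory and add a rank-limited correction to attention output."""
    recalled = S @ (h @ W_k)            # (D_MEM,) read vector
    return attn_out + recalled @ W_up   # low-rank additive correction

# Toy loop: write each token's hidden state, then read during generation.
for _ in range(16):
    S = delta_update(S, rng.standard_normal(D_MODEL))

h = rng.standard_normal(D_MODEL)
attn_out = rng.standard_normal(D_MODEL)
print(corrected_output(S, h, attn_out)[:4])
```

Because the correction is routed through the D_MEM-dimensional state, its rank is bounded by the memory size, which is why such a small (8×8) state can stay cheap while still injecting accumulated history into every generation step.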
Performance Results
Despite using just an 8×8 online memory state, the reported gains are meaningful: 1.10× over the frozen baseline, 1.15× over non-Δ-Mem baselines on general benchmarks, 1.31× on MemoryAgentBench, and 1.20× on LoCoMo, all while maintaining general capabilities. The efficiency is striking for such a compact mechanism.
Significance
Δ-Mem demonstrates that effective memory can be realized through a compact online state directly coupled with attention, without requiring full model retraining or separate memory modules. This makes it potentially applicable to existing deployed models as an efficient memory augmentation for long-horizon tasks.