#arxiv

Sciences Reddit Jun 10, 2026 1 min read

arXiv’s One-Year Ban Warning Puts Verification Ahead of AI Disclosure

The r/artificial thread focused less on banning AI tools and more on author responsibility when unchecked model output reaches the scholarly record.

#arxiv #ai-generated #research-integrity

LLM Hacker News May 16, 2026 1 min read

Δ-Mem: Compact Online Memory State Boosts LLM Long-Term Recall

A new arXiv paper introduces Δ-Mem, a compact fixed-size memory mechanism that augments frozen LLMs with delta-rule learning. It reports a 1.31× improvement on MemoryAgentBench using just an 8×8 state matrix, without retraining the base model.

#memory #attention #llm
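For readers unfamiliar with the term, the classic delta rule can be sketched in a few lines. This is a generic illustration of delta-rule associative memory on an 8×8 state, not the Δ-Mem authors' implementation; the function names `delta_update` and `read`, the `beta` step size, and the unit-norm key are assumptions made for the example.

```python
# Generic delta-rule memory sketch (hypothetical names, not the paper's code).
# State S is an 8x8 matrix; writing (key, value) nudges S so that S @ key ~ value.

DIM = 8  # matches the 8x8 state size mentioned in the teaser


def read(state, key):
    """Return S @ key as a plain list."""
    return [sum(row[j] * key[j] for j in range(len(key))) for row in state]


def delta_update(state, key, value, beta=1.0):
    """Apply S <- S + beta * (value - S @ key) key^T and return the new state."""
    pred = read(state, key)
    err = [v - p for v, p in zip(value, pred)]
    return [
        [state[i][j] + beta * err[i] * key[j] for j in range(len(key))]
        for i in range(len(state))
    ]


# Usage: store one association, then overwrite it with a new value.
state = [[0.0] * DIM for _ in range(DIM)]
key = [1.0] + [0.0] * (DIM - 1)          # unit-norm key (assumption)
first = [float(i) for i in range(DIM)]
second = [float(2 * i) for i in range(DIM)]

state = delta_update(state, key, first)
recalled_first = read(state, key)
state = delta_update(state, key, second)
recalled_second = read(state, key)
```

With a unit-norm key and `beta=1.0`, each write replaces whatever was previously stored under that key, which is the error-correcting behavior that distinguishes the delta rule from a purely additive update.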

AI Reddit May 16, 2026 1 min read

arXiv Bans Authors 1 Year for Papers With Unchecked LLM-Generated Errors

arXiv has begun enforcing a one-year submission ban on authors whose papers contain incontrovertible evidence of unchecked LLM-generated errors such as hallucinated references. The policy marks a firm institutional stance on AI-assisted academic dishonesty.

#arxiv #llm #academic-integrity

Sciences Hacker News Apr 26, 2026 2 min read

HN Debates a Bold Claim: Deep Learning May Finally Be Ready for Theory

Hacker News latched onto this paper because it was not selling a new benchmark or model, but a bigger claim: deep learning may finally be mature enough for a real scientific theory. That mix of excitement and skepticism kept the thread moving.

#deep-learning #theory #learning-mechanics

LLM Apr 17, 2026 2 min read

LLM judges hide instability: 33-67% of documents break consistency

A new arXiv paper shows why low average violation rates can make LLM judges look safer than they are. On SummEval, 33-67% of documents showed at least one directed 3-cycle, and prediction-set width tracked absolute error strongly.

#llm #evaluation #benchmarks
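A directed 3-cycle means a judge prefers A over B, B over C, and C over A for the same document. The check below is a minimal sketch of that idea only; the `(winner, loser)` pair representation and the function name `has_directed_3cycle` are assumptions for illustration, not the paper's evaluation code.

```python
from itertools import combinations


def has_directed_3cycle(items, wins):
    """Return True if some triple forms a preference cycle.

    `wins` is a set of (winner, loser) pairs from pairwise judgments
    (hypothetical representation chosen for this sketch).
    """
    for a, b, c in combinations(items, 3):
        forward = (a, b) in wins and (b, c) in wins and (c, a) in wins
        backward = (a, c) in wins and (c, b) in wins and (b, a) in wins
        if forward or backward:
            return True
    return False


# Usage: one cyclic and one transitive set of judgments over three summaries.
summaries = ["A", "B", "C"]
cyclic = {("A", "B"), ("B", "C"), ("C", "A")}
transitive = {("A", "B"), ("B", "C"), ("A", "C")}

cyclic_result = has_directed_3cycle(summaries, cyclic)
transitive_result = has_directed_3cycle(summaries, transitive)
```

A document counts as inconsistent under this check as soon as a single such triple exists, which is why a per-document rate can be high even when the average per-triple violation rate is low.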

LLM Hacker News Apr 8, 2026 2 min read

MegaTrain turns a Hacker News paper pick into a memory-systems debate about single-GPU LLM training

MegaTrain proposes training 100B+ parameter LLMs at full precision on a single GPU by keeping parameters and optimizer states in host memory and streaming layers through the device. The recent Hacker News interest is notable because the paper reframes the problem as one of memory-system design rather than simple GPU count.

#llm-training #systems #gpu

AI Hacker News Mar 18, 2026 2 min read

Hacker News debates a cognitive-science roadmap for autonomous AI learning

A Hacker News front-page paper from Emmanuel Dupoux, Yann LeCun, and Jitendra Malik argues that current AI still lacks autonomous learning and sketches an architecture built around observation, active behavior, and meta-control.

#autonomous-learning #cognitive-science #ai-architecture

LLM Mar 14, 2026 2 min read

Ares Paper Shows Dynamic Reasoning Can Cut LLM Agent Tokens by Up to 52.7%

The arXiv paper Ares, submitted on March 9, 2026, proposes dynamic per-step reasoning selection for multi-step LLM agents. The authors report up to 52.7% lower reasoning token usage versus fixed high-effort settings with only minimal drops in task success.

#llm-agents #reasoning #efficiency


LLM Reddit Mar 13, 2026 2 min read

r/singularity highlights a paper arguing the LM head wastes most of the training signal

A Reddit thread surfaced arXiv paper 2603.10145, which argues the output layer of language models is not just a softmax expressivity issue but an optimization bottleneck that suppresses 95-99% of gradient norm. The discussion centered on whether better head designs could unlock more efficient LLM training.

#backpropagation #lm-head #optimization

AI Reddit Feb 25, 2026 2 min read

Reddit Highlights H-Neurons Paper Linking Specific Neurons to LLM Hallucination

An r/singularity thread boosted attention on an arXiv paper studying hallucination-associated neurons in LLMs. The authors report that a very small subset of neurons can predict hallucination behavior and may be causally involved.

#hallucination #llm-reliability #arxiv

LLM Reddit Feb 21, 2026 2 min read

Reddit Discusses arXiv 2602.15322: Masked Adaptive Updates (Magma) for LLM Pretraining

A high-engagement r/singularity post pointed to arXiv 2602.15322, which reports that masked adaptive updates and the proposed Magma optimizer can improve 1B-model perplexity versus Adam and Muon with minimal overhead.

#llm-training #optimizers #rmsprop

Sciences Hacker News Feb 16, 2026 2 min read

Towards Autonomous Mathematics Research Hits Hacker News: Aletheia Framed as a Research Agent

A Hacker News thread highlighted arXiv 2602.10177, where DeepMind researchers introduce Aletheia, an agent workflow for mathematics research. The paper claims progress from Olympiad-style reasoning toward PhD-level tasks and semi-autonomous open-problem exploration.

#mathematics #ai-research #agents