#mathematics

AI Twitter Apr 14, 2026 2 min read

EinsteinArena lifts a Newton-era math bound from 593 to 604

This is the kind of numeric jump that makes multi-agent research hard to ignore. Together says EinsteinArena agents raised the 11-dimensional kissing number lower bound from 593 to 604 and had already logged 11 new SOTA results on open problems by April 11.

#agents #open-science #mathematics

Sciences Hacker News Mar 31, 2026 2 min read

Hacker News Highlights a Continuous-Time Route from RL to Diffusion Models

A March 28 essay on the Hamilton-Jacobi-Bellman equation drew Hacker News attention by showing how continuous-time control theory connects reinforcement learning, optimal control, and diffusion models.

#reinforcement-learning #diffusion-models #optimal-control

Sciences Mar 28, 2026 2 min read

Google DeepMind says Gemini Deep Think is moving into scientific research workflows

Google DeepMind said on February 11, 2026 that Gemini Deep Think is being used on professional research problems across mathematics, physics, and computer science. The company cited its Aletheia math agent, a score of up to 90% on IMO-ProofBench Advanced, and collaborations on 18 research problems as evidence that AI is moving from benchmark performance toward real scientific workflow support.

#google-deepmind #gemini #scientific-research

Sciences Hacker News Mar 24, 2026 1 min read

Hacker News debates Epoch’s FrontierMath solve confirmation for GPT-5.4 Pro

A heavily discussed HN post focused on Epoch AI’s confirmation that GPT-5.4 Pro helped solve one FrontierMath Open Problems combinatorics challenge, shifting attention from benchmark scores toward expert-verified research workflows.

#frontiermath #gpt-5.4 #mathematics

LLM Mar 16, 2026 2 min read

OpenAI shares First Proof submissions for all 10 research-level math problems

OpenAI said on February 20, 2026 that its theorem-proving model produced proof attempts for all 10 research-level First Proof problems. After expert feedback, the company believes at least five attempts are likely correct, while some remain under review and the attempt for problem 2 now appears incorrect.

#openai #theorem-proving #reasoning

Sciences Reddit Mar 11, 2026 2 min read

r/singularity Spots a Real Math Result in Claude Opus 4.6

A high-scoring r/singularity post pointed readers to Donald Knuth’s note "Claude’s Cycles", where he says Claude Opus 4.6 helped solve an open combinatorics problem that arose while he was preparing a future TAOCP volume.

#automated-reasoning #combinatorics #donald-knuth

Sciences Mar 8, 2026 2 min read

Google DeepMind says Gemini Deep Think is moving from Olympiad benchmarks into math, physics, and CS research

Google DeepMind said on February 11, 2026 that Gemini Deep Think is now helping tackle professional problems in mathematics, physics, and computer science under expert supervision. The company tied the claim to two fresh papers, a research agent called Aletheia, and examples ranging from autonomous math results to work on algorithms, optimization, economics, and cosmic-string physics.

#google-deepmind #gemini #research

AI Reddit Mar 3, 2026 1 min read

Google DeepMind's Aletheia Autonomously Solves 6 Research-Level Math Problems

Google DeepMind's Aletheia AI research agent solved 6 out of 10 open research-level math problems in the First Proof challenge, as judged by expert mathematicians. The system also generated a fully autonomous research paper and solved 4 open conjectures from Bloom's Erdős database.

#google-deepmind #aletheia #mathematics

LLM Hacker News Mar 3, 2026 1 min read

Claude Opus 4.6 Solves Don Knuth's Open Math Problem

Anthropic's Claude Opus 4.6 independently solved a directed Hamiltonian cycle decomposition problem that computer science legend Donald Knuth had spent weeks working on. Knuth documented the achievement in a formal Stanford paper, marking one of the first times a top-tier computer scientist has formally credited an LLM with solving a genuine research problem.

#claude #knuth #mathematics

Sciences Hacker News Feb 16, 2026 2 min read

"Towards Autonomous Mathematics Research" Hits Hacker News: Aletheia Framed as a Research Agent

A Hacker News thread highlighted arXiv 2602.10177, where DeepMind researchers introduce Aletheia, an agent workflow for mathematics research. The paper claims progress from Olympiad-style reasoning toward PhD-level tasks and semi-autonomous open-problem exploration.

#mathematics #ai-research #agents

Sciences Feb 14, 2026 2 min read

Google DeepMind Details Gemini Deep Think Progress in Math and Science Research

Google DeepMind published new results on February 11, 2026 showing Gemini Deep Think workflows for mathematics, physics, and computer science research. The post outlines two new papers, evaluation benchmarks, and agent-assisted verification methods.

#deepmind #gemini #scientific-research

AI Reddit Feb 12, 2026 1 min read

Mathematicians Challenge AI: Show Us Your Proof Work

Leading mathematicians launched 'First Proof,' an exam testing AI on unpublished problems. It's academia's skeptical response to AI companies' inflated claims about mathematical breakthroughs.

#mathematics #ai-capabilities #verification