This is the kind of numeric jump that makes multi-agent research hard to ignore. Together says EinsteinArena agents raised the 11-dimensional kissing number lower bound from 593 to 604 and had already logged 11 new SOTA results on open problems by April 11.
#mathematics
A March 28 essay on the Hamilton-Jacobi-Bellman equation drew Hacker News attention by showing how continuous-time control theory connects reinforcement learning, optimal control, and diffusion models.
Google DeepMind said on February 11, 2026 that Gemini Deep Think is being used on professional research problems across mathematics, physics, and computer science. The company highlighted its Aletheia math agent, a score of up to 90% on IMO-ProofBench Advanced, and collaborations on 18 research problems as evidence that AI is moving from benchmark performance toward real scientific workflow support.
A heavily discussed HN post focused on Epoch AI’s confirmation that GPT-5.4 Pro helped solve one FrontierMath Open Problems combinatorics challenge, shifting attention from benchmark scores toward expert-verified research workflows.
OpenAI said on February 20, 2026 that its theorem-proving model produced proof attempts for all 10 research-level First Proof problems. After expert feedback, the company believes at least five attempts are likely correct, while some remain under review and the attempt for problem 2 now appears incorrect.
A high-scoring r/singularity post pointed readers to Donald Knuth’s note “Claude’s Cycles,” where he says Claude Opus 4.6 helped solve an open combinatorics problem that arose while he was preparing a future TAOCP volume.
Google DeepMind said on February 11, 2026 that Gemini Deep Think is now helping tackle professional problems in mathematics, physics, and computer science under expert supervision. The company tied the claim to two fresh papers, a research agent called Aletheia, and examples ranging from autonomous math results to work on algorithms, optimization, economics, and cosmic-string physics.
Google DeepMind's Aletheia AI research agent solved 6 of 10 open research-level math problems in the First Proof challenge, as judged by expert mathematicians. The system also generated a fully autonomous research paper and solved 4 open conjectures from Bloom's Erdős database.
Anthropic's Claude Opus 4.6 independently solved a directed Hamiltonian cycle decomposition problem that computer science legend Donald Knuth had spent weeks working on. Knuth documented the achievement in a formal Stanford paper, marking one of the first times a top-tier computer scientist has formally credited an LLM with solving a genuine research problem.
A Hacker News thread highlighted arXiv 2602.10177, where DeepMind researchers introduce Aletheia, an agent workflow for mathematics research. The paper claims progress from Olympiad-style reasoning toward PhD-level tasks and semi-autonomous open-problem exploration.
Google DeepMind published new results on February 11, 2026 showing Gemini Deep Think workflows for mathematics, physics, and computer science research. The post outlines two new papers, evaluation benchmarks, and agent-assisted verification methods.
Leading mathematicians launched 'First Proof,' an exam testing AI on unpublished problems. It's academia's skeptical response to AI companies' inflated claims about mathematical breakthroughs.