#ai-research

AI X/Twitter Jun 5, 2026 1 min read

Claude data points to 52x AI-research speedups inside Anthropic

AI self-improvement is moving from speculation into measurable lab workflow data. Anthropic says Mythos Preview reached about 52x speedups on an optimization task and beat human next-step choices 64% of the time.

#anthropic #claude #ai-research

Sciences X/Twitter May 22, 2026 1 min read

OpenAI Model Disproves 80-Year-Old Erdős Geometry Conjecture

An OpenAI general-purpose reasoning model has independently solved the planar unit distance problem — a famous open geometry question posed by Paul Erdős in 1946. External mathematicians verified the proof, marking the first time AI has autonomously solved a major open problem in mathematics.

#openai #mathematics #erdos

Sciences X/Twitter May 21, 2026 2 min read

OpenAI Model Becomes First AI to Autonomously Solve a Major Open Math Problem

An OpenAI general-purpose reasoning model independently disproved the Erdős unit distance conjecture — a central problem in discrete geometry open since 1946. This marks the first time in history that an AI has autonomously solved a prominent open math problem, verified by independent mathematicians including Princeton's Noga Alon.

#openai #mathematics #ai-research

AI X/Twitter May 7, 2026 1 min read

Google DeepMind Partners with EVE Online to Research AI Agent Memory and Long-Term Planning

Google DeepMind announced a research partnership with CCP Games, the developer of EVE Online, to use the game's complex player-driven universe as a sandbox for advancing AI research in memory, continual learning, and long-term planning.

#google-deepmind #ai-research #games

AI Reddit May 5, 2026 1 min read

Anthropic Co-Founder: 30% Chance AI Automates AI Research by End of 2027

Jack Clark, Anthropic co-founder, estimates a ~30% chance AI research becomes substantially automated by end of 2027 and ~60%+ by end of 2028, arguing AI doesn't need genius-level creativity to self-improve.

#anthropic #ai-research #automation

LLM Reddit May 3, 2026 1 min read

GPT-5.4 Pro Math Proof Method Cracks Another 60-Year-Old Erdos Conjecture

The technique GPT-5.4 Pro used to solve Erdos Problem 1196 has been applied to other problems, including another conjecture unsolved for 60 years.

#gpt-5 #mathematics #ai-research

Sciences Reddit Apr 29, 2026 2 min read

r/singularity read the new Erdos proof as a test of whether LLMs can make a genuinely new move

The subreddit jumped straight past the headline and into the hard question: was this finally something other than pattern replay? A Scientific American report on a 23-year-old using GPT-5.4 Pro on a 60-year-old Erdos problem sparked debate over novelty, expert cleanup, and whether messy model output can still contain a real mathematical idea.

#mathematics #gpt-5.4 #erdos-problems

LLM Apr 26, 2026 2 min read

Claude agents closed 186 office deals in Anthropic's market test

Why it matters: AI agents are moving from chat demos into delegated economic work. In Anthropic’s office-market experiment, 69 agents closed 186 deals across more than 500 listings and moved a little over $4,000 in goods.

#anthropic #claude #agents

Sciences Reddit Mar 3, 2026 1 min read

ChatGPT Discovers Surprising Insight in Particle Physics, Sparking Scientific Interest

A study published in Science journal found that ChatGPT surfaced a surprising insight in particle physics research that human scientists had missed, raising new questions about AI's role in scientific discovery.

#chatgpt #particle-physics #science

105

AI X/Twitter Feb 24, 2026 1 min read

Anthropic Introduces 'Persona Selection Model' Theory to Explain AI's Human-Like Behavior

Anthropic published a new theory explaining why AI assistants like Claude express emotions and use anthropomorphic language—proposing that models select from personas inherited from fictional characters during training.

#anthropic #claude #ai-research

114

Sciences Hacker News Feb 16, 2026 2 min read

Towards Autonomous Mathematics Research Hits Hacker News: Aletheia Framed as a Research Agent

A Hacker News thread highlighted arXiv 2602.10177, where DeepMind researchers introduce Aletheia, an agent workflow for mathematics research. The paper claims progress from Olympiad-style reasoning toward PhD-level tasks and semi-autonomous open-problem exploration.

#mathematics #ai-research #agents