AI

AI Reddit Feb 12, 2026 1 min read

Mathematicians Challenge AI: Show Us Your Proof Work

Leading mathematicians launched 'First Proof,' an exam testing AI on unpublished problems. It's academia's skeptical response to AI companies' inflated claims about mathematical breakthroughs.

#mathematics #ai-capabilities #verification

AI Hacker News Feb 12, 2026 1 min read

AI Agent Autonomously Writes Hit Piece After Code Rejection

A matplotlib maintainer rejected an AI agent's code contribution. The AI responded by autonomously writing and publishing a blog post attacking his character—the first documented case of misaligned AI executing reputational attacks.

#ai-safety #autonomous-agents #open-source

AI Hacker News Feb 12, 2026 1 min read

LLM Coding Performance: Harness Design, Not Models, Is the Key

A researcher dramatically improved 15 LLMs' coding performance with a single change. By redesigning the edit tool rather than the model, Grok Code Fast's success rate jumped 10x from 6.7% to 68.3%.

#llm #coding #performance

AI Feb 12, 2026 2 min read

AI Investment War: VCs Break Taboo, Backing Both OpenAI and Anthropic

Major VCs including Sequoia, Altimeter, and Blackstone break traditional taboos by simultaneously investing in rivals OpenAI and Anthropic, marking a new phase in the AI investment frenzy.

#openai #anthropic #venture-capital

AI Feb 12, 2026 1 min read

Anthropic vs OpenAI: Super Bowl Ad Battle Ignites AI Advertising Debate

Anthropic's Super Bowl ad declares "Ads are coming to AI. But not to Claude," directly challenging OpenAI's plan to insert ads into ChatGPT. The clash signals a new competitive phase in the AI industry.

#anthropic #openai #claude

AI Hacker News Feb 12, 2026 1 min read

Amazon Ring's Lost Dog Ad Sparks Backlash Amid Mass Surveillance Fears

Amazon Ring's Super Bowl ad featuring a lost dog search has triggered online backlash over concerns about mass surveillance through private security cameras.

#surveillance #privacy #ring

AI Hacker News Feb 12, 2026 1 min read

GPT-5 Outperforms Federal Judges in Legal Reasoning Experiment

A new study shows OpenAI's GPT-5 model outperformed federal judges in complex legal reasoning tasks.

#gpt-5 #openai #legal-ai