Researchers have demonstrated that transformer models with fewer than 100 parameters can add two 10-digit numbers with 100% accuracy. The key ingredient is digit tokenization rather than treating numbers as opaque strings — a finding with implications for mathematical reasoning in larger LLMs.
#research
Scientists at Oregon State University engineered a new iron-based nanomaterial that exploits cancer's unique chemistry — its acidity and high hydrogen peroxide levels — to trigger two simultaneous chemical reactions that destroy tumors while leaving healthy tissue unharmed.
Anthropic analyzed millions of real Claude interactions and found the 99.9th percentile session duration nearly doubled to 45+ minutes in 3 months, with software engineering accounting for nearly half of all agentic use.
A highly upvoted r/MachineLearning thread debates whether skyrocketing acceptance rates at top venues like CVPR and ICLR are diluting the academic value of conference publication, raising concerns about review quality.
World Labs, the startup from AI pioneer Fei-Fei Li, has raised $1 billion to scale its spatial intelligence technology. Autodesk leads with $200M, alongside Andreessen Horowitz, Nvidia, and AMD.
A new MIT Technology Review investigation reveals that humanoid robot companies routinely obscure the scale of human teleoperation and data collection labor behind their demos, risking a repeat of AI's early automation-washing scandals.
Unitree, Galbot, Noetix, and MagicLab showcased advanced humanoid robots at the world's most-watched TV event, performing world-first acrobatics including 3m aerial flips, parkour sequences, and kung fu routines — a dramatic leap from last year.
DeepSeek released V4 on Lunar New Year with 1 trillion parameters, 1M-token context windows, and a novel mHC architecture. The open-weight model claims benchmark-topping coding performance at 10–40× lower inference costs than Western frontier models.
Researchers warn that AI-generated fake faces have crossed a critical threshold: they now appear more trustworthy than real human faces, challenging deepfake detection and undermining digital trust.