#evaluations

LLM Jun 30, 2026 1 min read

Arena turns 10M model votes into a $100M AI-evaluation business

Arena says its commercial AI evaluation service has reached a $100M annualized run rate just eight months after launch. The milestone shows how crowdsourced model preferences are becoming paid infrastructure for labs and enterprises.

#arena #benchmarks #evaluations

AI X/Twitter Mar 18, 2026 1 min read

Google DeepMind turns AGI evaluation into a global Kaggle challenge

Google DeepMind said on X that it is launching a Kaggle hackathon with $200,000 in prizes to build new cognitive evaluations for AI. The linked Google post says the effort is part of a broader framework for measuring AGI progress across 10 cognitive abilities rather than a single benchmark.

#google-deepmind #kaggle #agi
