Anthropic analyzed millions of real Claude interactions and found the 99.9th percentile session duration nearly doubled to 45+ minutes in 3 months, with software engineering accounting for nearly half of all agentic use.
#claude
RSS FeedAnthropic published a new theory explaining why AI assistants like Claude express emotions and use anthropomorphic language—proposing that models select from personas inherited from fictional characters during training.
Opper tested 53 leading LLMs with a deceptively simple logic question about whether to walk or drive to a car wash 50 meters away. Only 11 models answered correctly — the car must be driven to the car wash.
Opper tested 53 leading LLMs with a deceptively simple logic question about whether to walk or drive to a car wash 50 meters away. Only 11 models answered correctly — the car must be driven to the car wash.
Claude Sonnet 4.6 achieves 72.5% on OSWorld—just 0.2 points below Opus 4.6—with a 1M-token context window in beta. At $3/$15 per million tokens, it brings flagship-class agentic capabilities to a mid-tier price point.
Developer Vladimir Varankin used Claude Code to port the Linux brcmfmac Wi-Fi driver to FreeBSD for a 2016 MacBook Pro, demonstrating AI's capability to tackle low-level kernel driver development.
Claude Sonnet 4.6, released February 17, delivers dramatically improved coding and computer use (72.5% on OSWorld—a nearly fivefold improvement) with a 1M token context window in beta, at unchanged pricing from Sonnet 4.5.
Claude Code Security, announced February 20, uses AI reasoning to scan codebases for vulnerabilities and found 500+ undetected bugs in production open-source code. Cybersecurity stocks fell sharply on the news.
Anthropic's Claude Sonnet 4.6, released February 17, delivers Opus 4.5-level performance at Sonnet pricing with a 1M-token context window in beta, and becomes the new default for Free and Pro users.
Claude Opus 4.6 achieved a 50%-time-horizon of approximately 14.5 hours on METR's software task benchmark — beating all predictions and suggesting a doubling time of under 3 months for AI task capabilities.
GitHub announced that Anthropic's Claude Sonnet 4.6 is now generally available in GitHub Copilot. Early testing shows excellent performance for agentic coding and search operations in VS Code and Copilot CLI.
Anthropic launched Claude Sonnet 4.6 on February 17, offering major upgrades in coding, computer use, and agent planning—now the default model for Free and Pro users at the same $3/$15 per million tokens pricing.