#ai-agents

AI Hacker News Mar 2, 2026 1 min read

WebMCP Early Preview: A New Web Standard for AI Agents

Google's Chrome team has released an early preview of WebMCP, a new web standard enabling direct communication between websites and AI agents. Site owners can now explicitly define how AI agents interact with their services, replacing unreliable DOM scraping with structured APIs.

#webmcp #ai-agents #mcp

AI Hacker News Feb 28, 2026 2 min read

HN Examines "Don’t Trust AI Agents" Architecture: Per-Agent Containers Over App-Level Guards

A February 28, 2026 Hacker News thread discussed NanoClaw’s security model, emphasizing untrusted-agent assumptions, per-agent isolation, and limits of prompt-level safeguards.

#ai-agents #security #sandboxing

AI X/Twitter Feb 26, 2026 1 min read

Perplexity Unveils Perplexity Computer, a Multi-Model Agent System for End-to-End Project Workflows

Perplexity announced Perplexity Computer on 2026-02-25 and described it as a system that can research, design, code, deploy, and manage projects end-to-end. In the same X thread, the company said the product routes work across 19 models and launches first for Max subscribers on web.

#perplexity #perplexity-computer #ai-agents

AI X/Twitter Feb 24, 2026 1 min read

Anthropic Study: AI Agents Are Rapidly Gaining Autonomy in Real-World Deployments

Anthropic analyzed millions of real Claude interactions and found the 99.9th percentile session duration nearly doubled to 45+ minutes in 3 months, with software engineering accounting for nearly half of all agentic use.

#anthropic #ai-agents #autonomy

117

AI X/Twitter Feb 24, 2026 1 min read

OpenAI Launches EVMbench: New Standard for Measuring AI Agents in Smart Contract Security

OpenAI introduced EVMbench, a new benchmark measuring how well AI agents can detect, exploit, and patch high-severity smart contract vulnerabilities in EVM-based blockchains.

#openai #benchmark #smart-contracts

100

AI Reddit Feb 22, 2026 1 min read

40,000+ AI Agents Exposed to the Internet with Full System Access

SecurityScorecard's STRIKE team found 40,214 OpenClaw AI agent instances exposed to the public internet with no authentication. Over 12,000 are vulnerable to Remote Code Execution, and attackers who compromise them inherit full system access including SSH keys, browser sessions, and filesystem control.

#ai-agents #security #openclaw

LLM Feb 22, 2026 1 min read

ByteDance Launches Doubao 2.0 — Frontier-Level AI at One-Tenth the Cost

ByteDance released Doubao 2.0 ahead of Lunar New Year, claiming GPT-5.2 and Gemini 3 Pro parity with 98.3 on AIME 2025, a 3020 Codeforces rating, and pricing 10x cheaper than Western rivals.

#bytedance #llm #product-launch

LLM Reddit Feb 22, 2026 1 min read

Claude Opus 4.6 Hits 14.5-Hour Mark on METR's Software Task Benchmark

Claude Opus 4.6 achieved a 50%-time-horizon of approximately 14.5 hours on METR's software task benchmark — beating all predictions and suggesting a doubling time of under 3 months for AI task capabilities.

#claude #anthropic #metr

113

LLM Hacker News Feb 22, 2026 1 min read

Karpathy: "Claws" Are a New Layer on Top of LLM Agents

Andrej Karpathy coined a new term for OpenClaw-like AI agent systems: "Claws." Just as LLM agents were a new layer on top of LLMs, Claws provide orchestration, scheduling, persistent context, and tool calls on top of LLM agents.

#llm-agents #karpathy #openclaw

AI Hacker News Feb 21, 2026 2 min read

HN Focus: Anthropic Quantifies Real-World AI Agent Autonomy in Claude Code and API Traffic

A high-signal Hacker News thread highlighted Anthropic's February 18, 2026 analysis of millions of agent interactions. The report tracks growing practical autonomy, evolving human oversight behavior, and early but rising higher-risk usage patterns.

#ai-agents #anthropic #claude-code

119

LLM Hacker News Feb 17, 2026 2 min read

Hacker News Spotlights Docker Shell Sandboxes for Safer NanoClaw Agent Deployments

A Docker guide on running NanoClaw inside a Shell Sandbox reached 102 points on Hacker News, highlighting a practical pattern for isolating agent runtime, limiting filesystem exposure, and keeping API keys out of the guest environment.

#docker #sandboxing #ai-agents

Sciences Feb 16, 2026 2 min read

Anthropic Partners with Allen Institute and HHMI to Accelerate Scientific Discovery

Anthropic announced on February 2, 2026 that it is partnering with the Allen Institute and Howard Hughes Medical Institute (HHMI) on AI-enabled life-science workflows. The stated goal is to reduce analysis bottlenecks and improve transparent, interpretable scientific reasoning.

#anthropic #life-sciences #ai-agents

104