NVIDIA introduced OpenShell on March 23, 2026. The company says the open source runtime isolates each autonomous agent in its own sandbox and keeps policy enforcement at the infrastructure layer instead of relying only on model or application safeguards.
LLM
RSS FeedMicrosoft Research announced the 15 billion parameter open-weight model Phi-4-reasoning-vision-15B on March 4, 2026. The lab says the release is designed to deliver stronger multimodal reasoning, math and science performance, and computer-use ability without the compute profile of much larger systems.
OpenAI said on X on March 17, 2026 that GPT-5.4 mini was available in ChatGPT, Codex, and the API. The launch positions mini as a faster coding and multimodal workhorse, while OpenAI’s accompanying post also introduces GPT-5.4 nano for cheaper API-only workloads.
A busy LocalLLaMA thread followed David Noel Ng’s RYS II results, which argue that repeated mid-stack transformer layers can still improve Qwen3.5-27B and that hidden states may align more by meaning than by surface language.
A technical LocalLLaMA thread translated the FlashAttention-4 paper into practical deployment guidance, emphasizing huge Blackwell gains, faster Python-based kernel development, and the fact that most A100 or consumer-GPU users cannot use the full benefits yet.
Perplexity introduced Computer for Enterprise on March 12, 2026 as a managed execution layer for enterprise agent workflows. The company says it inherits SOC 2 Type II, SAML SSO, audit logs, and admin controls, while keeping browser activity and code execution inside isolated sandbox environments.
Cloudflare said on X on March 19 that Kimi K2.5 is now available on Workers AI. The launch pairs a frontier open-source model with platform features aimed at lowering latency and cost for agent workloads.
OpenAI said on X on March 23 that ChatGPT is getting a new Library for uploaded and created files. The rollout adds reusable file storage, recent-file insertion, and broader document continuity across chats.
A Hacker News thread around Skylar Payne's DSPy post argues that teams often rebuild DSPy-style LLM engineering patterns as systems mature, even though unfamiliar abstractions, Python fit, and eval design still slow direct adoption.
OpenAI introduced the Codex app on February 2, 2026. The macOS desktop interface is built to supervise multiple agents in parallel, manage skills and automations, and was expanded to Windows on March 4, 2026.
Anthropic announced Claude Sonnet 4.6 on February 17, 2026. The release combines a 1M-token context beta, unchanged pricing, and broader upgrades across coding, computer use, and long-context reasoning.
Cloudflare said on March 20, 2026 that Kimi K2.5 is now available on Workers AI so developers can run agents end-to-end on its platform. The linked Cloudflare blog says the model ships with a 256K context window, multi-turn tool calling, vision, and structured outputs, and that one internal agent workload cut costs by 77% after the switch.