GitHub announced a major JetBrains Copilot update on March 11, 2026. Custom agents, sub-agents, and plan agent are now generally available, while agent hooks, MCP auto-approve, and project instruction file support push the IDE further toward full agent workflows.
LLM
RSS FeedOpenAIDevs said on March 16, 2026 that subagents are now available in Codex. The feature lets developers keep the main context clean, split work across specialized agents, and steer individual threads as they run, while the official docs already describe PR review and CSV batch fan-out patterns.
An r/singularity post on March 13, 2026 highlighted Anthropic’s move to make 1M context generally available for Opus 4.6 and Sonnet 4.6, with standard per-token pricing, higher media limits, and automatic support in Claude Code tiers.
A March 12, 2026 LocalLLaMA benchmark post claims the best sustained decode for Qwen3.5-397B NVFP4 on 4x RTX PRO 6000 Blackwell GPUs is 50.5 tok/s with Marlin, because native CUTLASS grouped GEMM paths on SM120 fail or fall back.
OpenAI said on March 10, 2026 that its new IH-Challenge dataset improves instruction hierarchy behavior in frontier LLMs, with gains in safety steerability and prompt-injection robustness. The company also released the dataset publicly on Hugging Face to support further research.
Perplexity said on March 11, 2026 that its new Agent API combines search, tool execution, and multi-model orchestration behind one managed runtime. The launch positions Perplexity less as a single-answer interface and more as infrastructure for production agent workflows.
Perplexity said on March 12, 2026 that Computer is now available to Pro subscribers, widening access beyond its highest tier. The company is pitching 20+ advanced models, prebuilt and custom skills, and hundreds of connectors, while reserving monthly credits and higher spend limits for Max users.
On March 13, 2026, GitHub showed a Copilot SDK and Remotion demo that turns a WhatsApp message into a promo video in about five minutes. GitHub’s official SDK announcement describes the stack as a programmable layer that can plan, invoke tools, edit files, and run commands inside other applications.
GitHub used X on March 15, 2026 to spotlight the Copilot CLI `/fleet` command for routine maintenance work. GitHub’s official Copilot CLI materials now describe `/fleet` as a parallel sub-agent workflow that converges multiple runs into one decision-ready result.
A LocalLLaMA release post presents OmniCoder-9B as a Qwen3.5-9B-based coding agent fine-tuned on 425,000-plus agentic trajectories, with commenters focusing on its read-before-write behavior and usefulness at small model size.
A former Manus backend lead argues that one run(command="...") tool can outperform large typed tool catalogs because CLI patterns fit how LLMs consume text, prompting a debate over flexibility versus sandboxing.
Sebastian Raschka's LLM Architecture Gallery drew attention on HN for turning recent model families into comparable diagrams, making dense, MoE, and hybrid design choices easier to scan in one place.