OmniCoder-9B packages agent-style coding behavior into a smaller open model by training on more than 425,000 curated trajectories from real tool-using workflows.
A new paper discussed in r/MachineLearning argues that unofficial model-access providers can quietly substitute models and distort both research and production results.
A post in r/MachineLearning argues that duplicating a specific seven-layer block inside Qwen2-72B improved benchmark performance without changing any weights.
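As a rough illustration of what "duplicating a block without changing weights" can mean, the sketch below repeats a contiguous run of layers by reference in a plain Python list. The function name `duplicate_block`, the start index, and the toy 12-layer stack are assumptions for illustration only; the post's actual layer indices and tooling are not given here.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def duplicate_block(layers: Sequence[T], start: int, length: int = 7) -> List[T]:
    """Return a new layer order in which layers[start:start+length] runs twice.

    The repeated entries are the same objects as the originals, so no
    weights are copied or modified; only the forward-pass order changes.
    """
    if start < 0 or length <= 0 or start + length > len(layers):
        raise ValueError("block must lie inside the layer stack")
    end = start + length
    block = list(layers[start:end])
    # Original stack up to the end of the block, then the block again,
    # then the remaining layers.
    return list(layers[:end]) + block + list(layers[end:])


# Toy stand-in for a layer stack (hypothetical: 12 layers, block starts at 3).
toy_layers = [object() for _ in range(12)]
expanded = duplicate_block(toy_layers, start=3)
```

In this toy setup, `expanded` has 19 entries and `expanded[10]` is the very same object as `expanded[3]`, which is the sense in which the weights are shared rather than retrained.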
OneCLI proposes a proxy-and-vault pattern for AI agents so tools stay reachable while real credentials remain outside the model runtime.
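A minimal sketch of the general proxy-and-vault idea follows. It is not OneCLI's API: the `Vault` and `Proxy` classes, the `placeholder:` token scheme, and the header layout are all hypothetical. The agent only ever holds a placeholder, and the proxy swaps in the real credential just before the request leaves.

```python
from typing import Dict


class Vault:
    """Hypothetical secret store that lives outside the model runtime."""

    def __init__(self) -> None:
        self._secrets: Dict[str, str] = {}

    def put(self, name: str, secret: str) -> None:
        self._secrets[name] = secret

    def get(self, name: str) -> str:
        # Raises KeyError for unknown names instead of forwarding a bad token.
        return self._secrets[name]


class Proxy:
    """Hypothetical proxy that rewrites placeholder credentials on the way out."""

    PREFIX = "Bearer placeholder:"

    def __init__(self, vault: Vault) -> None:
        self.vault = vault

    def forward(self, headers: Dict[str, str]) -> Dict[str, str]:
        # Work on a copy so the agent-side headers never contain the secret.
        outgoing = dict(headers)
        auth = outgoing.get("Authorization", "")
        if auth.startswith(self.PREFIX):
            name = auth[len(self.PREFIX):]
            outgoing["Authorization"] = "Bearer " + self.vault.get(name)
        return outgoing


vault = Vault()
vault.put("github", "real-token")  # made-up secret for the example
proxy = Proxy(vault)

agent_headers = {"Authorization": "Bearer placeholder:github"}
upstream_headers = proxy.forward(agent_headers)
```

Here `upstream_headers` carries the real token while `agent_headers` is left untouched, so anything the model can read or log contains only the placeholder.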
Anthropic has added inline interactive visuals to Claude, and Hacker News users are treating it as a real workflow upgrade for analysis and explanation rather than a cosmetic demo.
Adobe has put Photoshop AI Assistant into public beta on web and mobile and expanded Firefly Image Editor with new generative editing tools. Announced on March 10, 2026, the release also turns Firefly into a multi-model workspace that includes image models from Adobe, OpenAI, Google, Runway, and Black Forest Labs.
NIST says AI 800-3 gives evaluators a clearer statistical framework by separating benchmark accuracy from generalized accuracy and by introducing generalized linear mixed models for uncertainty estimation. The February 19, 2026 report argues that many current benchmark comparisons hide assumptions that can distort procurement, development, and policy decisions.
Google Research says its March 12, 2026 rollout adds urban flash flood forecasts to Flood Hub with up to 24 hours of advance notice. The system is trained in part on Groundsource, a dataset built by using Gemini to extract structured flood events from public news reports.
Google Research says a prospective study with Beth Israel Deaconess Medical Center found AMIE could operate with zero safety stops, strong diagnostic performance, and improved patient trust under live physician oversight. Published on March 11, 2026, the work is an early real-world test of conversational diagnostic AI inside a primary care workflow.
AI at Meta says it is open-sourcing CHMv2, a high-resolution global forest canopy mapping model built with the World Resources Institute. Meta says the release uses DINOv3 Sat-L for satellite imagery and improves accuracy, detail, and global consistency.
Meta says custom silicon is critical to scaling next-generation AI and has published a roadmap update for its MTIA family. The company says it accelerated development enough to release four generations in two years as model architectures keep changing faster than traditional chip cycles.
OpenAI introduced a new evaluation suite and research paper on Chain-of-Thought (CoT) controllability. The company says GPT-5.4 Thinking shows little ability to obscure its reasoning, which supports continued use of CoT monitoring as a safety signal.