#nist

AI May 6, 2026 1 min read

NIST・CAISI、Google DeepMind・Microsoft・xAIと事前AI安全評価協定を締結

米国商務省NIST傘下のCAISI（旧AISI）が2026年5月5日、Google DeepMind・Microsoft・xAIとフロンティアAIモデルの公開前国家安全保障評価に関する協定を締結した。サイバー・バイオ・化学兵器リスクを中心に評価する。

#ai-safety #regulation #nist

AI Hacker News Apr 20, 2026 1 min read

NISTのCVE triageを、HNはsecurity metadata debtとして読んだ

HNの論点は「全CVEを同じ重みで扱え」ではなく、NVD enrichmentが選択制になる時代に、severityとproduct metadataを誰が補うのかだった。

#security #cve #nist

AI X/Twitter Apr 1, 2026 1 min read

Perplexity、frontier AI security研究のための Secure Intelligence Institute を立ち上げ

Perplexityは2026年3月31日、leading-edge AI systemのsecurity、trustworthiness、practical defenseを研究するSecure Intelligence Instituteを立ち上げると発表した。Instituteページは、数百万ユーザーと数千enterpriseを支える運用経験を基盤に、PurdueのNinghui Li教授が率い、BrowseSafeやNIST関連のAI agent security論文を初期成果として示している。

#perplexity #ai-security #agents

AI Mar 27, 2026 1 min read

NIST、相互運用性と安全性に向けAI Agent Standards Initiativeを始動

NISTは2026年2月17日、Center for AI Standards and InnovationがAI Agent Standards Initiativeを開始すると発表した。この取り組みはautonomous AI systemの普及に向け、技術標準、open protocol、agent securityとidentityの研究を同時に進める。

#nist #standards #ai-agents

AI Mar 17, 2026 1 min read

NIST、deployed AI systems の monitoring を扱う AI 800-4 を公表

NISTは、commercial・government環境でAIが実運用に入るなか、post-deployment monitoringが中核要件になりつつあると論じるAI 800-4を公表した。文書はunforeseen outputs、drift、incident tracking、broader real-world effectsを含むmonitoring practiceとopen questionsを整理している。

#nist #ai-governance #monitoring

LLM Mar 12, 2026 1 min read

NIST、AI 800-3で benchmark accuracy と generalized accuracy を分ける evaluation指針を提示

NISTは2026年2月19日に公開したAI 800-3で、benchmark accuracyとgeneralized accuracyを明確に区別し、generalized linear mixed modelsによるuncertainty estimationを提案した。報告書は、frontier LLM benchmarkの解釈において hidden assumption や不十分な統計処理が意思決定を歪め得ると指摘している。

#nist #llm-evals #benchmarks

AI Mar 12, 2026 1 min read

NIST、配備後AIシステム監視の課題を整理したAI 800-4を公表

NISTが2026年3月9日、配備後のAIシステムmonitoring課題を整理したNIST AI 800-4を公表した。機能、運用、human factors、security、compliance、large-scale impactsの6分類で論点を構造化している。

#nist #standards #monitoring

LLM Feb 15, 2026 1 min read

NIST、言語モデル自動ベンチマーク評価ガイド草案への意見募集を開始

NISTのCAISIは、言語モデル向け自動ベンチマーク評価の草案 NIST AI 800-2 を公表し、2026年3月31日まで意見募集を実施している。草案は目的設定、実行手順、結果分析・報告の標準化を中心に構成される。

#nist #caisi #benchmarking