#nist

AI May 6, 2026 1 min read

미 상무부 CAISI, Google·Microsoft·xAI와 AI 사전 안전 검증 협정

미국 NIST 산하 인공지능표준혁신센터(CAISI)가 5월 5일 Google DeepMind, Microsoft, xAI와 프런티어 AI 모델의 공개 전 국가안보 평가 협정을 체결했다. OpenAI와 Anthropic은 기존 협정을 재협상했다.

#ai-safety #regulation #nist

AI Hacker News Apr 20, 2026 1 min read

NIST의 CVE triage, HN은 security metadata 부채로 읽었다

HN의 쟁점은 “모든 CVE를 똑같이 봐야 한다”가 아니라, NVD가 대부분의 enrichment를 늦출 때 누가 severity와 product metadata를 믿을 만하게 보완하느냐였다.

#security #cve #nist

AI X/Twitter Apr 1, 2026 2 min read

Perplexity, frontier AI security 연구를 위한 Secure Intelligence Institute 출범

Perplexity는 2026년 3월 31일 leading-edge AI system의 security, trustworthiness, practical defense를 연구하는 Secure Intelligence Institute를 출범한다고 밝혔다. Institute 페이지는 이 작업이 수백만 사용자와 수천 enterprise를 지원한 운영 경험에 기반하며, Purdue의 Ninghui Li 교수가 이끌고 BrowseSafe와 NIST 기반 AI agent security 논문 같은 초기 연구 성과도 포함한다고 설명한다.

#perplexity #ai-security #agents

AI Mar 27, 2026 1 min read

NIST, 상호운용성과 보안을 위한 AI Agent Standards Initiative 출범

NIST는 2026년 2월 17일 Center for AI Standards and Innovation이 AI Agent Standards Initiative를 시작한다고 밝혔다. 이 프로그램은 autonomous AI system의 확산을 위해 기술 표준, open protocol, agent security와 identity 연구를 함께 다룬다.

#nist #standards #ai-agents

AI Mar 17, 2026 1 min read

NIST, 배포 후 AI monitoring 지침 토대가 될 AI 800-4 공개

NIST는 commercial·government 환경에서 AI가 실제 운영되기 시작한 만큼 post-deployment monitoring이 핵심 요구사항이 되고 있다고 설명하는 AI 800-4 보고서를 공개했다. 이 문서는 unforeseen outputs, drift, incident tracking, broader real-world effects를 포함한 monitoring practice와 open questions를 체계화한다.

#nist #ai-governance #monitoring

LLM Mar 12, 2026 1 min read

NIST, AI 800-3로 benchmark accuracy와 generalized accuracy를 구분하는 AI evaluation 지침 제시

NIST는 2026년 2월 19일 공개한 AI 800-3에서 benchmark accuracy와 generalized accuracy를 명확히 구분하고, generalized linear mixed models를 활용한 uncertainty estimation 방식을 제안했다. 보고서는 frontier LLM benchmark를 해석할 때 hidden assumption과 불충분한 통계 처리가 의사결정을 왜곡할 수 있다고 지적한다.

#nist #llm-evals #benchmarks

AI Mar 12, 2026 1 min read

NIST, 배포된 AI 시스템 모니터링 과제 정리한 AI 800-4 공개

NIST가 2026년 3월 9일 배포 이후 AI 시스템 monitoring 과제를 정리한 NIST AI 800-4를 발표했다. 기능, 운영, human factors, security, compliance, large-scale impacts의 여섯 범주로 이슈를 구조화한 점이 핵심이다.

#nist #standards #monitoring

LLM Feb 15, 2026 1 min read

NIST, 언어모델 자동 벤치마크 평가 초안(NIST AI 800-2) 의견수렴 시작

NIST 산하 CAISI는 2026년 1월 30일 언어모델 자동 벤치마크 평가 가이드 초안 NIST AI 800-2를 공개하고 3월 31일까지 공개 의견을 받는다. 문서는 평가 목표 정의, 실행, 결과 분석·보고의 실무 절차를 제시한다.

#nist #caisi #benchmarking