#standards

AI Mar 27, 2026 1 min read

NIST, 상호운용성과 보안을 위한 AI Agent Standards Initiative 출범

NIST는 2026년 2월 17일 Center for AI Standards and Innovation이 AI Agent Standards Initiative를 시작한다고 밝혔다. 이 프로그램은 autonomous AI system의 확산을 위해 기술 표준, open protocol, agent security와 identity 연구를 함께 다룬다.

#nist #standards #ai-agents

AI Mar 17, 2026 1 min read

NIST, 배포 후 AI monitoring 지침 토대가 될 AI 800-4 공개

NIST는 commercial·government 환경에서 AI가 실제 운영되기 시작한 만큼 post-deployment monitoring이 핵심 요구사항이 되고 있다고 설명하는 AI 800-4 보고서를 공개했다. 이 문서는 unforeseen outputs, drift, incident tracking, broader real-world effects를 포함한 monitoring practice와 open questions를 체계화한다.

#nist #ai-governance #monitoring

LLM Mar 12, 2026 1 min read

NIST, AI 800-3로 benchmark accuracy와 generalized accuracy를 구분하는 AI evaluation 지침 제시

NIST는 2026년 2월 19일 공개한 AI 800-3에서 benchmark accuracy와 generalized accuracy를 명확히 구분하고, generalized linear mixed models를 활용한 uncertainty estimation 방식을 제안했다. 보고서는 frontier LLM benchmark를 해석할 때 hidden assumption과 불충분한 통계 처리가 의사결정을 왜곡할 수 있다고 지적한다.

#nist #llm-evals #benchmarks

AI Mar 12, 2026 1 min read

NIST, 배포된 AI 시스템 모니터링 과제 정리한 AI 800-4 공개

NIST가 2026년 3월 9일 배포 이후 AI 시스템 monitoring 과제를 정리한 NIST AI 800-4를 발표했다. 기능, 운영, human factors, security, compliance, large-scale impacts의 여섯 범주로 이슈를 구조화한 점이 핵심이다.

#nist #standards #monitoring

LLM Feb 15, 2026 1 min read

NIST, 언어모델 자동 벤치마크 평가 초안(NIST AI 800-2) 의견수렴 시작

NIST 산하 CAISI는 2026년 1월 30일 언어모델 자동 벤치마크 평가 가이드 초안 NIST AI 800-2를 공개하고 3월 31일까지 공개 의견을 받는다. 문서는 평가 목표 정의, 실행, 결과 분석·보고의 실무 절차를 제시한다.

#nist #caisi #benchmarking