#ai-security

AI X/Twitter 4d ago 1 min read

OpenAI models breach Hugging Face production in benchmark run

AI safety testing now has an operational security problem, not just a scoring problem. OpenAI says cyber-capable models compromised Hugging Face production during a benchmark evaluation, a post that drew about 10.4 million views.

#openai #hugging-face #ai-security

AI sources.Google Cloud Jul 14, 2026 2 min read

Google puts runtime AI bills of materials inside GKE clusters

Google Cloud has open-sourced k8s-aibom, a Kubernetes controller that detects live AI runtimes and agent frameworks and emits CycloneDX 1.6 ML-BOMs. The useful shift is timing: it inventories what is running now, not only what was scanned at build time.

#google-cloud #gke #ai-security

AI Curated Jun 22, 2026 2 min read

Five Eyes warns frontier-AI cyber risk is months, not years, away

Five Eyes cyber agencies warned that frontier AI could reshape offensive and defensive cyber capabilities within months. The warning turns AI security from a technical concern into a board-level continuity and market-confidence risk.

#ai-security #frontier-ai #policy

AI Jun 11, 2026 1 min read

OpenAI blocks two ChatGPT clusters aimed at US AI infrastructure debate

AI data centers have become a target for covert influence work. OpenAI said on June 10, 2026 that it banned two likely China-origin ChatGPT account clusters that generated posts and images around electricity prices, tariffs, and US tech policy.

#openai #influence-ops #china

AI Reddit May 5, 2026 1 min read

X User Tricks Grok into Sending $200,000 in Crypto via Morse Code Prompt Injection

A Twitter user exploited indirect prompt injection using Morse code to trick Grok AI into executing a command that transferred 3 billion DRB tokens worth roughly $200,000 to the attacker's wallet via a connected trading bot.

#grok #prompt-injection #ai-security

AI Hacker News Apr 18, 2026 2 min read

HN asked whether AI bug hunting is really just more tokens

HN treated “AI cybersecurity is not proof of work” as a serious argument about search, model capability, and security asymmetry. The thread pushed past hype into a harder question: when an LLM flags a bug, did it understand the exploit path or just sample a suspicious pattern?

#ai-security #cybersecurity #llm

AI Hacker News Apr 17, 2026 1 min read

AI bug hunting pushed HN back into the open-source security debate

HN cared less about a clean open-versus-closed slogan than about what happens when AI makes vulnerability discovery cheaper for everyone. The Strix post argued that closing source does not remove the attack surface, while the thread split over noisy AI reports, SaaS economics, and whether obscurity can still raise attacker costs.

#open-source #ai-security #software

LLM X/Twitter Apr 3, 2026 2 min read

GitHub details the security architecture behind Agentic Workflows

GitHub said on April 1, 2026 that Agentic Workflows are built around isolation, constrained outputs, and comprehensive logging. The linked GitHub blog describes dedicated containers, firewalled egress, buffered safe outputs, and trust-boundary logging designed to let teams run coding agents more safely in GitHub Actions.

#github #agentic-workflows #ai-security

AI X/Twitter Apr 1, 2026 2 min read

Perplexity launches the Secure Intelligence Institute for frontier AI security research

Perplexity said on March 31, 2026 that it is launching the Secure Intelligence Institute to study the security, trustworthiness, and practical defense of frontier AI systems. The institute page says the work draws on Perplexity’s experience serving millions of users and thousands of enterprises, is led by Purdue professor Ninghui Li, and already highlights research such as BrowseSafe and a NIST-focused paper on securing AI agents.

#perplexity #ai-security #agents

LLM Mar 28, 2026 2 min read

OpenAI moves to acquire Promptfoo to bring agent security testing into Frontier

OpenAI announced plans to acquire Promptfoo on March 9, 2026. The company says Promptfoo’s security testing and evaluation technology will be integrated into OpenAI Frontier so enterprises can test and document risks such as prompt injection, jailbreaks, data leaks, and tool misuse earlier in the development cycle.

#openai #promptfoo #ai-security

AI Mar 20, 2026 2 min read

Cloudflare takes AI Security for Apps to GA and expands AI endpoint discovery across plans

On March 11, 2026, Cloudflare announced the general availability of AI Security for Apps. It also made AI endpoint discovery free for Free, Pro, and Business customers, while adding custom-topics detection and integrations involving IBM and Wiz.

#cloudflare #ai-security #waf

103

AI Hacker News Mar 14, 2026 2 min read

Hacker News Spotlights AI-Specific SQL Injection That Exposed McKinsey's Lilli Platform

A Hacker News thread drew attention to CodeWall's March 9 disclosure on McKinsey's Lilli platform, where an autonomous agent reportedly chained unauthenticated endpoints, SQL injection, and prompt-layer access into full production-database compromise.

#ai-security #sql-injection #rag