#ai-security

AI Hacker News Apr 18, 2026 2 min read

HN asked whether AI bug hunting is really just more tokens

HN treated “AI cybersecurity is not proof of work” as a serious argument about search, model capability, and security asymmetry. The thread pushed past hype into a harder question: when an LLM flags a bug, did it understand the exploit path or just sample a suspicious pattern?

#ai-security #cybersecurity #llm

AI Hacker News Apr 17, 2026 1 min read

AI bug hunting pushed HN back into the open-source security debate

HN cared less about a clean open-versus-closed slogan than about what happens when AI makes vulnerability discovery cheaper for everyone. The Strix post argued that closing source does not remove the attack surface, while the thread split over noisy AI reports, SaaS economics, and whether obscurity can still raise attacker costs.

#open-source #ai-security #software

AI Apr 11, 2026 2 min read

Cloudflare makes AI Security for Apps generally available and opens endpoint discovery to all customers

Cloudflare made AI Security for Apps generally available on March 11, 2026 and opened AI endpoint discovery to all customers, including Free, Pro, and Business plans. The launch adds custom topic detection and folds AI-specific controls into the company’s existing reverse-proxy and WAF stack.

#cloudflare #ai-security #llm-security

LLM sources.twitter Apr 4, 2026 1 min read

Anthropic Claims Large-Scale Distillation Attacks on Claude Involved 24,000 Accounts and 16 Million Exchanges

Anthropic said on February 23, 2026 that DeepSeek, Moonshot AI, and MiniMax carried out industrial-scale distillation attacks against Claude. The company framed model-output extraction as a security and platform integrity problem, not just a competitive concern.

#model-distillation #ai-security #claude

LLM sources.twitter Apr 3, 2026 2 min read

GitHub details the security architecture behind Agentic Workflows

GitHub said on April 1, 2026 that Agentic Workflows are built around isolation, constrained outputs, and comprehensive logging. The linked GitHub blog describes dedicated containers, firewalled egress, buffered safe outputs, and trust-boundary logging designed to let teams run coding agents more safely in GitHub Actions.

#github #agentic-workflows #ai-security

AI sources.twitter Apr 1, 2026 2 min read

Perplexity launches the Secure Intelligence Institute for frontier AI security research

Perplexity said on March 31, 2026 that it is launching the Secure Intelligence Institute to study the security, trustworthiness, and practical defense of frontier AI systems. The institute page says the work draws on Perplexity’s experience serving millions of users and thousands of enterprises, is led by Purdue professor Ninghui Li, and already highlights research such as BrowseSafe and a NIST-focused paper on securing AI agents.

#perplexity #ai-security #agents

LLM Mar 28, 2026 2 min read

OpenAI moves to acquire Promptfoo to bring agent security testing into Frontier

OpenAI announced plans to acquire Promptfoo on March 9, 2026. The company says Promptfoo’s security testing and evaluation technology will be integrated into OpenAI Frontier so enterprises can test and document risks such as prompt injection, jailbreaks, data leaks, and tool misuse earlier in the development cycle.

#openai #promptfoo #ai-security

AI Mar 20, 2026 2 min read

Cloudflare takes AI Security for Apps to GA and expands AI endpoint discovery across plans

On March 11, 2026, Cloudflare announced the general availability of AI Security for Apps. It also made AI endpoint discovery free for Free, Pro, and Business customers, while adding custom-topics detection and integrations involving IBM and Wiz.

#cloudflare #ai-security #waf

LLM Mar 15, 2026 2 min read

OpenAI to acquire Promptfoo and fold agent security testing into Frontier

On March 9, 2026, OpenAI said it plans to acquire Promptfoo and integrate its AI security tooling into OpenAI Frontier. The move pushes security testing, red-teaming, and governance closer to the default workflow for enterprise agents.

#openai #promptfoo #ai-security

AI Hacker News Mar 14, 2026 2 min read

Hacker News Spotlights AI-Specific SQL Injection That Exposed McKinsey's Lilli Platform

A Hacker News thread drew attention to CodeWall's March 9 disclosure on McKinsey's Lilli platform, where an autonomous agent reportedly chained unauthenticated endpoints, SQL injection, and prompt-layer access into full production-database compromise.

#ai-security #sql-injection #rag

AI Mar 14, 2026 2 min read

Cloudflare Takes AI Security for Apps to GA and Makes AI Endpoint Discovery Free

Cloudflare said on March 11, 2026 that AI Security for Apps is now generally available. The company also made AI endpoint discovery free across Free, Pro, and Business plans while adding custom topic detection and expanded policy controls.

#cloudflare #ai-security #waf

AI Mar 14, 2026 2 min read

Google completes Wiz acquisition and keeps multicloud support intact

Google said on March 11, 2026 that it has closed its acquisition of Wiz. Wiz will join Google Cloud, but Google says the platform will continue working across major cloud providers, including AWS, Azure, and Oracle Cloud.

#google #wiz #cloud-security