LLM

LLM Feb 28, 2026 2 min read

OpenAI and Figma Partner to Connect Prompt-Based Design and Production Coding

OpenAI and Figma announced a partnership that links Figma Make with OpenAI Codex workflows. The companies position the integration as a faster path from prompt and prototype to production-ready software.

#openai #figma #developer-tools

LLM Reddit Feb 28, 2026 2 min read

Reddit Highlights “Reverse CAPTCHA” Study on Invisible Unicode Prompt Injection in AI Agents

A Reddit post in r/artificial drew attention to a security study evaluating how hidden Unicode instructions can steer tool-enabled LLM agents, reporting 8,308 graded outputs across five frontier models.

#ai-security #prompt-injection #unicode
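
The summary above does not say which code points the study used, so the following is only a minimal sketch of how hidden Unicode can be flagged before text reaches an agent. It assumes the payload relies on zero-width characters, the Unicode Tags block, or other format-category (`Cf`) code points; the helper names `is_invisible`, `find_invisible`, and `strip_invisible` are hypothetical.

```python
import unicodedata

# Assumption: illustrative set of zero-width characters, not the study's actual encoding.
ZERO_WIDTH = {"\u200b", "\u200c", "\u200d", "\u2060", "\ufeff"}


def is_invisible(ch: str) -> bool:
    """Return True for zero-width, Unicode tag, or other format (Cf) characters."""
    if ch in ZERO_WIDTH:
        return True
    if 0xE0000 <= ord(ch) <= 0xE007F:  # Unicode "Tags" block
        return True
    return unicodedata.category(ch) == "Cf"


def find_invisible(text: str) -> list:
    """List (index, code point label) pairs for every invisible character found."""
    return [(i, f"U+{ord(ch):04X}") for i, ch in enumerate(text) if is_invisible(ch)]


def strip_invisible(text: str) -> str:
    """Drop every character that is_invisible() flags."""
    return "".join(ch for ch in text if not is_invisible(ch))


sample = "hello\u200bworld\U000E0041"
hits = find_invisible(sample)
cleaned = strip_invisible(sample)
```

Stripping every `Cf` character is deliberately blunt: it also removes legitimate uses such as zero-width joiners in emoji sequences, so a real filter would likely need an allowlist.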

LLM Feb 27, 2026 1 min read

Anthropic Releases Claude Opus 4.6 and Sonnet 4.6 with Stronger Coding Reliability

Anthropic announced Claude Opus 4.6 and Sonnet 4.6 on February 18, 2026. The release emphasizes coding performance, longer-context stability, and new dynamic threat prevention controls for enterprise deployment.

#anthropic #claude #opus-4-6

LLM Feb 27, 2026 2 min read

OpenAI and Snowflake Expand Enterprise AI Integration in Cortex

On February 2, 2026, OpenAI and Snowflake announced an expanded partnership to bring OpenAI models directly into Snowflake Cortex AI. The move targets secure, governed, and lower-friction enterprise deployment of generative AI.

#openai #snowflake #enterprise-ai

LLM X/Twitter Feb 27, 2026 1 min read

Google DeepMind highlights Nano Banana 2 for data-rich visual generation

On February 26, 2026 (UTC), Google DeepMind said on X that Nano Banana 2 can turn instructions into data-rich infographics and educational diagrams. The post also emphasized Gemini world knowledge and real-time web-grounded generation.

#google-deepmind #gemini #multimodal

LLM X/Twitter Feb 27, 2026 1 min read

Perplexity Launches `pplx-embed` Family for Web-Scale Retrieval with INT8 and Binary Outputs

Perplexity announced on February 26, 2026 that `pplx-embed-v1` and `pplx-embed-context-v1` are now available in 0.6B and 4B variants. The company positions the release as retrieval-first infrastructure with quantized embeddings and benchmark-focused performance claims.

#perplexity #embeddings #retrieval
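
The announcement summary does not describe how the quantized outputs are produced. As a generic illustration of what INT8 and binary embeddings mean, the sketch below assumes symmetric per-vector scaling for INT8 and sign-based binarization compared by Hamming distance; the function names are hypothetical and this is not Perplexity's method or API.

```python
def quantize_int8(vec):
    """Symmetric INT8 quantization: map the largest |x| to 127 and round the rest."""
    max_abs = max(abs(x) for x in vec)
    if max_abs == 0:
        return [0] * len(vec), 1.0
    scale = max_abs / 127
    return [round(x / scale) for x in vec], scale


def quantize_binary(vec):
    """Sign-based binarization packed into an int; bit i is set when vec[i] > 0."""
    bits = 0
    for i, x in enumerate(vec):
        if x > 0:
            bits |= 1 << i
    return bits


def hamming_distance(a, b):
    """Count the bit positions where two packed binary embeddings differ."""
    return bin(a ^ b).count("1")


q, scale = quantize_int8([1.0, -1.0, 0.0])
code_a = quantize_binary([0.3, -0.2, 0.9])   # bits 0 and 2 set
code_b = quantize_binary([-0.1, 0.4, 0.7])   # bits 1 and 2 set
distance = hamming_distance(code_a, code_b)
```

Binary codes trade accuracy for storage (one bit per dimension) and allow fast candidate filtering by Hamming distance; INT8 keeps more of the original geometry at one byte per dimension.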

LLM Reddit Feb 27, 2026 2 min read

LocalLLaMA Spotlight: 144M Spiking Neural Network LM trained from scratch

An r/LocalLLaMA post reports a from-scratch 144M-parameter Spiking Neural Network language model experiment named Nord. The author claims 97-98% inference sparsity, STDP-based online updates, and better prompt-level topic retention than GPT-2 Small on limited examples, while clearly noting current loss and benchmark limitations.

#spiking-neural-networks #llm #open-source

LLM Feb 27, 2026 2 min read

OpenAI and Paradigm introduce EVMbench for smart contract security testing

OpenAI and Paradigm launched EVMbench, a benchmark for AI agent performance on smart contract detection, patching, and exploitation tasks. OpenAI reports GPT-5.3-Codex scored 72.2% in exploit mode versus 31.9% for GPT-5.

#security #smart-contracts #benchmark

LLM Feb 27, 2026 2 min read

OpenAI and Figma deepen Codex integration for code-to-design workflows

OpenAI and Figma launched a new integration that links Codex directly with Figma through an MCP-based workflow. The goal is to reduce context loss between implementation and design by enabling continuous code-to-canvas roundtrips.

#codex #figma #mcp

LLM Reddit Feb 27, 2026 2 min read

OpenAI Pauses SWE-bench Verified Evaluations After 16.4% Flaw Finding

A trending Reddit post in r/singularity points to OpenAI's statement that it no longer evaluates on SWE-bench Verified, citing at least 16.4% flawed test cases. The announcement reframes how coding-model benchmark scores should be interpreted in production decision-making.

#openai #swe-bench #benchmark

LLM Reddit Feb 26, 2026 1 min read

Reddit Spotlights DeepSeek DualPath for KV-Cache I/O Bottlenecks in Agentic LLMs

A trending r/LocalLLaMA thread highlighted the DualPath paper on KV-Cache bottlenecks in disaggregated inference systems. The arXiv abstract reports up to 1.87x offline throughput and 1.96x average online throughput gains while still meeting service-level objectives (SLOs).

#llm-inference #kv-cache #rdma

LLM Hacker News Feb 26, 2026 1 min read

HN Debate: How OpenAI Can Defend Its Position as AI Distribution Broadens

A high-engagement Hacker News thread (388 points, 535 comments) on Benedict Evans' OpenAI analysis focused on defensibility beyond raw model quality. Users debated stickiness, distribution leverage, and enterprise integration as the real battleground.

#openai #ai-market #distribution