Insights
Home All Articles Series
Bookmarks History

LLM

RSS Feed
LLM Reddit Feb 14, 2026 1 min read

SWE-rebench January 2026 Snapshot Highlights a Tight Race in Coding Agents

A LocalLLaMA discussion of SWE-rebench January runs reports close top-tier results, with Claude Code leading pass@1 and pass@5 while open models narrow the gap.

#benchmark#coding-agents#swe-bench
51
LLM Hacker News Feb 14, 2026 1 min read

Gemini 3 Deep Think Expands From Benchmarks to Science and Engineering Workflows

Google announced a major Gemini 3 Deep Think upgrade with stronger reasoning benchmarks and early API access for researchers and enterprises.

#gemini#google#reasoning
34
LLM Feb 13, 2026 1 min read

OpenAI Begins Testing Ads in ChatGPT Free and Go Tiers

OpenAI announced on February 9 that it's testing ads in ChatGPT for Free and Go tier users in the US. Plus, Pro, Business, and Enterprise tiers remain ad-free.

#openai#chatgpt#advertising
34
LLM Feb 13, 2026 1 min read

Anthropic Launches Claude Opus 4.6, Outperforms GPT-5.2

Anthropic released Claude Opus 4.6, achieving industry-leading performance in coding, long-context retrieval, and knowledge work.

#anthropic#claude#llm
38
LLM Feb 13, 2026 1 min read

OpenAI Disbands 'Mission Alignment' Team Focused on Safe AI Development

OpenAI disbanded its Mission Alignment team, which communicated the company's mission to the public and employees. The team leader was reassigned as 'Chief Futurist' amid renewed AI safety concerns.

#openai#ai-safety#organizational-change
39
LLM Feb 13, 2026 1 min read

Anthropic Hits $380B Valuation, Overtakes OpenAI in Enterprise Market Share

Anthropic raised $30B at a $380B valuation and now leads the enterprise LLM market with 32% share, surpassing OpenAI's 25%.

#anthropic#openai#funding
42
LLM Feb 13, 2026 1 min read

Albertsons Joins OpenAI Ad Pilot, Testing ChatGPT Ad Formats for Grocery Retail

Major U.S. grocery chain Albertsons joined OpenAI's ChatGPT advertising pilot. The test explores conversational AI ad formats for retail, signaling growing industry interest in AI-native advertising.

#openai#chatgpt#advertising
38
LLM Feb 13, 2026 1 min read

Microsoft Discovers 'GRP-Obliteration': A Single Prompt That Breaks LLM Safety Alignment

Microsoft AI Safety team discovered GRP-Obliteration, an attack that disables safety alignment across 15 major LLMs with a single prompt. GPT-OSS-20B's attack success rate jumped from 13% to 93%.

#microsoft#safety#jailbreak
40
LLM Feb 12, 2026 1 min read

Meta Llama 4 Ushers in Native Multimodal AI Era with 10M Token Context

Meta has unveiled Llama 4 Scout and Maverick, the first open-weight natively multimodal models. With industry-leading 10 million token context and MoE architecture, they outperform GPT-4o and Gemini 2.0 Flash.

#meta#llama-4#multimodal
41
LLM Feb 12, 2026 1 min read

DeepSeek V4 Targets Mid-February Launch with Revolutionary Coding Capabilities

DeepSeek is set to launch its next-generation coding-focused AI model V4 in mid-February, featuring 1M+ token context windows and consumer GPU support for unprecedented developer accessibility.

#deepseek#coding#open-source
33
LLM Reddit Feb 12, 2026 1 min read

Z.ai Releases GLM-5: 744B Parameter Open-Source Powerhouse

Z.ai unveiled GLM-5, a 744B parameter (40B active) model pre-trained on 28.5T tokens. Designed for complex systems engineering and long-horizon agentic tasks, it leads open-source models in multiple benchmarks.

#glm-5#open-source#moe
38
LLM Feb 12, 2026 1 min read

OpenAI Unveils GPT-5.3-Codex, the First AI Model That Helped Build Itself

OpenAI launches GPT-5.3-Codex, the first model to debug its own training and manage deployment. Released with tight security controls due to cybersecurity concerns.

#openai#gpt-5#codex
36
Previous 74757677 Next

© 2026 Insights. All rights reserved.

Newsletter Atom