HN spotlights Caveman, a Claude Code plugin that trims tokens with “caveman” responses

A fast-rising Hacker News thread is centered on the GitHub project Caveman. The branding is playful, but the underlying problem is practical. As Claude Code, Codex, and similar coding agents become part of everyday engineering workflows, overly polite and wordy answers are not just a style annoyance. They also consume money, latency, and precious context window budget. At the time of writing, the HN discussion had reached 399 points and 238 comments.

Caveman is positioned as a lightweight skill/plugin layer rather than a model change. Its README says it can reduce output tokens by roughly 75% by removing filler language and hedging while keeping code blocks and technical terms intact. The before-and-after examples are easy to understand: a long explanation of React re-renders becomes a short note about object references and useMemo, without changing the actual fix.

That is why the HN interest matters. Developers are increasingly treating response formatting itself as an optimization surface. If an agent can preserve the same diagnosis, command, or review guidance while emitting fewer tokens, the upside compounds over long sessions. Less output means lower cost, faster completions, and more room for follow-up context.

The obvious caveat is that compression only helps if accuracy survives. Caveman’s README makes that claim, but the real test is broader use across debugging, design review, and implementation workflows. Even so, the project captures a real shift in AI tooling. People are no longer only asking which model is smartest. They are also asking which interaction style is efficient enough to live inside daily engineering loops.

LLM 6d ago 2 min read

466M lines in 20 hours: Claude Code becomes Alberta security infrastructure

Alberta put roughly 50 Claude Code agents across 466 million lines of government code and compressed a security review estimated at 6.5 years into 20 hours. The case matters because it moves coding agents from developer convenience into public-sector cyber operations.

#anthropic #claude-code #cybersecurity

LLM X/Twitter Mar 27, 2026 2 min read

OpenAI expands Codex access and turns plugins into reusable workflow packages

OpenAIDevs said on March 27, 2026 that Codex usage limits had been reset across plans so users could try newly launched plugins. OpenAI's Help Center says Codex is temporarily available on Free and Go, paid plans are getting 2x rate limits, and plugins package reusable workflows built from skills, app integrations, and MCP configurations.

#openai #codex #plugins

104

LLM X/Twitter Mar 27, 2026 2 min read

OpenAI rolls out Codex plugins to connect Slack, Figma, Notion, Gmail, and more

OpenAI Devs said on March 26, 2026 that plugins are rolling out in Codex, letting the agent work with common tools such as Slack, Figma, Notion, and Gmail. OpenAI's Codex docs describe plugins as reusable bundles that package skills, app integrations, and MCP server settings, turning Codex into a more shareable workflow layer for teams.

#openai #codex #plugins

102

Related Articles

466M lines in 20 hours: Claude Code becomes Alberta security infrastructure

OpenAI expands Codex access and turns plugins into reusable workflow packages

OpenAI rolls out Codex plugins to connect Slack, Figma, Notion, Gmail, and more