OpenAI's Codex rate card now maps credit usage to input, cached-input, and output tokens instead of relying only on rough per-message estimates. The April 5, 2026 Hacker News thread focused on how that gives teams clearer cost visibility, while noting that it leaves a split world: some plans use the new token-based card and others remain on the legacy message-based card until migration.
xAI used a recent X thread to spell out one of the capability upgrades behind Grok Imagine’s Quality mode. The company says the mode improves world knowledge and prompt understanding, letting the image model better interpret complex scenes, physics, object relationships, and specific cultural or brand references.
Anthropic’s March 2026 Economic Index report argues that longer-tenure Claude users bring higher-value work to the model and achieve better outcomes. The company says experienced users have 10% fewer personal conversations and a 10% higher success rate, even after accounting for differences in task mix and geography.
Lalit Maganti argues that AI coding agents made a long-delayed SQLite tooling project feasible, but only after he threw away the early “vibe-coded” version and rebuilt the project around Rust, tests, and tighter human control. The result is a grounded case study in how AI accelerates engineering and where it still fails.
Mistral AI said on March 26, 2026 that Voxtral TTS offers expressive speech, support for 9 languages and dialects, low latency, and easy adaptation to new voices. Mistral’s March 23 launch post says the 4B-parameter model can adapt from about three seconds of reference audio, reaches roughly 70ms model latency, supports up to two minutes of native audio generation, and is available by API and as open weights.
Netflix’s VOID reached Reddit as an open research release aimed at removing objects from video and repairing the interactions those objects caused in the scene. The notable details are the CogVideoX base, a two-pass pipeline, Gemini+SAM2 mask generation, and a 40GB+ VRAM requirement.
A DGX Spark owner on LocalLLaMA argues that NVFP4 remains far from production-ready, prompting a broader debate about whether NVIDIA's premium local AI box still justifies its price.
Together AI said on April 3, 2026 that Wan 2.7 from Alibaba Cloud is now available on its platform. The accompanying product post says text-to-video is live now, with image-to-video, reference-to-video, and video edit workflows rolling out on the same API, auth, and billing surface.
A widely shared r/singularity post drew attention to Anthropic research arguing Claude Sonnet 4.5 contains functional emotion-related representations rather than mere stylistic language. Anthropic says the vectors can influence preference, blackmail behavior in evaluations, and reward-hacking rates when researchers steer them.
A Hacker News thread amplified Nicholas Carlini's report that Claude Code helped uncover remotely exploitable Linux kernel bugs, including one introduced in 2003. The case suggests frontier coding models are becoming useful vulnerability discovery tools even before they become strong automated exploit builders.
Google AI said on March 25, 2026 that Lyria 3 Pro is now available across a broad mix of consumer, developer, and enterprise surfaces. The rollout suggests Google wants music generation to become part of its mainstream AI stack rather than a standalone experiment.
An r/singularity post highlighted reporting that roughly half of planned U.S. data center projects have been delayed or canceled because transformers, switchgear, batteries, and related power equipment remain supply-constrained. The story resonated because it reframes AI expansion as a grid and industrial logistics problem, not only a chip problem.