#speech-to-text

AI X/Twitter Apr 18, 2026 2 min read

Grok STT API targets voice apps with 25+ languages at $0.10/hour

Why it matters: xAI has turned the Grok Voice stack into standalone STT/TTS APIs with batch transcription at $0.10/hour and streaming at $0.20/hour. The post puts 25+ languages, diarization, and word-level timestamps in direct competition with enterprise transcription tools.

#xai #grok #speech-to-text

LLM Reddit Apr 15, 2026 2 min read

LocalLLaMA Jumps on Gemma-4 Audio Support in llama-server

The LocalLLaMA thread took off because native speech-to-text inside llama.cpp is exactly the kind of feature that removes an extra pipeline from local agent setups. The post says llama-server can now run STT with Gemma-4 E2A and E4A models, and commenters immediately started comparing the practical experience to Whisper and Voxtral.

#llama.cpp #gemma4 #speech-to-text

AI Hacker News Apr 7, 2026 2 min read

Hacker News Boosts Ghost Pepper’s Case for Fully Local Speech-to-Text on macOS

A 440-point Show HN thread put Ghost Pepper, a menu-bar macOS app that records on Control-hold and transcribes locally, into the agent-tooling conversation because its speech and cleanup stack stays on-device.

#local-ai #speech-to-text #macos