#speech-to-text

AI X/Twitter Apr 18, 2026 1 min read

Grok STT API, 25+개 언어와 시간당 $0.10 가격으로 음성 API 시장 겨냥

왜 중요한가: xAI가 Grok Voice stack을 standalone STT/TTS API로 내며 batch $0.10/hour, streaming $0.20/hour 가격을 제시했다. 25+ languages, diarization, word-level timestamps는 call center와 meeting transcription 시장을 직접 겨냥한다.

#xai #grok #speech-to-text

LLM Reddit Apr 15, 2026 1 min read

LocalLLaMA가 들썩인 Gemma-4 audio 지원, llama-server에서 STT가 바로 돈다

LocalLLaMA가 이 thread를 크게 띄운 이유는 local agent stack에서 가장 귀찮은 별도 음성 파이프라인 하나를 치울 수 있다는 기대 때문이다. 게시물은 llama.cpp의 llama-server가 Gemma-4 E2A와 E4A 모델로 STT를 처리할 수 있게 됐다고 전했고, 댓글은 곧바로 Whisper와 Voxtral 비교로 넘어갔다.

#llama.cpp #gemma4 #speech-to-text

AI Hacker News Apr 7, 2026 1 min read

Hacker News, macOS용 완전 로컬 speech-to-text 앱 Ghost Pepper에 주목

440포인트를 모은 Show HN 스레드는 Control 키를 누르는 동안 녹음하고 완전히 로컬에서 전사하는 메뉴바 macOS 앱 Ghost Pepper를 에이전트 도구 흐름의 일부로 끌어올렸다.

#local-ai #speech-to-text #macos