#speech-recognition

AI Hacker News Jul 14, 2026 1 min read

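Apple SpeechAnalyzer outperforms Whisper Small in on-device speech recognition

A benchmark drew attention for measuring the accuracy of SpeechAnalyzer, for which Apple had not published figures, under a single shared test environment. The key finding is that it showed a lower word error rate not only than Apple's legacy API but also than Whisper Small.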

#apple #speech-recognition #whisper
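
For readers unfamiliar with the metric, word error rate (WER) is conventionally the word-level edit distance between a reference transcript and the recognizer's output, divided by the number of reference words. The sketch below is a minimal illustration of that definition, not the benchmark's actual scoring code; the function name `word_error_rate` is hypothetical, and real evaluations usually normalize casing and punctuation first, which this sketch does not do.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference word count.

    Hypothetical helper for illustration; no text normalization is applied.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting i reference words to match an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion
                    curr[j - 1] + 1,                  # insertion
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the") over 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.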

AI Hacker News Apr 1, 2026 1 min read

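Cohere releases Transcribe, an open-source ASR model supporting 14 languages

Cohere has released Transcribe, a 2B ASR model under Apache 2.0, strengthening its presence in speech recognition. The release rests on three pillars: 14-language support, distribution on Hugging Face, and a claimed average WER of 5.42.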

#cohere #speech-recognition #asr

AI X/Twitter Mar 28, 2026 1 min read

Cohere puts its open 2B ASR model Transcribe and a WebGPU browser demo front and center

On March 28, 2026, Cohere said that Transcribe sets a new standard for speech recognition accuracy in real-world noise and shared a link to try it. Related Hugging Face material positions it as an Apache 2.0, 2B-parameter, 14-language ASR model, and a separate WebGPU demo shows the model running locally in the browser.

#cohere #transcribe #speech-recognition

AI X/Twitter Mar 27, 2026 1 min read

Cohere releases Transcribe, a 2B speech recognition model under Apache 2.0

On March 26, 2026, Cohere announced Transcribe as an open-source speech recognition model. According to Cohere, the 2B Conformer-based system supports 14 languages, tops the Hugging Face Open ASR Leaderboard with an average WER of 5.42, is offered under the Apache 2.0 license, and is available via download, the API, and Model Vault.

#cohere #speech-recognition #asr

AI Reddit Mar 6, 2026 1 min read

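LocalLLaMA post: mitigations for Whisper's silence hallucinations, shared from production use

A post on r/LocalLLaMA shared production mitigations for Whisper generating text during silent segments, combining Silero VAD, cutting off the prompt history, and a blocklist.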

#whisper #speech-recognition #vad
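
As a rough illustration of how two of those mitigations can fit together, the sketch below gates each audio chunk before recognition and then drops outputs that match a blocklist. It is not the poster's code: a simple RMS energy threshold stands in for Silero VAD, the blocklist entries and threshold value are invented for the example, and `transcribe_chunk`, `rms`, and `normalize` are hypothetical names. The `transcribe` argument is any callable that maps samples to text, so no specific Whisper API is assumed. The prompt-history cutoff is not covered here.

```python
import math
from typing import Callable, Optional, Sequence

# Hypothetical blocklist: entries are illustrative, not taken from the post.
BLOCKLIST = frozenset({"thanks for watching", "thank you"})


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square energy of a chunk; 0.0 for an empty chunk."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def normalize(text: str) -> str:
    """Lowercase, trim edge punctuation, and collapse whitespace."""
    return " ".join(text.lower().strip(" .!?,").split())


def transcribe_chunk(
    samples: Sequence[float],
    transcribe: Callable[[Sequence[float]], str],
    energy_threshold: float = 0.01,  # assumed value; tune per microphone
    blocklist: frozenset = BLOCKLIST,
) -> Optional[str]:
    """Return text for a chunk, or None if it is gated or blocklisted."""
    # Step 1: energy gate (stand-in for a real VAD such as Silero VAD).
    # Silent chunks never reach the recognizer, so it cannot hallucinate.
    if rms(samples) < energy_threshold:
        return None
    text = transcribe(samples)
    # Step 2: discard outputs that exactly match a known hallucination phrase.
    if normalize(text) in blocklist:
        return None
    return text


if __name__ == "__main__":
    fake_recognizer = lambda chunk: "Thanks for watching!"
    print(transcribe_chunk([0.0] * 160, fake_recognizer))       # gated: silent
    print(transcribe_chunk([0.5, -0.5] * 80, fake_recognizer))  # blocklisted
```

Gating before recognition also saves compute, while the blocklist acts as a second line of defense for chunks that pass the gate but still yield a stock phrase.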

AI Hacker News Feb 25, 2026 1 min read

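Moonshine's open-weight STT draws attention on HN, presenting a comparison with Whisper Large v3

Moonshine Voice gained traction through a Show HN post. The project foregrounds an implementation for real-time speech that aims to balance accuracy and latency.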

#speech-recognition #asr #edge-ai