IBM and Deepgram bring speech-to-text and text-to-speech into watsonx Orchestrate
Original: Deepgram and IBM Introduce Advanced Voice Capabilities for Enterprise AI View original →
On Feb 24, 2026, IBM and Deepgram announced a collaboration to bring Deepgram's speech-to-text and text-to-speech technology into IBM's watsonx Orchestrate platform. The integration is aimed at enterprise AI teams that want voice interfaces, transcription, and real-time captioning inside the same orchestration stack they already use for digital agents and workflow automation.
IBM says it will embed Deepgram's capabilities into watsonx Orchestrate to support enterprise-grade transcription and real-time captioning. Deepgram becomes IBM's first voice partner under this arrangement, which is a meaningful signal about how IBM wants to extend watsonx. Instead of building every modality alone, IBM is combining its orchestration layer with a specialist provider where speech quality, latency, and reliability matter most.
The announcement also shows where enterprise AI demand is shifting. Both companies frame voice as a default interface for practical systems, not just a convenience feature. That includes digital agents that accept spoken instructions, internal assistants that summarize or route conversations, and customer-facing workflows where low-latency recognition and natural audio output make automation usable in live interactions. IBM is positioning watsonx Orchestrate as the layer that manages those flows, while Deepgram supplies the speech stack underneath.
Deepgram's CEO said enterprise deployments need a real-time platform that is accurate, low latency, and reliable at scale. IBM's partnership team described the integration as a way to modernize operations while preserving customer choice inside an open ecosystem. Those statements matter because they frame this as production infrastructure for large organizations rather than a demo for isolated chatbots.
The practical implication is that voice AI is moving deeper into mainstream enterprise software. Instead of treating speech recognition and text-to-speech as separate add-ons, vendors are folding them into orchestration platforms where models, agents, and business processes are already managed. If IBM executes well, the Deepgram partnership could make voice a native part of enterprise agent deployment instead of a specialized side project.
Related Articles
IBM says attacks that started with exploitation of public-facing applications rose 44% year over year in its 2026 X-Force Threat Index. The report also says vulnerability exploitation made up 40% of incidents observed in 2025 and that more than 300,000 ChatGPT credentials were exposed by infostealers.
OpenAI announced on X that Codex Security has entered research preview. The company positions it as an application security agent that can detect, validate, and patch complex vulnerabilities with more context and less noise.
OpenAI said on X on March 9 that it plans to acquire Promptfoo, an AI security platform, and keep the project open source. The deal strengthens OpenAI Frontier’s agentic testing and evaluation stack.
Comments (0)
No comments yet. Be the first to comment!