Revolutionizing Communication: DeepL's Voice-to-Voice Translation Suite

In a significant expansion of its text translation capabilities, DeepL has announced the release of a voice-to-voice translation suite that aims to bridge the language gap in real-time conversations. This innovative solution caters to various use cases, including meetings, mobile and web-based interactions, and group conversations for frontline workers through custom apps. Moreover, the company is introducing an API that enables developers and businesses to integrate DeepL’s technology into their products, opening up new opportunities for customized applications.

The journey towards voice-to-voice translation was a natural progression for DeepL, given its extensive experience in text translation. According to CEO Jarek Kutylowski, the key challenge lay in striking a balance between reducing latency and maintaining accurate results. By leveraging its expertise in text translation, DeepL has developed an edge in translation quality, which is critical in real-time conversations.

The new voice-to-voice translation suite offers several add-ons for popular platforms like Zoom and Microsoft Teams. These integrations enable listeners to receive real-time translations while others are speaking in their native languages or

DeepL’s technology is designed to adapt to custom vocabulary, such as industry-specific terms and company/personal names, making it an attractive solution for various industries. Furthermore, the voice-to-voice translation suite has the potential to revolutionize customer service by providing support in languages where qualified staff are scarce and expensive to hire.

The move marks a significant expansion of DeepL’s capabilities, positioning the company as a major player in the rapidly evolving landscape of real-time communication. As AI continues to transform industries, it is likely that voice-to-voice translation will play an increasingly important role in shaping customer service experiences.

In the competitive space, DeepL faces stiff competition from well-funded startups like Sanas, Camb.AI, and Palabra. These companies are working on adjacent cornerstones of the space, such as accent modification, speech synthesis, and real-time speech translation. While Palabra’s solution is designed to preserve both meaning and the speaker’s original voice, DeepL’s end-to-end voice translation model aims to skip the text step entirely.

As the market for real-time communication continues to evolve, it remains to be seen how these companies will differentiate themselves and cater to diverse customer needs. One thing is certain: the stage is set for a thrilling competition that will ultimately benefit consumers and businesses alike.


Source: https://techcrunch.com/2026/04/16/deepl-known-for-text-translation-now-wants-to-translate-your-voice/