Speech

Speech input to machine translation APIs

Speech input, also known as voice input or audio input, is a feature of machine translation APIs required for speech translation.

There are different types of speech translation.

Live (simultaneous) speech translation
Offline (asynchronous) speech translation

Speech input can be combined with different types of output.

Speech-to-speech machine translation (dubbing)
Speech-to-text machine translation (subtitles)

Often, speech recognition (transcription) and translation are separate APIs or API calls, that can be combined to effectively support speech translation.

Speech translation APIs are typically slower than text translation APIs.

API support

17 machine translation APIs support speech translation.

AppTek
Baidu Translate
Geofluent
Lengoo
Lilt
Lingmo Translation
Lingua Custodia
LingvaNex
Niutrans
Skrivanek
Tencent Machine Translation
Youdao Translate
OpenAI
Lara
Sarvam AI
Mistral AI
Live Interpreter