Meta’s SeamlessM4T AI can transcribe and translate up to 100 languages
With the help of the SeamlessM4T AI, content shared by users across Meta’s social media space will be more accurately translated, allowing creators to reach audiences beyond their borders
In an attempt to build the world's first universal speech translator, Meta AI has developed a new multimodal, multilingual AI model that can transcribe and translate speech and text in up to 100 different languages.
Bundled with a new open-source translation dataset containing 443,000 hours of speech paired with text and 29,000 hours of speech-to-speech alignments, the all-in-one SeamlessM4T transcription and translation model can take input in both spoken and written form.
This multimodal processing allows it to transcribe speech in nearly 100 languages and output translated text in any of them. For translated speech output, however, whether from speech or text input, the model is limited to 36 languages, including English.
This means the model can take speech in any of the 100 languages, transcribe it, translate it into the desired language, and return the translated text. Or it can go one step further and produce speech in the translated language. It works both ways between text and speech, allowing text-to-text, text-to-speech, speech-to-text, and even speech-to-speech translation with a single AI model.
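The four mode combinations described above can be sketched as a simple dispatcher. Everything below is purely illustrative: the `translate` wrapper and its stub helpers are hypothetical names, not Meta's actual API, and the stubs only stand in for the model's real transcription, translation, and synthesis stages.

```python
# Illustrative sketch of SeamlessM4T's four translation modes.
# All names (translate, transcribe, translate_text, synthesize) are
# hypothetical stubs mirroring the text-to-text, text-to-speech,
# speech-to-text, and speech-to-speech paths described in the article.

def translate(source, src_mode, tgt_mode, tgt_lang):
    """Route a request through one of the four supported mode pairs."""
    if src_mode == "speech":
        text = transcribe(source)        # speech -> source-language text
    else:
        text = source                    # already text
    translated = translate_text(text, tgt_lang)  # text -> translated text
    if tgt_mode == "speech":
        return synthesize(translated, tgt_lang)  # text -> translated speech
    return translated

# Stub helpers standing in for the real model components:
def transcribe(audio):
    return f"[transcript of {audio}]"

def translate_text(text, lang):
    return f"[{lang} translation of {text}]"

def synthesize(text, lang):
    return f"[audio in {lang}: {text}]"

print(translate("hello", "text", "text", "fr"))
# -> [fr translation of hello]
```

In the real model a shared encoder feeds either a text decoder or a speech synthesis stage, which is why one model covers all four paths; the dispatcher above only mirrors that routing logic.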
SeamlessM4T, short for Seamless Massively Multilingual and Multimodal Machine Translation, is, in spirit, a successor to last year's No Language Left Behind (NLLB) text-to-text machine translation model, which supported 200 languages.
The first direct speech-to-speech translator, however, came a few months later as a demo of the Universal Speech Translator from the Meta AI team, built to translate Hokkien, a language that does not even have a widely used writing standard.
All of these models, combined with the Massively Multilingual Speech model released earlier this year, which offers speech recognition and synthesis across more than 1,100 languages, laid the foundation for newer Meta AI models like Voicebox and the most recent SeamlessM4T.
With the help of the SeamlessM4T AI, content shared by users across Meta's social media space, including Facebook, Instagram, Threads, and the Metaverse, will be more accurately translated, allowing creators to reach audiences beyond their borders.
NPCs in the Metaverse could also benefit from this multilingual model, enabling seamless conversations in any language.
If the VR craze takes off again and the Metaverse gains traction, the model could also enable real-time translation between users interacting within it, acting as a real-life universal translator, one that not only breaks down the language barrier but also makes content shared online more universal and accessible.