An app offering real-time translations is to allow people in Japan to
speak to foreigners over the phone with both parties using their native
tongue.
NTT Docomo - the country's biggest mobile network - will initially
convert Japanese to English, Mandarin and Korean, with other languages
to follow.
It is the latest in a series of telephone conversation translators to launch in recent months.
Lexifone and Vocre have developed other products.
Alcatel-Lucent and Microsoft are among those working on other solutions.
The products have the potential to let companies avoid having to use
specially trained multilingual staff, helping them cut costs. They could
also aid tourism.
However, the software involved cannot offer perfect translations, limiting its use in some situations.
Cloud technology
NTT Docomo unveiled its Hanashite Hon'yaku app for Android devices at
the Combined Exhibition of Advanced Technologies (Ceatec) show in Japan
earlier this month, and plans to launch it on 1 November.
It provides users with voice translations of the other speaker's
conversation after a slight pause, as well as providing a text readout.
"French, German, Indonesian, Italian, Portuguese, Spanish and Thai will
be added for this application in late November, raising the number of
non-Japanese languages to 10," the firm said in a statement.
"Fast and accurate translations are possible with any smartphone,
regardless of device specifications, because Hanashite Hon'yaku utilises
Docomo's cloud [remote computer servers] for processing."
The caller must subscribe to one of Docomo's packages to be able to use it.
NTT Docomo will soon face competition from France's Alcatel-Lucent which
is developing a rival product, WeTalk. It can handle Japanese and about
a dozen other languages including English, French and Arabic.
The service is designed to work over any landline telephone, meaning the
company has had to find a way to do speech recognition using audio data
sampled at a rate of 8kHz or 16kHz.
Other products - which rely on data connections - have used higher 44kHz samples which are easier to process.
Alcatel-Lucent uses a patented technology to capture the user's voice
and enhance it before applying speech recognition software. The data is
then run through translation software before being run through a speech
synthesiser.
The firm said all this could be done in less than a second. However, it
has opted to wait before the speaker has stopped talking before starting
the translation after experiments carried out with workers at insurance
company Axa suggested users preferred the experience.
"We are still working on improving the system," Gilles Gerlinger, the product's co-founder, told the BBC.
"You can do conversations with one person, but we want to allow
conferences with 10 people and four different languages, and the system
would provide translations in every language needed.
"We also have a project called MyVoice which can have a synthetic voice that sounds like your real one."
Mr Gerlinger suggested that his firm would make money from the product
by renting servers with the necessary software to big businesses, and
charging smaller ones a fee for the amount of time they used the
service.
Converted video chats
Microsoft's Research Labs has also been working on a technology it calls
the Translating Telephone. The firm has acknowledged that one of the
biggest problems was making the software adapt itself to cope with
different ways people pronounce words.
"The technologies are still not perfect," said researcher Kit Thambiratnam in 2010.
"But we feel they are good enough for two people to communicate in their
native languages, as long as they are willing to speak carefully and
maybe occasionally repeat themselves."
Google already has a Translate app that can translate 17 spoken
languages, allowing face-to-face conversations with a foreigner, but it
is not yet designed to work with telephone calls.
Start-up Israeli company Lexifone is hoping to get a head-start with its own phone conversation product.
It launched earlier this year offering translations between English, Spanish, Portuguese, Italian, French and Mandarin.
Its chief executive, an ex-IBM computer engineer, has ambitions to
disrupt the human translation industry which he said was worth $14bn
(£8.7bn) a year.
"Our original plan was for annual growth of 200%," Ike Sagie told Reuters last month.
"The way we see market acceptance and the way we see the market
welcoming the technology I think we have the potential for growing
faster than that."
The firm is working with BT and Telefonica to offer its service to the phone networks' customers.
Meanwhile California-based MyLanguage, is pursuing another strategy by
providing voice and text translations during video chats via its Vocre
app for iPhones.
The facility - which is currently being beta tested - means that customers will need an internet connection to use it.
Lost in translation
Despite the ambitions of those involved in the nascent sector, one analyst questioned their chances of success
"These kind of real-time technologies have been 'two to three years
away' for the past decade," said Benedict Evans, technology expert at
Enders Analysis.
"Both speech recognition and machine translation are sort of there if you're not too fussy.
"But they are generally not as good as speaking the language itself, and
my suspicion is that they would not reliable enough to use them for
business purposes when you need to be really sure about what the other
person said."
www.bbc.com
No comments:
Post a Comment