Bill Pechey

Speech recognition suits network applications

Microsoft's voice technology and progress on speaker-independent systems have the industry talking

Written by Bill Pechey

The IT industry seems to work like London buses: you see nothing new in a field for ages, then several developments appear in a short time. This time it is speech recognition. Nuance announced the release of Dragon Naturally Speaking version 9; and Microsoft followed up a disastrous demonstration of its own voice technology with an impressive presentation that worked perfectly at the SpeechTEK show.

I have used earlier versions of both products and, if used carefully with good microphones, they can produce excellent results. I haven't had a chance to try out Dragon 9 yet but people who have tried it say it is a big step forward. The accuracy is better and, for the first time, it has a speaker-independent mode. If this feature works well it will open up a huge range of applications for speech recognition.

Advertisement

I must admit that I've always thought that talking to your computer is something to be avoided or, at least, done in private. Microsoft obviously disagrees as its speech technology will be integrated into Windows Vista, and I expect that there will be a good deal of publicity about it when Vista is launched. I hope this doesn't lead to a potentially far worse problem than mobile phones on trains.

It is surprising to me how much research into speech recognition is currently going on; many big companies have teams working on it, as do a large number of universities. If it's not for talking to your PC, what is it all for? The answer may lie in networked applications.

The European Telecommunications Standards Institute (Etsi) has a project called Aurora to develop standards for distributed speech recognition in mobile networks. The idea is that part of the recognition engine runs on mobile devices and transmits speech to the central recogniser. This sort of thing seems much more sensible to me; it's more natural to talk into your mobile phone than your laptop.

Etsi says that one application is to dictate your impressions of a meeting and then have them emailed to you so that you can do the editing when you get back to your office or hotel room. That's quite neat. It's not too hard to think of other neat services that could be integrated with various types of networked servers.

As the popular SpinVox service has shown, it is often much better to receive voicemail messages as text than speech. A good centralised speech recogniser could let companies run a similar service on their networks, especially if speaker-independent systems really are just around the corner. Microsoft's Office Communications Server 2007 should be helpful as it will have integrated speech recognition.

Bill Gates says that speech recognition will enter the mainstream in the next decade; he may be right but it may be mainstream in networked applications rather than for just interacting with your PC. In any case, corporates may now wish to consider whether good speech recognition could improve the effectiveness of their systems.

Tags:

Related whitepapers

Related jobs

Do you agree?

IT white papers

Search vnunet IThound

Top categories

Job of the week

Search thousands of IT jobs :

Search thousands of IT jobs:

Advanced search

Hiring now on ComputingCareers:

Related IT jobs

Search thousands of IT jobs :

Search thousands of IT jobs:

Advanced search

Advertisement

Newsletter signup

Sign up for our range of FREE newsletters:

Existing User

Newsletter user login:

Enter email address to edit your newsletter preferences

Watch

Shaun Nichols and Iain Thomson

03 Oct 2008

6.49 MBPodcast Special: Views from the Valley More...

Podcast image

02 Oct 2008

14.35 MBComputing podcast - Next-generation broadband Britain; and we report from Gartner's IT security summit More...

Shaun Nichols and Iain Thomson

26 Sep 2008

3.43 MBPodcast Special: Views from the Valley More...

Poll

Google Android

Google Android

Are you intending to try out a Google Android mobile phone?

Previous poll results

Spotlight

ISSE 2008

Sharing information key to cracking e-crime

Reluctance to report breaches only adding to the problem   More...

AMD logo

AMD expected to split into two

Separate entities to focus on chip design and manufacturing   More...

CA logo

CA pushes into virtualisation management space

Data Center Automation Manager looks after virtual and physical resources   More...

Hacking

Europeans charged in US hack attacks

British man facing 15 years in prison   More...

Primary Navigation