DIGITAL TRANSFORMATION

Powerful AI models are healthy for voice tech

Microsoft’s acquisition of Nuance for almost $20 billion signposted that speech-based services have strong prospects. Healthcare is a good indicator of what innovations voice tech will deliver next.

7 September 2022

James Tyrrell

@JT_bluebird1

james.tyrrell@hybrid.co

Noteworthy alternative: adding information to medical records is one of many areas where speedy speech-powered services offer advantages over point-and-click and keyboard entry. Secure smartphone apps add to the convenience for users. Image credit: Shutterstock.

One of the biggest developments in speech recognition and voice-activated technology has been the massive improvement in language models. The ability to feed algorithms with huge amounts of training data, and to apply techniques such as deep learning, has given developers access to a much richer set of statistical models. And because these models do a much better job of capturing the complexity of language, the applications built on top of them are like night and day compared with voice software that you may have used in the past.

“It’s made a fundamental difference to the user experience,” Simon Wallace, Chief Clinical Information Officer at Nuance, told TechHQ. “Speech has become a massive springboard to take technology into a new era.” In healthcare, trends include exploring where advanced tools such as conversational artificial intelligence (AI) can play a bigger role in supporting clinicians. And this vision of the future could be closer than we imagine.

Voice tech today

Over the past decade, speech recognition and voice-activated technology has proven to be a significant time-saver for clinicians. Features such as being able to dictate at the point of cursor represent the first stage of the transition. “We typically speak three times faster than we type,” said Wallace. “Systems also have templates for larger blocks of text, which further speeds up the process.” Nuance has been busy training its language models with a large body of scientific information, including medical dictionaries, so that products are well-equipped to cope with the complex terminology used in the medical profession.

Data packs – English has four versions, one each for the US, UK, Australia and Canada – manage localization scenarios such as differences in the pronunciation of a pharmaceutical drug across countries that share the same language. Thanks to the use of much more detailed base models – which have been created to serve a number of languages, not just English – clinicians can use Nuance’s system straight out of the box. And, as users have their own profiles, the language learning routines can continue to feed back and further tailor their operation. Today, voice characteristics (Wallace makes clear that the system doesn’t save any conversations directly) learned from more than half a million clinical users mean that AI tools are extremely well placed to serve the healthcare sector.

Cloud certified

There are other useful features too, such as cloud deployment, which means that authorized users can access voice technology from anywhere. During the pandemic, when some healthcare workers had to work from home, it meant that records could still be updated and managed using voice commands. Healthcare has proven to be a rich training ground for AI-powered speech recognition and voice-activated technology. Being able to get to grips with complex, highly technical language is definitely a win for application providers. As is building systems that meet tough security criteria, which is necessary to safeguard patient data. Wallace, who has worked as a GP and hospital doctor himself, notes the various standards that voice tech providers must comply with. The list includes DCB0129 – prepared by the UK’s NHS Digital Clinical Safety team – which is an information standard designed to help manufacturers of health IT software evidence the clinical safety of their products.
