Voice and Voice-XML... inextricably linked?

Yes and no.

Voice-XML is the World Wide Consortium’s (W3C) standard markup language based on XML used for creating voice user interfaces that use advanced speech recognition (ASR) and text-to-speech (TTS) technologies. From that perspective yes. But when you look at most of the Voice-XML deployments in the market today they are DTMF.

Why?

Simply put, Voice-XML is the next generation IVR platform technology. Because it is an open standard, companies, in theory, do not have vendor reliance and as a result have more portability and freedom of choice. Moreover it leverages the web.

For over two decades, the traditional or proprietary IVR systems were the only IVR deployments in the market. These systems have since earned the name traditional IVR systems. But over the last six to seven years, as business needs have outgrown the constraints of rigid TDM-based telephony architectures - the native environments for traditional IVR systems - organizations have begun to move to more flexible Web services deployment models that favor open standards-based (Voice-XML) platforms. This is key! Today, if you were to look at spending trends, more organizations are replacing traditional IVR systems with next generation IVR platforms based on Voice-XML to complement their migration to an IP/Web-centric architecture.

So chances are companies that are investing in Voice-XML platforms are investing in the open standard, interoperability, TCO, and flexibility rather than voice-enablement.


Site Tags:

Agreed!

+1 (that's a big thumbs up in board-speak)

Too often, enterprises are sold on the "VoiceXML is for Speech" story, which is just non realistic. VoiceXML's power is that it's an interpreted language and that it enables a migration towards a services-oriented infrastructure.

I'm an old time speech hack - and we were building complex voice applications with name and address functionality back in 2001. On a Lucent Conversant, no less.

Sorry- my bad!

VoiceXML and speech have been percieved as tightly linked because that's how VoiceXML was marketed initially, and first impressions tend to stick around for a long time. Going back to the initial 1999 press release that alerted the world of its existence, VoiceXML was marketed as a standard that would help drive the market for voice-enabled, internet-based services. This was a mistake, and it still haunts us nearly eight years later because the speech angle deflected attention from some of VoiceXML's most powerful benefits (described very well by Dan in his posting above.) Here's a link to the original press release, which triggered many stories and actions that further reinforced that critical first impression: http://www.voicexml.org/pr19990302-1.html I was one of the folks who reviewed this press release before it went out, so yes, this is partly my bad. At the time I was marketing speech technology, so let's just say I had a bias. I thought speech would be a great reason for customers to adopt the standard, but looking back I think I got it backwards. The main reasons to adopt the VoiceXML standard have nothing to do with speech... they have to do with it being a standard that's based on internet infrastructure. It just so happens that IF you adopt VoiceXML, THEN it becomes much easier to also adopt speech. Happily, markets are smart. Customers eventually figured this out, and many started treating standards adoption and speech adoption as separate considerations. As early as 2001 we began to see a surge in DTMF-based VoiceXML deployments. I still believe that if we hadn't made the mistake of initially positioning VoiceXML as a speech standard, customers would have adopted it at an even faster rate. But enough speculation. Now that you have the inside scoop, you too can help set the record straight. It's about standards, and speech-enablement is a very attractive but optional by-product.