The authors would like to thank Ankur Tulsyan of Deloitte Services India Pvt. Ltd. for his contributions to this article.
Cover image by: Kevin Weier
Current versions of chatbots can be inefficient and frustrating to use, besides being confined to answering simple queries that follow a set pattern. But what if a chatbot could help you perform more complex, multistep tasks such as checking your schedule, clearing your calendar, and calling a cab to the airport in 30 minutes?
Our analysis of patents in the area of conversational AI reveals that this might be possible soon.
Rapid adoption of conversational AI will likely be underpinned by innovations in the various steps of chatbot development that have the potential to hasten the creation and training of chatbots and enable them to efficiently handle complex requests—with a personal touch.
By 2022, 70% of white-collar workers will interact regularly with conversational platforms, according to Gartner.
Conversational AI solutions, including chatbots, virtual agents, and voice assistants, have become extraordinarily popular over the last few years, with adoption accelerating in the past year due to COVID-19. Data from various conversational AI vendors showed that the volume of interactions handled by conversational agents increased by as much as 250% in multiple industries.6 These solutions are already delivering significant value for many organizations. According to a recent survey, around 90% of companies reported faster complaint resolution and over 80% reported processing higher call volumes using conversational AI solutions.7 However, the technology still suffers from a number of limitations that make it difficult to use and limit its value. Fortunately, innovators are tackling these challenges. We expect this to lead to much broader adoption of conversational bots in the coming years. Technology and human interaction strategists should take note.
To understand how innovators are working to enhance conversational AI technology, we analyzed conversational AI-related patents filed in the United States over the last couple of years.8 Our analysis, along with Deloitte’s own experience of implementing conversational AI solutions for many clients, suggests we can look forward to progress in at least five areas (figure 1). These developments are likely to increase the value of conversational agents and help to expand their use across industries.
Conversational agents need to be trained on content pertaining to the domain they are used in. For instance, a contact center chatbot would require training data such as live agent logs, transcripts from previous conversations, and FAQs. But training conversational agents with traditional rule-based or manual approaches is tedious and time-consuming. The process can take as much as six to nine months.9 And it tends to result in inadequate understanding and unsatisfying, even biased, behavior.10
One of the most common areas of innovation in conversational AI is improving the training process. Around 20% of patents in our survey related to this—the top category.11 Innovations focus on automating and accelerating the training process to better understand users’ inputs and improve the quality of responses.
One patent covers techniques to automatically generate questions and responses to train customer service agents, deriving new data from past customer interactions with similar virtual assistants or entity-specific data sources such as websites.12 Another patent describes a health care chatbot trained using deep learning-based unsupervised learning techniques. The chatbot can answer patients’ queries about suitable health care providers based on symptoms and insurance coverage. It learns from previous interactions to provide responses in future ones in as few iterations as possible.13 A third describes a method of detecting and mitigating bias in training data that could lead, for instance, to labeling customers as upset when they are not.14 Collectively, these training innovations could get conversational agents up and running faster, producing more satisfying and effective interactions.
Wouldn’t it be great if you could simply instruct your personal assistant to clear your calendar for the afternoon and call a cab in 30 minutes to take you to the airport? Most conversational bots cannot fulfill such a request because they are designed to handle only short, simple queries. They operate in a “tic-tac flow” format where the user asks, and the machine responds synchronously. As a result, they fail to understand multiple intents in a single user command, making the experience inefficient and even frustrating for the user.
Around 18% of the patents in our analysis address the handling of complex conversation scenarios such as this, involving multiple commands in a single utterance or multitopic conversations.15 For example, one patent describes a scenario in which a virtual assistant, asked to set up a business meeting with a client, handles a series of tasks in the background, such as calendar conflict resolution and restaurant table reservations.16 Another patent involves the use of a “concept lattice” technique to build topic-independent agents that can automatically build a dialogue structure to guide the discussion. For instance, a product recommendation agent using concept lattices can interact with the user autonomously about any product category mentioned in the catalogue. Concept lattices are graphical representations of any object of interest (e.g., camera, vehicle, vacation package, daycare provider) and its attributes, with nodes connected by edges representing different possible combinations of the object’s attributes.17 Overall, with these advancements, autonomous agents could asynchronously manage a wide range of conversations effectively.
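To make the idea of multiple intents in a single utterance concrete, here is a minimal sketch in Python. It is not the method described in any of the patents; it simply splits a command on conjunctions and matches each clause against a hypothetical keyword table (`INTENT_KEYWORDS`, `split_clauses`, and `detect_intents` are illustrative names). A real system would likely rely on a trained intent classifier rather than keywords.

```python
import re

# Hypothetical keyword table for illustration only; a production system
# would use a trained intent classifier instead of keyword matching.
INTENT_KEYWORDS = {
    "clear_calendar": ("calendar", "schedule"),
    "book_cab": ("cab", "taxi"),
}


def split_clauses(utterance: str) -> list[str]:
    """Split an utterance into clauses on commas, semicolons, 'and', and 'then'."""
    pattern = r"\s*(?:,|;|\band then\b|\band\b|\bthen\b)\s*"
    return [part for part in re.split(pattern, utterance.lower()) if part]


def detect_intents(utterance: str) -> list[tuple[str, str]]:
    """Return (intent, clause) pairs, at most one intent per clause."""
    found = []
    for clause in split_clauses(utterance):
        for intent, keywords in INTENT_KEYWORDS.items():
            if any(keyword in clause for keyword in keywords):
                found.append((intent, clause))
                break  # first matching intent wins for this clause
    return found


command = ("Clear my calendar for the afternoon and call a cab "
           "in 30 minutes to take me to the airport")
for intent, clause in detect_intents(command):
    print(intent, "<-", clause)
```

Each detected intent could then be dispatched to its own background task, which is what lets the assistant move beyond a one-question, one-answer exchange.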
When chatbots personalize conversations, customers are more likely to take the desired action. In our work implementing chatbots to support high-volume business processes for a range of clients, we have seen that personalization can significantly improve the performance of a chatbot in terms of user experience, cost of operations, and business outcomes. (For a perspective on end-to-end personalization of conversational agents, see figure 2.) Personalization is a major area of innovation, comprising some 16% of patents in our survey.18 For instance, one patent we examined describes customer service agents that tailor their communication style in real time based on customers’ characteristics. If the customer is impatient, the agent will increase the rate of speech; if the customer seems highly dissatisfied, the digital agent will involve a human agent in the conversation.19
Another patent describes a home assistant that can automatically adapt to the age of the user, switching to “kids mode” if appropriate, based on the user’s communication style.20
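The personalization behaviors described above (speaking faster for an impatient customer, bringing in a human for a highly dissatisfied one, switching to “kids mode” for a young user) can be pictured as a small set of rules. The sketch below is illustrative only: the signal scores, the 0.7 threshold, the 1.25 speech-rate multiplier, and all names are assumptions, not details taken from the patents.

```python
from dataclasses import dataclass


@dataclass
class CustomerSignals:
    """Hypothetical signals inferred from the conversation so far."""
    impatience: float       # assumed score from 0.0 to 1.0
    dissatisfaction: float  # assumed score from 0.0 to 1.0
    likely_child: bool      # assumed output of a communication-style classifier


@dataclass
class ResponsePlan:
    speech_rate: float = 1.0        # 1.0 means the default speaking rate
    escalate_to_human: bool = False
    mode: str = "standard"


def plan_response(signals: CustomerSignals, threshold: float = 0.7) -> ResponsePlan:
    """Apply simple personalization rules; threshold and rate are assumed values."""
    plan = ResponsePlan()
    if signals.impatience >= threshold:
        plan.speech_rate = 1.25         # speak faster for impatient customers
    if signals.dissatisfaction >= threshold:
        plan.escalate_to_human = True   # involve a human agent
    if signals.likely_child:
        plan.mode = "kids"              # switch to a child-appropriate mode
    return plan


print(plan_response(CustomerSignals(impatience=0.9, dissatisfaction=0.2, likely_child=False)))
```

In practice the signals would come from models that analyze the customer's speech or text in real time; the rules here only show how such signals might drive the agent's behavior.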
Digital voice assistants have gone mainstream. More than half of US adults use them on smartphones.21 But voice assistants have their weaknesses. They can fail in noisy environments, for instance. And their intensive processing requirements can rapidly drain batteries on portable devices.
Technology vendors and electronic device manufacturers, among other innovators, have devised approaches to improve the performance of voice assistants; 11% of patents in our survey are focused on this area.22 They include techniques to automatically filter out background noise to improve speech recognition by virtual assistants.23 Innovations are also targeting efficient and precise methods for activating voice assistants. One patent describes a method for reducing the likelihood of a virtual assistant being erroneously triggered by background noise. Systems will be able to ignore wake words used in a TV commercial running in the background, for instance.24 Based on these developments, we can expect greater use of voice assistants in busy environments, including offices.
Conversational bots are typically designed for a narrow purpose, such as handling a particular set of queries related to a business function like IT, HR, or finance. Attempts to build general-purpose bots have generally produced poor results. Another area of innovation aims to sidestep this challenge by describing efficient methods for composing multiple specialized chatbots into an ensemble. These virtual assistant ensembles can handle a range of tasks for a user by automatically inferring intent and routing the request to the appropriate specialist agent. Around 7% of patents target multibot architectures, according to our analysis.25
One of these patents describes an enterprise assistant with a single master interface that can route users to virtual assistant specialists for CRM, ERP, and human capital management (HCM).26 Another describes an approach to build personal or home assistants that may suggest an alternate service provider when one fails to respond to a user query. For instance, the assistant may say, “Alexa does not know how to answer that question. Do you want Google to try and answer that?”27 Technology to orchestrate multiple bots and their interactions with internal systems and users makes it easier to maintain and scale conversational bots. Developers can focus on obtaining high performance from chatbots for specific tasks and add new chatbots rather than expanding existing ones, with coordination provided by the routing mechanism or middleware.28
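A routing layer of this kind can be sketched in a few lines of Python. The example below is a simplified illustration under stated assumptions, not the architecture from the patents: intent is inferred with hypothetical keyword lists, each specialist bot is a plain function that returns `None` when it cannot help, and the `Router` class, bot names, and canned replies are all invented for the example.

```python
from typing import Callable, Optional

# A specialist bot takes a query and returns an answer, or None if it cannot help.
Bot = Callable[[str], Optional[str]]


class Router:
    """Routes a query to keyword-matched specialists first, then to the rest."""

    def __init__(self) -> None:
        self._bots: list[tuple[str, tuple[str, ...], Bot]] = []

    def register(self, name: str, keywords: tuple[str, ...], bot: Bot) -> None:
        self._bots.append((name, keywords, bot))

    def route(self, query: str) -> tuple[Optional[str], Optional[str]]:
        text = query.lower()
        # Stable sort: bots whose keywords match come first, others act as fallbacks.
        ranked = sorted(self._bots, key=lambda b: not any(k in text for k in b[1]))
        for name, _, bot in ranked:
            answer = bot(query)
            if answer is not None:
                return name, answer
        return None, None


# Hypothetical specialist bots with placeholder replies.
def erp_bot(query: str) -> Optional[str]:
    return "ERP bot: looking up your invoice." if "invoice" in query.lower() else None


def hcm_bot(query: str) -> Optional[str]:
    text = query.lower()
    return "HCM bot: checking your records." if "leave" in text or "payroll" in text else None


def general_bot(query: str) -> Optional[str]:
    return "General bot: let me find someone who can help."


router = Router()
router.register("erp", ("invoice", "purchase"), erp_bot)
router.register("hcm", ("leave", "payroll"), hcm_bot)
router.register("general", (), general_bot)

print(router.route("How many leave days remain?"))
```

Because each specialist is registered independently, adding a new bot means one more `register` call rather than a change to any existing bot, which mirrors the maintainability argument above.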
Conversational agents have their limits, but many have already proven their worth. And the technology is getting better. With technological improvements on the way, it’s important to keep in mind that success with conversational AI depends on more than technology; good experience design, informed by behavioral science, is crucial.
Collectively, these vectors of progress point toward a future in which engaging and effective conversational agents will be increasingly common. These agents will likely be able to manage complex conversation scenarios with personalized responses. Voice-based assistants will become usable even in busy environments such as offices and public transport. Modular ensembles of agents will be able to handle diverse sets of tasks. The training of conversational agents will get easier, with some agents up and running in weeks, not months. Judging from these vectors of progress, conversational AI is likely to have a long life span. A flawless experience with a conversational agent can feel like magic. Conversational AI isn’t magic. But it’s getting closer all the time.