3 Fundamental Conversational AI Capabilities

Rebecca Christie
July 13, 2023
in Article, Opinion

When designing or developing a conversational assistant, like a chatbot or a voice assistant, there are three fundamental things the AI assistant should be able to do.

This might sound basic, but with all the renewed interest in conversational AI, fuelled by large language models and ChatGPT, it’s worth laying this out, because these fundamentals don’t change.

The three primary capabilities your AI assistant requires are:

1. Understanding language

This relates to the system’s ability to understand a user. This sounds silly and obvious, but it’s the first stumbling block for conversational AI systems.

We have mature tools available for natural language processing, speech recognition, and understanding. Intent-based NLU systems have formed the foundations of most chatbots over the last 5 years, and are generally sound at predicting a user’s intent (provided they’re trained appropriately and optimised frequently).

Therefore, broadly, the language understanding aspect of conversation design has been well-addressed.

The role of LLMs in understanding

We’re in exploratory stages with large language models, but this technology has the potential to take an AI assistant’s ability to understand to human-level accuracy.

Our early experiments indicate that there’s value in having LLMs complement intent-based NLU systems in two ways: by classifying the longer utterances that intent-based systems have always struggled with, and will continue to struggle with, and by generating training data for intent-based models. (LLM capabilities are broader than this, of course, but our clients operate high-value, high-consequence use cases, which aren’t fit for LLM-centric approaches today.)
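To make the idea of an LLM complementing an intent-based NLU concrete, here is a minimal sketch of one possible routing rule: short, confidently classified utterances stay with the intent model, and long or low-confidence ones are handed to an LLM classifier. Everything in it is an assumption for illustration, including the function names, the word-count threshold, the confidence threshold and the toy classifiers that stand in for real models.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class NluResult:
    intent: str
    source: str  # "intent_nlu" or "llm"


def route_utterance(
    utterance: str,
    intent_classifier: Callable[[str], Tuple[str, float]],
    llm_classifier: Callable[[str], str],
    max_words: int = 15,          # hypothetical cut-off for a "long" utterance
    min_confidence: float = 0.7,  # hypothetical confidence threshold
) -> NluResult:
    """Use the intent model by default; fall back to the LLM when the
    utterance is long or the intent model is not confident."""
    intent, confidence = intent_classifier(utterance)
    is_long = len(utterance.split()) > max_words
    if is_long or confidence < min_confidence:
        return NluResult(llm_classifier(utterance), "llm")
    return NluResult(intent, "intent_nlu")


# Toy stand-ins for real models, used only to make the sketch runnable.
def toy_intent_classifier(utterance: str) -> Tuple[str, float]:
    if "balance" in utterance.lower():
        return ("check_balance", 0.92)
    return ("unknown", 0.30)


def toy_llm_classifier(utterance: str) -> str:
    return "report_lost_card" if "lost" in utterance.lower() else "unknown"


if __name__ == "__main__":
    long_text = (
        "I was on holiday last week and I think I lost my card "
        "somewhere near the hotel"
    )
    for text in ("What is my balance?", long_text):
        print(route_utterance(text, toy_intent_classifier, toy_llm_classifier))
```

In a high-consequence use case, the LLM’s answer would likely still be constrained to a fixed list of known intents, so that the downstream dialogue stays deterministic.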

Although it’s early days with LLMs, and there are many gaps in terms of safety and quality assurance, it’s still safe to say that the broader technology’s ability to understand an input isn’t the limiting factor of conversational AI success.

2. Access to knowledge or capability

Obviously, for an AI assistant to be useful, once it’s understood someone, it needs to be able to provide an accurate response.

Various methodologies have been developed for accessing and reasoning with information. The fundamental point of any kind of chatbot or voice assistant is that it has content with which it can answer questions. Or, it has the ability to retrieve data from business systems. Or, the ability to write to business systems. Without such knowledge or capability, the agent would have nothing to offer, rendering it redundant.

There are many ways to structure knowledge for chatbots, from external knowledge bases, to databases, to hard-coded responses. Providing knowledge through an AI assistant isn’t an issue today.

From a data retrieval or data-posting perspective, this is typically a solved problem, too. Many businesses have been through some degree of digital transformation and have their business data available via APIs.

They also have the means and security to provide access to those APIs, so they can be utilised in a conversational interaction. And even if they don’t, it’s not rocket science. The path to follow is well-trodden and clear.

This, therefore, is also not a limiting factor in conversational AI success.
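As a rough illustration of the read and write capabilities described above, the sketch below maps recognised intents to handler functions that read from, or write to, a business system. The order data, intent names and handler functions are all hypothetical, and a plain dictionary stands in for what would normally be an authenticated API call.

```python
from typing import Callable, Dict

# Hypothetical in-memory stand-in for a business system behind an API.
ORDERS: Dict[str, str] = {"A100": "shipped", "A200": "processing"}


def get_order_status(slots: Dict[str, str]) -> str:
    """Read path: retrieve data from the business system."""
    order_id = slots.get("order_id", "")
    status = ORDERS.get(order_id)
    if status is None:
        return f"I couldn't find order {order_id}."
    return f"Order {order_id} is {status}."


def cancel_order(slots: Dict[str, str]) -> str:
    """Write path: change data in the business system."""
    order_id = slots.get("order_id", "")
    if order_id not in ORDERS:
        return f"I couldn't find order {order_id}."
    ORDERS[order_id] = "cancelled"
    return f"Order {order_id} has been cancelled."


# Each intent the assistant understands maps to one capability.
HANDLERS: Dict[str, Callable[[Dict[str, str]], str]] = {
    "order_status": get_order_status,
    "cancel_order": cancel_order,
}


def fulfil(intent: str, slots: Dict[str, str]) -> str:
    """Dispatch an understood intent to the matching capability."""
    handler = HANDLERS.get(intent)
    if handler is None:
        return "Sorry, I can't help with that yet."
    return handler(slots)


if __name__ == "__main__":
    print(fulfil("order_status", {"order_id": "A100"}))
    print(fulfil("cancel_order", {"order_id": "A200"}))
```

The point of the dispatch table is that understanding and capability stay separate: the NLU decides which intent was expressed, and the handler decides what the business system can do about it.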

3. Conversation management

An AI assistant’s ability to manage a conversation is arguably the single biggest determiner of success. Yet, it’s the most underserved, misunderstood and neglected part of most conversational AI (CAI) initiatives.

I previously wrote about adjacency pairs and expandable sequences, the building blocks of conversation design. These are two critical elements that form part of conversation management, but there’s more to it than that.

To manage a conversation well, you need to design the end-to-end interaction in such a way as to give users the best possible chance of meeting their needs.

That means you need to have a good handle on conversation context and state. You need memory, reasoning, learning. You need an understanding of business rules and logic. You need to cater for core primary conversational skills, and secondary conversational skills.
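To show what tracking context, state and business rules can look like in practice, here is a minimal sketch of a slot-filling dialogue manager for a payment flow. The flow, the slot names, the 500 limit and the wording of the prompts are all assumptions made for illustration. It is not how any particular platform implements dialogue management.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

REQUIRED_SLOTS = ["recipient", "amount"]  # hypothetical slots for a payment flow
PAYMENT_LIMIT = 500.0                     # hypothetical business rule


@dataclass
class ConversationState:
    """Memory carried across turns: what was said and what has been collected."""
    slots: Dict[str, str] = field(default_factory=dict)
    history: List[str] = field(default_factory=list)

    def missing_slot(self) -> Optional[str]:
        for name in REQUIRED_SLOTS:
            if name not in self.slots:
                return name
        return None


def next_response(state: ConversationState, user_text: str,
                  extracted: Dict[str, str]) -> str:
    """Update the state with this turn, then choose the next prompt."""
    state.history.append(user_text)
    state.slots.update(extracted)

    missing = state.missing_slot()
    if missing == "recipient":
        return "Who would you like to pay?"
    if missing == "amount":
        return f"How much would you like to send to {state.slots['recipient']}?"

    amount = float(state.slots["amount"])
    if amount > PAYMENT_LIMIT:
        # Business rule: reject the amount but keep the recipient in context.
        del state.slots["amount"]
        return (f"That is over the {PAYMENT_LIMIT:.0f} limit. "
                "Please choose a smaller amount.")
    return f"Sending {amount:.2f} to {state.slots['recipient']}. Shall I confirm?"


if __name__ == "__main__":
    state = ConversationState()
    turns = [("I want to pay someone", {}),
             ("Sam", {"recipient": "Sam"}),
             ("900", {"amount": "900"}),
             ("50", {"amount": "50"})]
    for text, slots in turns:
        print(next_response(state, text, slots))
```

Even this toy version shows why conversation management is harder than the first two capabilities: the right response depends on everything that has been said so far, not just on the latest utterance.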

The challenges of conversation management

The challenges in getting this right today abound. No tech platform I’ve seen to date provides out-of-the-box capabilities to handle the complexities of a conversation particularly well. At least, not in any consistent or standardised way.

For example, IBM Watson uses actions and steps, whereas Google Dialogflow CX has Flows and Pages. Neither comes out of the box with primary conversation skills; both leave it up to developers to craft the scaffolding for each new conversation.

Finding success

For now, regardless of the technology you’re using and the NLP that sits underneath it (LLMs or otherwise), to have a meaningful and successful conversation with customers, you’re not going to be able to avoid these three fundamentals. Get them right, and you’re well on your way to delighted customers and great CX.

Stay tuned for the next few articles on the best way to structure your dialogue management systems to make sure you can handle conversations successfully and consistently.
