If you want to understand how machines pick apart the things we say to pull relevant data from them, then you simply must listen to Raj Tumuluri! Don’t just take our word for it – Openstream AI was recognized by Gartner as the sole Visionary in the January 2022 Magic Quadrant for Enterprise Conversational AI Platforms.
Whether you’ve been working in this field for years or you’ve just jumped on the conversational train, there’s tons to learn from Raj. In his interview with Kane Simms for VUX World, he talks about how to train NLP models, how to think about conversations, and where this industry could be headed.
Words aren’t truth – context is truth
When one of the world’s experts on conversational AI says you need to think multimodally (using multiple ‘modes’ to communicate a message, such as words, images, haptics, etc.), you should listen.
Why does he say this? It’s all about how we naturally communicate. We don’t just use words to express ourselves – how we modulate our voice to say them (prosody) matters, as do our facial expressions, our body language, and more. Even silence communicates a message. So if you focus exclusively on words, you’re setting yourself up for misunderstandings.
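To make the idea concrete, here’s a toy sketch of “late fusion”, one common way multimodal systems combine a text-only score with a signal from another modality (here, a hypothetical prosody score). Every name, weight, and word list below is an illustrative assumption, not Openstream’s method:

```python
# Toy late-fusion sketch (hypothetical, for illustration only):
# combine a naive text score with a prosody score from the voice signal.

def text_score(words: str) -> float:
    """Naive lexical scoring in [-1, 1]: toy word lists, not a real model."""
    positive = {"great", "thanks", "love"}
    negative = {"idiot", "eejit", "terrible"}
    tokens = words.lower().split()
    raw = sum(t in positive for t in tokens) - sum(t in negative for t in tokens)
    return max(-1.0, min(1.0, raw / max(len(tokens), 1) * 3))

def fuse(text: float, prosody: float, w_text: float = 0.5) -> float:
    """Late fusion: weighted average of the two modality scores."""
    return w_text * text + (1 - w_text) * prosody

# "ya eejit" from the text alone reads as strongly negative...
words_only = text_score("ya eejit")   # -1.0
# ...but a warm, playful prosody score (+0.8, assumed here) pulls the
# fused interpretation back toward neutral:
with_voice = fuse(words_only, 0.8)    # -0.1
```

The point of the sketch is the structure, not the numbers: a text-only pipeline is stuck at the first score, while a multimodal one gets a second, independent vote from the voice signal.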
Our conversations are part of an intricate web – our culture, our history, our health, and various other elements create the context of what we’re saying. When we talk with a person or a machine, we rely on that context to infer meaning.
Things often go awry – the person we’re talking to may not know the cultural connotations of what we say, for example. Take the Irish expression “ya eejit”. The words mean ‘you idiot’, but the way they’re said can signal a friendly shared joke. No offence is intended, yet you could take plenty at being called an idiot if you don’t know Irish culture.
This all matters because it’s a huge mistake to analyse words without their wider context. Emotion alone affects meaning in a big way, even before we consider all the other details.
Raj says, “If you lose the original voice signal, then you lost 50% of your ability to glean anything from it. You got the text, but you don’t know what is the emotion with which somebody said it.”
It’s all about understanding the inferred meaning of what people say, rather than trying to program a system to detect the literal meaning of the words. That’s what Openstream AI are experts at.
Focus micro so your bot becomes useful on a macro scale
All this talk about context might make you think you should create an all-singing, all-dancing bot that can do everything under the sun. That’s a path to certain failure.
Funnily enough, the opposite is true. Apply a needle-sharp focus to your specific use case; only then will your bot excel at its goal.
Currently, the majority of bots suit one set of use cases – responding to general FAQ-style questions – but if we want to handle more complex conversations, we need to train very specific language models.
The world will provide the context, you just need to detect it
What Raj alludes to in his interview is a tantalising glimpse of where we’re headed: a world of bots, each specifically tailored to its use case. Depending on the conversation we’re having – and with the ability to infer meaning from context (which Openstream are developing) – we’ll get connected to the best bot for the job.
So, you see, it’s not about covering all bases; it’s about covering your base so well that your bot is the first summoned when the time is right. Like the basic premise of a search engine: the top result is the one that best fits the need.
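The search-engine analogy can be sketched as a tiny router: score the user’s utterance against each specialist bot’s description and hand the conversation to the top result. The bot names, descriptions, and token-overlap scoring below are all hypothetical simplifications of what a real context-aware router would do:

```python
# Hypothetical bot router (illustration only): pick the specialist bot
# whose description best overlaps the user's utterance, like a search
# engine returning its top result.

BOTS = {
    "pharmacy_refills": "refill prescription medication pharmacy pickup",
    "flight_booking": "book flight ticket airline travel departure",
    "it_helpdesk": "password reset laptop vpn network access",
}

def overlap(a: set, b: set) -> float:
    """Jaccard similarity between two token sets."""
    return len(a & b) / len(a | b) if a | b else 0.0

def route(utterance: str) -> str:
    """Return the bot whose description overlaps most with the utterance."""
    tokens = set(utterance.lower().split())
    return max(BOTS, key=lambda bot: overlap(tokens, set(BOTS[bot].split())))

best = route("i need to refill my prescription")  # "pharmacy_refills"
```

A production router would score with semantic embeddings and conversational context rather than raw word overlap, but the shape is the same: many narrow specialists, one ranking step that summons the best one.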
That’s an incredible future, and it’s one that can only occur if we think more broadly about our understanding of conversations, while also thinking about our bot’s use case with a super-sharp focus. Think multimodally, and you’re on your way to building that future. As Raj says, “there is a 30 to 40% reduction in the processing time for completing any task if you employ multimodal techniques.”
This article was written by Benjamin McCulloch. Ben is a freelance conversation designer and an expert in audio production. He has a decade of experience crafting natural-sounding dialogue: recording, editing and directing voice talent in the studio. Some of his work includes dialogue editing for Philips’ ‘Breathless Choir’ series of commercials, a Cannes Pharma Grand Prix winner; leading teams in localizing voices for Fortune 100 clients like Microsoft; and sound design and music composition for video games and film.