In a recent episode of VUX World, I had the opportunity to discuss current trends in enterprise adoption of generative AI and large language models with Matt Taylor, Chief Product Officer and Co-Founder of Knowbl. Matt shared his perspectives on how businesses are adopting LLMs, the challenges and limitations of Retrieval-Augmented Generation (RAG), and some of the ways Knowbl is approaching reliable AI service delivery.
Focusing on internal use cases: minimising risk
A significant trend Matt highlighted, which I’ve also observed from the marketing and positioning of vendors in the CCaaS space, is the current focus on internal, staff-facing use cases for generative AI.
Most enterprises that are exploring generative AI aren’t putting it in front of their customers. They’re using it for things like internal knowledge search, or contact centre agent-facing use cases, such as call summarisation.
This approach is taken to reduce the risk of generating inaccurate or irrelevant responses to customer queries. In theory, these companies are working things out in a lower-risk environment until they have more ‘control’ over the AI models they’re using. Presumably, once they’re comfortable with that ‘control’, they’ll consider releasing something publicly.
This is a fair enough stance, with one exception: generative AI models will always have the risk of hallucination, and you can never fully ‘control’ them.
The quest for control: removing hallucinations
This is one of the primary trends of 2023: the effort to put guardrails around AI models so that you can remove or mitigate the hallucination problem. Everybody is trying, and a few claim to have solved it, but as with all of this stuff, the proof is in the pudding.
The challenges with RAG and semantic embeddings
Two trends in attempting to remove hallucinations are Retrieval-Augmented Generation (RAG) and semantic embeddings.
RAG is used to ground LLMs in external, company-specific data. However, Matt pointed out a critical limitation: RAG can’t guarantee consistent outputs, so on its own it doesn’t solve the hallucination problem. For enterprises, this unpredictability is a deal-breaker.
Many vendors layer a chain of additional prompts on top of RAG to check for things like profanity and accuracy, and to turn retrieved data into sufficiently conversational responses. However, all of this behind-the-scenes prompt chaining isn’t guaranteed to fix the hallucination problem, and businesses can’t vet the responses for accuracy at scale.
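To make that concrete, here’s a minimal sketch of a RAG pipeline with chained guardrail checks. The retriever, the LLM call, and the check logic are all hypothetical stand-ins (real systems use vector search and model-based checks), but the shape is the same: retrieve, generate, run post-hoc checks, and fall back if anything fails.

```python
# Minimal RAG-with-guardrails sketch. All names and logic here are
# illustrative assumptions, not any vendor's actual implementation.

def retrieve(query, knowledge_base):
    """Naive keyword retriever: return passages sharing words with the query."""
    query_words = set(query.lower().split())
    return [p for p in knowledge_base if query_words & set(p.lower().split())]

def call_llm(prompt):
    """Stand-in for a real LLM call; here it just echoes the retrieved context."""
    return prompt.split("Context:\n", 1)[-1].strip()

def passes_guardrails(response, context):
    """Chained post-hoc checks: profanity, then grounding. Neither guarantees accuracy."""
    banned = {"damn"}
    if any(word in response.lower().split() for word in banned):
        return False
    # Crude 'grounding' check: every response word must appear in the retrieved context.
    return set(response.lower().split()) <= set(context.lower().split())

def answer(query, knowledge_base):
    passages = retrieve(query, knowledge_base)
    context = "\n".join(passages)
    response = call_llm(f"Answer using only this context.\nContext:\n{context}")
    if not passes_guardrails(response, context):
        return "Sorry, I can't help with that."  # fall back rather than risk a hallucination
    return response

kb = ["Returns are accepted within 30 days.", "Shipping takes 5 days."]
print(answer("what is your returns policy", kb))  # → Returns are accepted within 30 days.
```

Notice that the checks run after generation: they can catch some bad outputs, but they can’t make the generation step itself deterministic, which is exactly the consistency problem described above.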
This is enough to make the majority of enterprises shy away from customer-facing generative AI implementations.
Knowbl explored semantic embeddings, which use vector representations to match the user’s query with relevant content. While effective in search applications, Matt explained that this method lacked conversational elements like contextual follow-ups, which are essential for a natural interaction.
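The matching step can be sketched in a few lines. Real systems use learned embedding models; a bag-of-words vector stands in here purely to illustrate the mechanics, and the content snippets are invented. The second example shows the limitation Matt described: a context-dependent follow-up carries no standalone meaning, so it matches nothing.

```python
# Toy semantic-matching sketch: count vectors + cosine similarity.
# Bag-of-words is a stand-in for a real embedding model.
import math
from collections import Counter

def embed(text):
    """Toy 'embedding': a bag-of-words count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

content = [
    "our opening hours are 9am to 5pm on weekdays",
    "refunds are processed within 14 days of a return",
]

def best_match(query):
    q = embed(query)
    return max(content, key=lambda c: cosine(q, embed(c)))

# A self-contained query matches well:
print(best_match("when are your opening hours"))

# A conversational follow-up ("what about at the weekend?") has no overlap
# with any content on its own, so pure matching scores it at zero:
print(cosine(embed("what about at the weekend"), embed(content[0])))  # → 0.0
```

This is why search-style matching alone can’t handle contextual follow-ups: without carrying the earlier turns of the conversation, the follow-up query is effectively meaningless to the matcher.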
Mitigating risk and leveraging LLMs for enterprise use cases
To manage some of these limitations, Matt explained Knowbl’s approach to using large language models for enterprise AI applications.
1. Contextual Awareness: Patented technology apparently enables the AI to comprehend previous queries, enabling a more coherent and context-rich conversation.
This isn’t the same as what many are trying today. Most people using LLMs to manage context simply keep the conversation transcript and feed it back through the model with each prompt, at every turn of the conversation. This creates ever-lengthening prompts and actually increases the risk of hallucinations and errors the longer the conversation becomes.
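The naive transcript-stuffing approach looks something like the sketch below (names are hypothetical). The point is the growth pattern: the prompt is rebuilt from the full history every turn, so it grows linearly with conversation length.

```python
# Sketch of the naive 'feed the whole transcript back each turn' pattern.
# Illustrative only; real chat APIs take a message list, but the growth
# behaviour is the same.

def build_prompt(transcript, new_user_turn):
    """Rebuild the full prompt from scratch on every turn."""
    history = "\n".join(f"{role}: {text}" for role, text in transcript)
    return f"{history}\nuser: {new_user_turn}\nassistant:"

transcript = []
for turn in range(1, 6):
    user_msg = f"question number {turn}"
    prompt = build_prompt(transcript, user_msg)
    transcript.append(("user", user_msg))
    transcript.append(("assistant", f"answer number {turn}"))
    print(turn, len(prompt))  # prompt length keeps climbing, turn after turn
```

Longer prompts cost more, hit context limits sooner, and give the model more surface area to latch onto the wrong detail, which is the failure mode described above.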
2. Entity Extraction: Traditionally a challenging task that, without LLMs, can require bucketloads of training data. Even with LLMs, I’ve personally had mixed results with the raw models out of the box. Knowbl leverages LLMs to streamline this process, which Matt claims enhances the accuracy and efficiency of extracting relevant information from conversations.
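A common pattern for LLM-based entity extraction is to ask the model for structured JSON rather than training a dedicated model on labelled data. The sketch below mocks the model call, and the prompt, schema, and field names are illustrative assumptions on my part, not Knowbl’s actual approach.

```python
# Sketch of LLM-based entity extraction via a JSON-output prompt.
# The model call is mocked and the schema is invented for illustration.
import json

def mock_llm(prompt):
    """Stand-in for a real model call; returns a canned JSON extraction."""
    return '{"account_number": "12345", "issue": "billing"}'

def extract_entities(utterance):
    prompt = (
        "Extract the account_number and issue from the utterance below.\n"
        'Reply with JSON only, e.g. {"account_number": "...", "issue": "..."}.\n'
        f"Utterance: {utterance}"
    )
    raw = mock_llm(prompt)
    try:
        return json.loads(raw)        # models sometimes return malformed JSON,
    except json.JSONDecodeError:      # so always parse defensively
        return {}

print(extract_entities("Hi, I have a billing problem on account 12345"))
```

The defensive parse is the important bit: raw models don’t always honour the output format, which is one source of the mixed results mentioned above.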
3. Expedited Workflow Development: By utilising transcript inference, Knowbl accelerates the process of building complex workflows. It takes transcripts from real conversations with agents, summarises the most commonly traversed conversation pathways, and builds workflows automatically from them.
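At its simplest, pathway mining of this kind amounts to counting which sequences of conversation steps recur across transcripts. The transcripts and step labels below are invented, and this is a bare-bones sketch of the idea, not how Knowbl’s transcript inference actually works.

```python
# Sketch of mining agent transcripts for common conversation pathways.
# Transcripts and step labels are made up for illustration.
from collections import Counter

transcripts = [
    ["greet", "verify_identity", "check_balance", "close"],
    ["greet", "verify_identity", "check_balance", "close"],
    ["greet", "verify_identity", "dispute_charge", "escalate"],
    ["greet", "check_balance", "close"],
]

# Count each full pathway; the top ones become candidate workflow skeletons.
pathways = Counter(tuple(t) for t in transcripts)
for path, count in pathways.most_common(2):
    print(count, " -> ".join(path))
```

The most frequent pathways become draft workflows for a human to refine, which is faster than designing every flow from a blank page.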
There are other vendors I’ve seen which have the ability to describe a use case, then have the platform generate the conversation flows. Whether they’re all based on real transcripts though, I’m not so sure.
4. Content Transformation: Most businesses’ content and knowledge isn’t in great shape, yet most folks that attempt RAG simply feed the machine whatever data a brand has. That data is not only usually riddled with inaccuracies and out-of-date content, it’s certainly not written in a conversational style.
You might think this is where LLMs would shine. However, even if the data is in good shape in the first place, you’ve still got the risk of hallucination and an unpredictable user experience.
Matt told me how Knowbl first spends time making sure content and knowledge is clean and up to date, then uses summarisation and rephrasing techniques to transform it into a more conversational style. A human in the loop then checks and approves the content before it’s fed into the knowledge base, and that approved content is used verbatim in agent responses.
So rather than using an LLM to rephrase and summarise content after retrieving it from a knowledge source, Knowbl uses the technology to clean up the data and get it ready for consumption before the content enters the knowledge base in the first place. That’s pretty novel.
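The pipeline described above can be sketched as follows. The rewrite step and function names are my own illustrative assumptions; the key structural point from the conversation is that generation happens offline before ingestion, a human approves the result, and the runtime path is verbatim retrieval with no generation step left to hallucinate.

```python
# Sketch of a 'clean before ingestion' content pipeline: LLM rewriting
# happens offline, a human approves, and runtime serves approved text
# verbatim. All names and the rewrite logic are illustrative stand-ins.

def rewrite_conversationally(text):
    """Stand-in for an LLM summarise/rephrase pass run before ingestion."""
    return "Sure! " + text

def human_approves(text):
    """Stand-in for a human-in-the-loop review gate."""
    return True

knowledge_base = {}

def ingest(topic, raw_text):
    candidate = rewrite_conversationally(raw_text)
    if human_approves(candidate):       # only approved text enters the KB
        knowledge_base[topic] = candidate

def respond(topic):
    # Runtime path: verbatim retrieval of approved content, no generation.
    return knowledge_base.get(topic, "Sorry, I don't have that information.")

ingest("returns", "Items can be returned within 30 days.")
print(respond("returns"))  # → Sure! Items can be returned within 30 days.
```

Because the unpredictable step is moved behind a human review gate, whatever the user sees at runtime has already been vetted, which trades some freshness and effort for predictability.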
We discussed a host of other techniques during the podcast that businesses and developers can use to increase the effectiveness of their AI agents and to leverage LLMs and generative AI for their strengths.