Counteracting the risk of LLMs

Kane Simms
April 5, 2024
in Article, Opinion, Uncategorized

Counteracting the risk of LLMs https://vux.world/wp-content/uploads/DALL·E-2024-04-05-16.14.02-An-editorial-style-digital-image-representing-the-concept-of-Counteracting-the-risk-of-Large-Language-Models-LLMs.-The-image-features-a-central-vi.webp 1792 1024 Kane Simms Kane Simms https://secure.gravatar.com/avatar/26839585565b6484d0560f5e365378f0?s=96&d=blank&r=g April 5, 2024 April 5, 2024

As of now, using LLMs in high-stakes enterprise customer interactions could be considered a risk. Although there are more robust ways of mitigating the hallucination issue emerging, it’s never going to be a 100% solved problem. If LLMs have direct access to private data, without an intermediary platform in between the model and the user, then this is a significant security risk.

So how do companies get the benefits of generative AI, without compromising customer experience and brand reputation?

Let’s consider the safer use cases for LLMs that are available to you right now.

We’ll counteract low-risk with some risky use cases, and tell you why those are best avoided with the currently available technology.

These insights into LLMs were shared by Amelia’s Brandon Nott, CPO and Nick Orlando, Director of Product Marketing, during a recent VUX World podcast.

AI + human = less risky

One of the best ways to mitigate risks with LLMs is to limit the data they can access, and which use cases they’re used for. For example, if the LLM has no access to PII, then you don’t have to worry about that data being mistakenly used to train a model or exposed to other users.

Another place you can use LLMs safely is information summarisation – so long as that information is safe to share! We’re talking about summarising long documents for sharing, or having calls summarised for a live agent so they have a good overview of what a customer has previously spoken about. These are safe because the information is only available internally within your organisation, and it will be filtered by a human, who should be able to spot inaccuracies.

As Brandon says, “anytime you’re using generative AI, which is then going to a human to make a judgement, you inherently have a safety net.”

Another possibility is to have an internal RAG model for employees. Imagine a chatbot that answers employee questions about holiday entitlement and other aspects of their job. You should have the information clearly defined on an internal document. So long as the company’s materials are in order, and there isn’t lots of conflicting information, then someone asking how many holidays have I used up this year should lead to an accurate response. It shouldn’t matter how complicated the company is. Even an international company should be able to have an employee-facing assistant that can accurately answer HR questions, depending on which employee it speaks to, and their local labour laws.

Going even further, it’s possible to generate code fairly safely. AI copilots can speed up the coding process. Again, the most important part of the process is to have a human check the results! So long as you’re doing that, errors would be caught before they go to a live product.

Hallucinations and prompt injections

Let’s consider the other end of the scale. What are the riskiest uses of generative AI?

According to Nick, “really the two biggest risks with generative AI are hallucinations and prompt injection attacks.”

Hallucinations are when the LLM fabricates its response, which appears to have little or no relationship with the facts. If a brand will be held accountable for the things said by AI, then hallucinations are risky. Imagine an AI assistant that invented regulations for insurance claims, for example? That’s a level of risk that no company wants.

With advancements in RAG capabilities, using guardrail-based prompt chaining and having a platform like Amelia between your user and your model, you’re able to mitigate the risk of hallucination in ways that weren’t possible even 12 months ago.

That said, the use case severity will still dictate when to use generative vs deterministic technology.

We don’t want to ignore the risks. If we get it wrong, Nick suggested, “it’s going to make people lose trust in these systems. It’s going to make people more afraid to use them, and it’s going to make people say we need to pump the brakes.”

With a prompt injection, someone attempts to trick the LLM into revealing something it wasn’t supposed to. Of course you don’t want to share a customer’s private information with a third party. It’s an ethical and legal disaster, and the bad headlines would likely get plastered all over the internet too.

This is why you need to be careful about the kinds of data the LLM can access. If you give it private data, for example healthcare or banking information, then there’s a risk someone will be able to access that data.

It’s on the practitioners to help people use LLMs to their maximum benefit, and help them navigate around the issues. We know risks exist. We have to be ever-mindful of them, so that we can use this technology well.

What does ‘good’ look like?

Amelia has considered what is the best process to use at each stage. It’s not about focusing on one technology for everything (for example using an LLM rather than an NLU). It’s about using things if they’re the best choice for that particular job, rather than aiming for ‘one size fits all’.

As Brandon says, they always ask themselves questions before they start. “You’re looking at your use cases, right? Do I need this to go the same way every single time? Do I want to add flavour or colour to a conversation? Do I want to generate code which will be reviewed by a human? Do I want to provide an answer on the fly? And as you look at these use cases, it becomes apparent pretty quickly which methods you want to use.”

There’s no on/off button for risk. It’s a matter of reviewing the use cases, how you’re solving them, and considering the best ways to mitigate the risks within.

Thanks to Nick, Brandon and Amelia for these insights! You can watch the full interview on Linkedin, YouTube, Spotify, Apple Podcasts or wherever you get your podcasts..

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
lang	session	This cookie is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
resolution	session	This is a functionality cookie used to collect the horizontal value of the visitor screen resolution. It helps in optimizing the website view to the user.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_111445333_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
ajs_anonymous_id	never	This cookie is set by Segment.io to check the number of ew and returning visitors to the website.
CONSENT	16 years 2 months 25 days 18 hours	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.

Cookie	Duration	Description
bscookie	2 years	This cookie is a browser ID cookie set by Linked share Buttons and ad tags.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__smSessionId	9 hours	No description available.
__smToken	1 year	This cookie is set by the Sumo. This cookie is used for verifying whether the user is logged in or not.
__smVID	1 month	This cookie is set by Sumo. The purpose of the cookie is not yet known.
_mailmunch_visitor_id	never	This cookie is set by MailMunch which is email collection and email marketing platform. We do not know the exact purpose of the cookie.
AnalyticsSyncHistory	1 month	No description
attribution_user_id	1 year	This cookie is set by the provider Typeform. This cookie is used for Typeform usage statistics. It is used in context with the website's pop-up questionnaires and messengering.
cookielawinfo-checkbox-functional	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
debug	never	No description available.
intercom-id-or0x2acp	8 months 26 days 1 hour	No description
intercom-session-or0x2acp	7 days	No description
li_gc	2 years	No description
li_sugr	3 months	No description available.
mailmunch_second_pageview	never	This cookie is set by MailMunch which is email collection and email marketing platform. We do not know the exact purpose of the cookie.
UserMatchHistory	1 month	Linkedin - Used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.

Counteracting the risk of LLMs

AI + human = less risky

Hallucinations and prompt injections

What does ‘good’ look like?

7 practical tips to implement Microsoft 365 Copilot for your business

Balancing Tech Advancements with a Human Touch in Customer Service