Why NPS and CSAT don’t work for measuring your AI efforts

Kane Simms
September 23, 2024
in Articles

Why NPS and CSAT don’t work for measuring your AI efforts https://vux.world/wp-content/uploads/thumbs-up-thumbs-down.png 1920 1080 Kane Simms Kane Simms https://secure.gravatar.com/avatar/94740996e14357fbf39db8e8aac6e9498fd1bbecb6a064baa816eb8a2ed577e4?s=96&d=blank&r=g September 23, 2024 September 24, 2024

When evaluating AI initiatives, I see many organisations use NPS and CSAT as barometers of success. But there’s a problem with both of those metrics that make them ineffective at measuring the success of interactions with AI services.

This has been a top of mind topic lately. I’ve written about how containment is the wrong metric a long while ago. Now, it’s time for NPS and CSAT to take their turn. We covered this in detail in the conversation I had with Pypestream CEO, Richard Smullen on the VUX World podcast, and it’s a conversation I’ve been having with clients for almost 2 years. The eagle eyed among you might be thinking “Hold on a minute, Kane, you recently posted about how the exact same way you measure value today is the exact same way to measure the value of AI tomorrow. I measure value with NPS and CSAT, so what gives?”

As you’ll see in this post, there’s nothing inherently wrong with NPS and CSAT, depending on what you’re trying to measure. There are three levels that you want to measure success within your business and with AI:

Business level metrics like cost, revenue, loyalty and such.
Journey level metrics like satisfaction and goal completion,
Interaction level metrics like experience, adoption and usefulness.

What’s wrong with NPS?

NPS (Net Promoter Score) is a long-term business-level metric that measures how likely a customer is to recommend your brand to a friend. For decades, this has been a keen indicator of happy customers and loyalty.

However, you can’t use NPS to judge the success of an interaction with AI. Because it’s a long term metric, it takes into account everything the customer knows and feels about your brand. It’s not intended to measure the interaction the customer has just had, but in general terms, how loyal they are likely to be. One interaction isn’t enough to sway customer loyalty in many cases.

Let’s say you’re a bank. You have a chatbot. Your customer uses it and then you ask them an NPS question. That doesn’t tell you how effective your chatbot is. It tells you how loyal that customer is.

For example, take Mary. She’s banked with The Bank of Kane for the last 15 years. She trusts the Bank of Kane with her life savings and mortgage. Yet, she has a poor experience with the Bank of Kane’s chatbot (I know, not likely 😉 but play along). When Mary is asked the NPS question, she’s bringing all of her trust and baggage with her. It’s likely she’ll still give a high NPS because she trusts the bank with her life savings!

Now, you might rephrase the question and ask ‘based on your experience of this chatbot, how likely are you to recommend The Bank of Kane to a friend?’ which I’ve seen before. But that’s conflating two types of questions: a short-term question asking about the experience itself and a long-term measure of loyalty. It doesn’t sufficiently differentiate between how you feel about the brand overall vs how you feel about the experience you’ve just had. It’s like a travel agent trying to determine long-term loyalty by asking a question about the flight. Some freak turbulence might have made the flight a nightmare, but that won’t affect the overall holiday.

The Einstein quote; “If I had an hour to solve a problem and my life depended on it, I would use the first 55 minutes determining the proper question to ask, for once I know the proper question, I could solve the problem in less than five minutes“

When trying to quantify the effectiveness of an interaction, NPS is the wrong question.

What about CSAT?

Let’s take CSAT instead, surely that’s a better metric? Well, no, not really. Not if you’re trying to measure the interaction itself.

CSAT (Customer Satisfaction) is a short-term metric that measures how satisfied a customer is with a service in the moment. Generally speaking, it’s a good measure and will tell you whether there are any service issues you need to deal with. But it won’t always tell you about the effectiveness of the specific solution a customer has just interacted with.

For example, Jim is also a customer of The Bank of Kane, and Jim wants to take out a new credit card. So he has a chat with the Kane AI Agent that tells him that his credit rating isn’t high enough and that there isn’t a credit card sufficient for him. Then, he’s asked a CSAT question: how satisfied are you with this service?

What do you think he’ll say? Unsatisfied, of course. Why? Because he didn’t get the outcome he was hoping for.

Notice Jim’s dissatisfaction has nothing to do with the experience of interacting with the AI agent. It’s got everything to do with the fact that he was told something that he didn’t like and he hasn’t got the result he wanted. Even though the Kane AI Agent did everything absolutely right and was able to tell Jim whether he’s eligible in 30 seconds.

So what’s the solution?

The solution: effort

Customer Effort Score asks how effortful the interaction you’ve just had was. It’s got nothing to do with how you feel overall about the brand and so it cuts through the baggage and bias you have. It’s got nothing to do with how you feel about the outcome and result you’ve just received, so it doesn’t penalise a perfectly working service. It simply appraises the interaction you’ve just had and whether or not it was easy or difficult to use.

That’s the score that matters when trying to measure the effectiveness of the interaction with your AI solution. Why? Because why else do you want to use AI? Surely because it’s easier, more efficient and more effective than other channels, modalities or technologies? How do you measure that? Effort.

We have a formula for measuring effort which takes into account turn count, escalations, abandons, fallbacks, disambiguations and goal completion that I’d happily share more about for those interested.

Is it perfect? No. Is it good enough? Absolutely.

So, use NPS and CSAT to measure loyalty and satisfaction on the whole, but not to measure the effectiveness of the interaction itself. Use effort for that.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
lang	session	This cookie is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
resolution	session	This is a functionality cookie used to collect the horizontal value of the visitor screen resolution. It helps in optimizing the website view to the user.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_111445333_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
ajs_anonymous_id	never	This cookie is set by Segment.io to check the number of ew and returning visitors to the website.
CONSENT	16 years 2 months 25 days 18 hours	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.

Cookie	Duration	Description
bscookie	2 years	This cookie is a browser ID cookie set by Linked share Buttons and ad tags.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__smSessionId	9 hours	No description available.
__smToken	1 year	This cookie is set by the Sumo. This cookie is used for verifying whether the user is logged in or not.
__smVID	1 month	This cookie is set by Sumo. The purpose of the cookie is not yet known.
_mailmunch_visitor_id	never	This cookie is set by MailMunch which is email collection and email marketing platform. We do not know the exact purpose of the cookie.
AnalyticsSyncHistory	1 month	No description
attribution_user_id	1 year	This cookie is set by the provider Typeform. This cookie is used for Typeform usage statistics. It is used in context with the website's pop-up questionnaires and messengering.
cookielawinfo-checkbox-functional	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
debug	never	No description available.
intercom-id-or0x2acp	8 months 26 days 1 hour	No description
intercom-session-or0x2acp	7 days	No description
li_gc	2 years	No description
li_sugr	3 months	No description available.
mailmunch_second_pageview	never	This cookie is set by MailMunch which is email collection and email marketing platform. We do not know the exact purpose of the cookie.
UserMatchHistory	1 month	Linkedin - Used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.

Why NPS and CSAT don’t work for measuring your AI efforts

What’s wrong with NPS?

What about CSAT?

The solution: effort

Exploring the Evolution of AI in Search and Contact Centers: Insights from Google’s Vlad Vuskovic

Scaling your AI solution for long term success

Content

Training

Consulting

Connect