The REAL reasons why voice will be HUGE for brands

The REAL reasons why voice will be HUGE for brands 1800 1200 Kane Simms

Let’s take a close look at the real reasons why voice will be huge for brands. 

There are plenty of statistical reasons why voice assistants and voice-first technology will be huge for your brand, but those aren’t really reasons. Those numbers don’t tell the story of the cause. They just predict the effect of increased user adoption and usage.

But why is user adoption of smart speakers and voice-first devices rising? Why are more people speaking to their phones, speakers and household appliances? And what impact will that have on your brand?

Through understanding why voice is growing, you’ll understand why it’s inevitable that voice will be huge for your brand.

So that’s what we’re going to do.

But first, let’s get those stats out of the way

The stats everyone likes to quote are:

Google voice search quote - 20% of mobile searches are voice searches

And, apparently, there were over 30 million smart speakers sold in 2017 and smart speaker adoption is outpacing smartphone adoption.

So, that’s the numbers, now so what?

Those numbers should certainly grab your attention, but they don’t explain what’s actually going on here.

What’s actually going on here is that we’re heading through a time that will fundamentally shift the way we interact with technology.

I’ll say that again.

Voice-first devices will fundamentally shift the way we interact with technology.

That means retraining our expectations. It means creating a whole new set of mental models. It means completely new use cases.

With all of that comes new behaviour. New experiences. New psychology.

Within that are the reasons why voice will be huge. It’s the reasons you can’t quite quantify. The human, behavioural, psychological and emotional reasons.

And they include:

Zero ‘platform friction’

The first of those reasons why voice will be huge for your brand is because of an absence of platform friction on voice-first devices.

What do I mean by platform friction?

With any other form of UX, you can only remove so much friction before you reach the native friction that’s built into the medium you’re designing for. Phones, laptops, desktops, tablets, apps, web browsers; all have inherent friction that’s beyond the control of a designer.

Take a website for example. You might have the best, most user friendly, friction-free website on the planet. Yet, to get there, your user still has to:

  • Take their phone out of their pocket
  • Unlock it
  • Open a web browser
  • Type in a search term
  • Mentally process the results
  • Click on a result (hopefully your website!)

Then they can begin their ‘friction-free’ website experience.

That’s friction that you can do jack shit about. It’s built into the mobile browsing experience and you don’t have an ounce of control or input into it.

With voice, the platform itself is friction-free. The platform has already removed all those barriers and blocakges and opportunities for failure for you (apart from having to download Skills on Alexa).

Mostly, you can just focus on executing a use case.

Voice lends itself to totally and utterly friction-free experiences unlike any other platform that’s ever came before it.

And when you remove friction, you increase speed.

Inherent speed

Voice platforms are far quicker than any other technology platform available right now.

The speed evolution

To impart information or to access information, we’ve gone from pigeons, to post, to phone to the internet. From there, we’ve gone from hyperlinking our way through the web, to email to search, to messaging and, now, to voice.

With each evolutionary step, we get quicker and our expectations get higher.

Try this:

Collar a colleague or friend, put your phones flat on the table and set a timer.

Then, have someone shout a location, any location, and race your friend to see who can find directions first.

The catch?

You can only use your voice and nothing but your voice. Your friend can only use their hands and nothing but their hands.

Who’ll win?

Short of the location being something totally obscure, I bet the person using only their voice will win 99% of the time.

The speed at which you can execute a command or complete a task with your voice far outpaces the speed at which you can do the same with your hands.

And because it’s quicker, it saves people time.


Gary Vee often claims that companies and technologies that succeed are often the ones that save people time.

Think of eBay

Before eBay, you had to gather up a load of belongings, pack your car, drive to a field, unpack your car and stand there at a car boot sale waiting for people to come up to you and negotiate a sale for your unwanted stuff.

The planning involved in running a car boot sale stall is unreal.

Now, with eBay, you can take a photo and have it available to the world to bid on within 5 minutes.

Take the finding a location example above. Even though you might only save 40 seconds on finding a location, just think of how many other routine things you need to find or do on an average day:

  • Find an email
  • Send an email
  • Find a file
  • Check the weather
  • Phone someone
  • Text someone
  • Send a calendar invite
  • Tweet something
  • Set a reminder
  • Cancel a reminder
  • Create a task list
  • Update a task list
  • Find out about a fact or google something

That list is the most basic of basic tasks that most of us do every day. Some of them we’ll do tens of times.

Just imagine if you halved the time it takes to do each of those tasks, then spread that time across a day, week, month, year. You’d end up having an extra hour or so a day. And that’s conservative.

Imagine if, wherever you are right now, you could freeze the clock and have a completely free hour. An hour to do whatever you want. Then, after the hour, you’ll resume from where you are now.

Now, in reality, you aren’t going to have a full hour at the end of your day to spend as you wish. You’ll just be able to act on things and do things quicker, which leads to…

Increased productivity

Brain Romelle pointed out, on the Rene Ritchie Vector podcast, that technology is a tool that allows humans to be more productive.

In the same way we created tools to help us chop wood to build fires, we have tools that help us accomplish tasks today. And that’s what voice does.

With voice, as a tool, with its speed and friction-free nativity, it means that we can get more done in shorter spaces of time, making us more productive.

You can find out about the weather while you’re washing up. You can set a timer while you’re putting something in the oven.

Those are basic, crude examples, but imagine scaling that into the workplace. Imagine being able to book a meeting room, send an email or perform any other routine, time-consuming task, all while doing something else or at less than half the time it takes you currently.

Voice lets people get more done.

Fundamentally social

Smartphones have been slowly desocialising us for years. Some phone use is borderline rude.

Looking down at a screen while someone is talking. Googling something at dinner. It all breaks up the conversation.

Voice-first devices and experiences let people lift their head up again. It lets us keep eye contact. It lets us remain in the flow of the interaction and not have to break our concentration or conversation.

A social timely timer

We met some friends for dinner. They’ve just had a baby and had to feed her before our meal arrived. Instead of having to pick out his phone and withdraw from the group while he set a timer to warm the milk bottle, my friend took out his phone, asked Google to set the timer and placed his phone on the table.

He was able to sort the timer and milk-warming situation without having to break concentration or even lose eye contact.

Seeing someone else do that as if it was nothing was a revelation. Voice-first experiences let us carry on doing what humans are hard-wired to do… Socialise.

Plus, 50% of the time, Amazon Echo’s are used by more than one person. It’s inherently more social.

Being able to continue being engaged with other people while still searching for information or performing tasks is a huge reason why the usage of voice first platforms will continue to rise. And another reason why voice will be huge for your brand.

Infinitely more accessible

The final thing that voice-first platforms have that other technologies don’t is the potential for far more users to use it. The potential reach is gargantuan.

The internet is great, but the devices that we use to access it are shrowded in inequality. Smartphones, laptops, keyboards and mice all have accessibility problems that prevent millions of people from using them.

Now, Apple does have some great accessibility features and I’m sure other platforms do as well, but compared to the most natural interface ever, it’s nothing.

Voice is indiscriminate. Yes, you might have problems using a purely audio based platform if you’re def, but aside from that, just about every person on the planet can use a voice platform.

Plus, with the addition of screens on devices like the Echo Show and cameras that are learning sign language on platforms like Mycroft, the potential reach for this tech is literally every body with an internet connection.

More user attention = more brand attention

All of the above are reasons why voice will take-off and be adopted by more and more people. More of our time and attention will be spent on voice interactions.

And wherever people’s attention is, that’s where brands need to be.

If you’re even remotely interested in new technology, if you have an ounce of care for user behaviour or if you’re looking to get an early foot in the door, then I’d implore you to have a dabble in getting to grips with the voice ecosystem.

    Share via
    Copy link
    Powered by Social Snap