Thoughts on Google’s LaMDA 2

Kane Simms
May 16, 2022
in Article, Opinion

Thoughts on Google’s LaMDA 2 https://vux.world/wp-content/uploads/A146297A-A215-4BDA-90AC-828C87EACAF7_1_201_a-scaled.jpeg 2560 1437 Kane Simms Kane Simms https://secure.gravatar.com/avatar/26839585565b6484d0560f5e365378f0?s=96&d=blank&r=g May 16, 2022 May 16, 2022

At Google I/O, LaMDA 2 was announced. Google’s second iteration of its large language model, LaMDA.

It stands for Language Models for Dialogue Applications and is intended to make having autonomous conversations with machines possible.

It, apparently, can answer questions and, using the context of the question or answer, generate prompts to continue conversations. It can allegedly take pretty complex queries on complex topics and summarise answers plainly, as well as hold multi turn conversations around a specific topic, like ‘dogs’, the ‘ocean’, and others.

Marketing vs reality

As impressed as I tend to be when I see new releases of language models like this, Open AI’s GPT series or this recent one from AI21, I still can’t help but wonder.

Since reading this piece by Emily M Bender on crushing AI hype, I can’t help but scrutinise these kind of announcements. I’m wondering:

a) the incentives of the companies creating the models (and those reporting on them)

b) the reality of the technology vs how it’s explained

c) the practical real world application potential of the technology

d) how you or your business will benefit from it or be impacted by it

Long term incentives

Google is obviously incentivised to create large general language models like this. Long term, If it can generate language and conversations with high levels of accuracy, then what does that mean for Google search? What does it mean for Google Assistant?

Some of the examples demonstrated at Google I/O have the potential to disrupt the very concept of a website.

The List It feature takes a task prompt, like “I want to grow a veg garden”, and generates a list of things you’ll need to do in order to grow one. It’s complete with detailed guidance and tips, just like you might find on Gardeners World or similar sites.

The long term impact for brands

When Google Assistant answers questions by taking content from the internet, it attributes that answer to the website it pulled it from.

With something like LaMDA 2, the language model is so big that it won’t even know where it got the content from. In this instance, it’ll be a concoction of thousands of gardening websites.

Google is essentially building its own, first party knowledge base, using content from across the internet. This could mean that Google would have more control over Google search and Google Assistant. This in theory means better customer experience and increased usage of Google services.

However, you can potentially begin to wave goodbye to attribution on Google Assistant or, potentially in time, positions on search engine results pages, as more traffic is served by Google directly.

Short term

The short term concern is a lot simpler. Google has to compete to be known as the leading AI company on the planet. It has to build large language models and push forward the AI cause and to remain to be seen as the leading AI company on the planet.
Regardless of whether any of this makes it to production, Google still has to show its impressive progress because, if it doesn’t, Open AI or others will.

LaMDA 2 is a show of muscle and technical prowess, rather than V1 of the future. It’s Google showing you its progress to position itself as a leading AI company.

The reality of the technology

LaMDA 2 certainly looks like it has potential, as do all these large language models when they’re unveiled.

Being able to take a prompt like “tell me about the deep ocean” and receive a sensible response, with follow-up prompts to continue the conversation is impressive. Certainly considering that the prompts are produced by the system on its own, after having digested and summarised related content, and prioritised which prompts would be most useful, given the context of the conversation… All things that are usually manual jobs for conversation designers and architects today.

However, most of these large language models, LaMDA 2 included, are miles away from full scale production. In fact, it may never reach production.

LaMDA, like GPT-3 and others, still has issues, producing irrelevant or inaccurate or offensive or out-of-context information.

And, to be fair to Google, it does openly acknowledge that there is still work to be done to improve its accuracy and it is not perfect. Examples of LaMDA2 shown at I/O, and the test case examples available in the up-and-coming test suite, AI Test Kitchen, are just that: tests.

“We’re at the beginning of our journey to make models like these useful to people” explained Sundar Pichai, CEO, Google.

Google describes these test cases as “examples of what it would be like” to use LaMDA2.

Practical application

The real concern for Google is taking this test bed technology and making it ready for full scale, autonomous production usage. A deployment that doesn’t give wrong answers, incorrect facts or offensive or malicious content.

And long term, it’s very difficult to squash issues in self-learning systems or language modals as large as this.

If it’s making up answers on the spot using a next-word-prediction algorithm, based on the configuration of 178 billion parameters, after having digested all the words across the entire internet, books and more, then how on earth will you even be able to monitor what it gets wrong in the first place, in practice and at scale? Which one, few, bunch, stadia or galaxies of parameters are the ones that need work?

It’s very hard.

I’m not saying it’s not impressive. It certainly looks impressive in promo videos and in controlled demo environments with reliable test use cases. Taking that and rolling it into Google search or Google Assistant, though, is a whole other ball game entirely.

How it affects you and your business

The challenge for most businesses reading this is that these large language models aren’t for you. They’re not going to help your customers with your use cases and your knowledge. Maybe never.

These are incredibly large generalistic language models that have been trained on general knowledge, opinions and facts. None of this knowledge (or very little of it) will be relevant for the kind of questions the customers have for your business. It’s a Google thing, for Google customers, and that’s pretty much it, for now.

Whether LaMDA 2, 3, 4, or 28, will be the LaMDA we see in Google’s production products, who knows. I suspect we’ll get there eventually, but slowly.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
lang	session	This cookie is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
resolution	session	This is a functionality cookie used to collect the horizontal value of the visitor screen resolution. It helps in optimizing the website view to the user.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_gtag_UA_111445333_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
ajs_anonymous_id	never	This cookie is set by Segment.io to check the number of ew and returning visitors to the website.
CONSENT	16 years 2 months 25 days 18 hours	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.

Cookie	Duration	Description
bscookie	2 years	This cookie is a browser ID cookie set by Linked share Buttons and ad tags.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__smSessionId	9 hours	No description available.
__smToken	1 year	This cookie is set by the Sumo. This cookie is used for verifying whether the user is logged in or not.
__smVID	1 month	This cookie is set by Sumo. The purpose of the cookie is not yet known.
_mailmunch_visitor_id	never	This cookie is set by MailMunch which is email collection and email marketing platform. We do not know the exact purpose of the cookie.
AnalyticsSyncHistory	1 month	No description
attribution_user_id	1 year	This cookie is set by the provider Typeform. This cookie is used for Typeform usage statistics. It is used in context with the website's pop-up questionnaires and messengering.
cookielawinfo-checkbox-functional	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
debug	never	No description available.
intercom-id-or0x2acp	8 months 26 days 1 hour	No description
intercom-session-or0x2acp	7 days	No description
li_gc	2 years	No description
li_sugr	3 months	No description available.
mailmunch_second_pageview	never	This cookie is set by MailMunch which is email collection and email marketing platform. We do not know the exact purpose of the cookie.
UserMatchHistory	1 month	Linkedin - Used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.