DIGITAL MARKETING

Silicon personas, ChatGPT & the next big thing in digital marketing

Biases and preferences baked into ChatGPT can generate synthetic user data and silicon personas for product testing and digital marketing.

31 March 2023

James Tyrrell

@JT_bluebird1

james.tyrrell@hybrid.co

All stories

Synthetic user data: silicon personas generated by feeding ChatGPT with representative backstories could help product designers and digital marketers anticipate customer responses. Image credit: Shutterstock Generate.

Getting your Trinity Audio player ready...

In 2023, it’s crystal clear that large language models (LLMs) are on a trajectory to becoming indispensable enterprise tools. Microsoft is investing billions in its partnership with OpenAI to capitalize on the potential of flagship LMMs such as GPT-4. And the fact that some industry executives are urging for an ‘AI pause’ for six months shines a bright light on the pace of disruption to a huge range of jobs, from managers to models. Adding to that shakeup is an intriguing use of LLMs as silicon personas, which was signposted in a research paper that quietly appeared on the arXiv preprint server in September 2022.

What are silicon personas?

Argyle and her colleagues termed these surrogates, silicon subjects, and these silicon personas – which could equally be created to represent user types – turn out to be incredibly lifelike. In the study, the researchers conditioned GPT-3 with the socio-demographic backstories gathered from 2012, 2016, and 2020 US election survey data, and examined whether they could use their silicon subjects to determine the outcome of the election.

The correlation between the silicon subjects and the actual voter data was incredibly strong, with humans and personas remarkably aligned. And the outcome shows that GPT-3 exhibits sufficiently fine-grained biases and preferences to – in this case – determine which party it would vote for based on the backstory of each of the silicon personas.

From digital synapses to digital marketing

But before we start to put flesh on the bones of the next big thing in digital marketing and product testing, let’s take a quick look under the hood of GPT-3. OpenAI’s breakthrough LLM (which was extended and finessed into the basis for the wildly popular ChatGPT) has 175 billion parameters, which was 10 times more than any LLM that had gone before it. The parameters represent the weights at each node in the multi-layered neural network, and determine which of these digital synapses ‘fire’ when GPT-3 is presented with a text prompt and attempts to guess the next word in the sequence.

In principle, the more parameters, the more capable the AI model. But all of those billions of parameters need to be trained, and that commits developers to using vast amounts of data. Initially, parameters carry random values, but over time, during training (which took months, using a custom-designed supercomputer hosted on Microsoft Azure) they converge to their optimum values.

Customer insight and product testing smarts

To give you an idea, we ran a super quick experiment using OpenAI’s ChatGPT web UI, feeding the advanced chatbot with the following –

Persona 1 = a middle-aged man who likes sports and has a gym membership.

And then asking ChatGPT to recommend a list of products targeting persona 1, which generated the sequence of suggestions below –

Fitness tracker or smartwatch, athletic shoes, resistance bands, foam roller or massage ball, workout apparel, protein supplements, water bottle, and gym bag.

But the fun doesn’t stop there. For example, you can tell ChatGPT that the year is 1950 and ask the advanced chatbot to regenerate the list, which now includes –

Dumbbells or barbells, athletic shoes, boxing gloves, jump rope, stationary bikes, athletic supporter, sweatbands, and weightlifting belt.

Already, we’ve got some useful insight into product line trends today compared with the 1950s. But this is just scratching the surface. You can go far with silicon personas. But you don’t have to explore this product testing and digital marketing landscape alone. Kwame Ferreira, Hugo Alves, and their colleagues started experimenting with AI as a way of synthesizing virtual users to solve one of their biggest pain points – product testing. And not only did the synthetic data work well, it helped elsewhere too. “As we evolved our experiments in AI it became clearer that we were getting a lot of value in the product discovery phase,” comments the team, which has opened the process as a beta version dubbed ‘Synthetic Users’.

If you have accurate backstories for your clients and customers, there’s a good chance that you’ll be able to put LLM-powered silicon personas to work to deliver user testing data or marketing insight that would otherwise have taken hours, days, or even weeks to collate. And, not only could you save time on your product research, you’ll likely save money too.