Experiment finds AI boosts creativity individually — but lowers it collectively

Image Credits: Bryce Durbin / TechCrunch

A new study examines whether AI could be an automated helpmeet in creative tasks, with mixed results: it appeared to help less naturally creative people write more original stories — but dampened the creativity of the group as a whole. It’s a trade-off that may be increasingly common as AI tools impinge on creative endeavors.

The study, published in Science Advances, comes from researchers Anil Doshi of University College London and Oliver Hauser of the University of Exeter. And while it’s necessarily limited due to its focus on short stories, it seems to confirm the feeling many have expressed: that AI can be helpful but ultimately offers nothing truly new in creative endeavors.

“Our study represents an early view on a very big question on how large language models and generative AI more generally will affect human activities, including creativity,” Hauser told TechCrunch in an email. “While there is huge potential (and, no doubt, huge hype) for this technology to have big impacts in media and creativity more generally, it will be important that AI is actually being evaluated rigorously — rather than just implemented widely, under the assumption that it will have positive outcomes.”

The experiment had hundreds of people write very short stories (eight sentences or so) on any topic, as long as it was suitable for a broad audience. One group just wrote; a second group was given the opportunity to consult GPT-4 for a single story idea with a few sentences (they could use as much or as little as they liked); a third could get up to five such story starters.

Image Credits: Hauser, Doshi

Once the stories were written, they were evaluated by both their own writers and a second group that knew nothing about the generative AI twist. These people rated the stories on novelty, usefulness (i.e. likelihood of publishing), and emotional enjoyment.

Low creativity, high benefit… High creativity, no benefit

Prior to writing the stories, the participants also completed a word-production task that acts as a proxy for creativity. Creativity can’t be measured directly, but in this case a person’s creativity in writing can at least be approximated (without judgment! Not everyone is a born or practiced writer).

“Capturing something so rich and complex as creativity with any measure seems fraught with complications,” wrote Hauser. “There is, however, a rich set of research around human creativity and there is a live debate about how best to capture the idea of creativity in a measure.”

They said their approach was widely used in academia and well documented in other studies.

What the researchers found was that people with lower creativity metrics scored lowest on evaluations of their stories, which arguably validates the approach. They also saw the largest gains when given the opportunity to use a generated story idea (which, it’s worth noting, the vast majority across the experiment did).

Stories by people with a low creativity score who just wrote were reliably rated lower than others on writing quality, enjoyability, and novelty. Given one AI-generated idea, they scored higher on every metric. Given the choice of five, they scored even higher.

It really appears that for folks struggling with the creative side of writing (at least within this context and definition), the AI helper genuinely improves the quality of their work. This probably resonates with the many people to whom writing does not come naturally, and for whom a language model saying “hey, try this” is the prompt they need to finish a paragraph or start a new chapter.

Image Credits: Hauser, Doshi

But what about the people who scored highly on the creativity metric? Did their writing climb to new heights? Sadly, no. In fact, those participants saw little to no benefit at all, or even (though it’s very close and arguably not significant) worse ratings. It seems that those on the creative side produced their best work when they had no AI help at all.

One can imagine any number of reasons why this might be the case, but the numbers do suggest that, in this situation, AI had a zero to negative effect on writers with innate creativity.

Flattened

But that’s not the part that the researchers were worried about.

Beyond the subjective evaluation of stories by participants, the researchers conducted some analyses of their own. They used OpenAI’s embeddings API to rate how similar each story was to the other stories in its category (i.e. human-only, one AI option, or five AI options).

They found that access to generative AI caused the resulting stories to be closer to the average for their category. In other words, they were more similar and less varied as a group. The total difference was in the 9-10% range, so it’s not like the stories were all clones of one another. And the similarity might simply be an artifact of less practiced writers finishing a suggested story while more creative writers came up with one from scratch.
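To make the idea of "similarity within a category" concrete, here is a minimal sketch of one way such a score could be computed. It is not the researchers' actual code. It assumes each story has already been turned into an embedding vector (the study used OpenAI's embeddings API; the tiny three-number vectors below are made up for illustration), and it assumes cosine similarity averaged over all other stories in the same group as the measure. The function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def mean_similarity_to_others(embeddings):
    """For each story, average its similarity to every other story in the group.

    Needs at least two embeddings, since a lone story has nothing to compare to.
    """
    scores = []
    for i, vec in enumerate(embeddings):
        others = [
            cosine_similarity(vec, other)
            for j, other in enumerate(embeddings)
            if j != i
        ]
        scores.append(sum(others) / len(others))
    return scores


def group_similarity(embeddings):
    """Single number per group: higher means the stories resemble one another more."""
    scores = mean_similarity_to_others(embeddings)
    return sum(scores) / len(scores)


# Toy vectors standing in for real story embeddings (invented, not study data).
human_only = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
ai_assisted = [[1.0, 0.1, 0.0], [1.0, 0.0, 0.1], [0.9, 0.1, 0.1]]

print("human only: ", group_similarity(human_only))
print("AI assisted:", group_similarity(ai_assisted))
```

In this toy setup the "AI assisted" vectors point in nearly the same direction, so their group score comes out higher than the "human only" group's. That is the kind of gap the researchers describe, though their real figures come from actual story embeddings, not invented ones.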

The finding was nevertheless enough to warrant a cautionary note in the conclusions, which I could not condense and so quote in full:

While these results point to an increase in individual creativity, there is risk of losing collective novelty. In general equilibrium, an interesting question is whether the stories enhanced and inspired by AI will be able to create sufficient variation in the outputs they lead to. Specifically, if the publishing (and self-publishing) industry were to embrace more generative AI-inspired stories, our findings suggest that the produced stories would become less unique in aggregate and more similar to each other. This downward spiral shows parallels to an emerging social dilemma: If individual writers find out that their generative AI-inspired writing is evaluated as more creative, they have an incentive to use generative AI more in the future, but by doing so, the collective novelty of stories may be reduced further. In short, our results suggest that despite the enhancement effect that generative AI had on individual creativity, there may be a cautionary note if generative AI were adopted more widely for creative tasks.

The warning echoes the fear in visual art and in web content that if AI leads to more AI, and what it trains on is just more of itself, it could end up in a self-perpetuating cycle of blandness. As generative AI begins to creep into every medium, it is studies like these that act as counterweights to claims of unbounded creativity or new eras of AI-generated films and songs.

Hauser and Doshi acknowledge that their work is just the beginning — the field is brand new, and every study, including their own, is limited.

“There are a number of paths that we expect future research to pick up on. For instance, implementation of generative AI ‘in the wild’ will look very different than our controlled setting,” Hauser wrote. “Ideally, our study helps guide both the technology and how we interact with it to ensure continued diversity of creative ideas, whether it is in writing, or art, or music.”
