AI

Data workers detail exploitation by tech industry in DAIR report

Comment

Image Credits: DAIR/TU Berlin/Data Workers Inquiry

The essential labor of data work, like moderation and annotation, is systematically hidden from those who benefit from the fruits of that labor. A new project puts the lived experiences of data workers around the world in the spotlight, showing firsthand the costs and opportunities of tech work abroad.

Many tedious, thankless, or psychologically damaging tasks have been outsourced to poorer countries, where workers are happy to take on jobs for a fraction of an American or European wage. This labor market joins other jobs of the “dull, dirty, or dangerous” category like electronics “recycling” and shipbreaking. The conditions in moderation or annotation work aren’t as likely to cost you an arm or give you cancer, but that doesn’t make them safe, much less pleasant or rewarding.

The Data Workers’ Inquiry, a collaboration between AI ethics research group DAIR and TU Berlin, are nominally modeled on Marx’s work from the late 19th century identifying labor conditions in reports that are “collectively produced and politically actionable.”

All the reports are freely available, and were launched today at an online event where those running the project discussed it.

The ever-expanding scope of AI applications is built by necessity on human expertise, and that expertise is bought to this day for the lowest dollar value companies can offer without incurring a public relations problem. When you report a post, it doesn’t say “great, we’ll send this to a guy in Syria who will be paid 3 cents to take care of it.” But the volume of reports (and of content deserving of report) is so high that solutions other than mass outsourcing of the work to cheap labor markets don’t really make sense to the companies involved.

Perusing the reports, they are largely anecdotal, and deliberately so. These reports are more on the level of systematic anthropological observation than quantitative analyses.

Quantifying experiences like these often fails to capture the real costs — the statistics you end up with are the type that companies love to trumpet (and therefore to solicit in studies): higher wages than other companies in the area, job creation, savings passed on to clients. Seldom are things like moderation workers losing sleep to nightmares or rampant chemical dependency mentioned, let alone measured and presented.

Take Fasica Berhane Gebrekidan’s report on Kenyan data workers struggling with mental health and drug issues. (The full PDF is here.)

She and her colleagues worked for Sama, which bills itself as a more ethical data work pipeline, but the reality of the job, as the actual people describe it, is unrelenting misery and a lack of support from the local office.

A whistleblower’s image of the moderation work space at Samasource in Kenya.
Image Credits: Fasica Berhane Gebrekidan

Recruited to handle tickets (i.e. flagged content) in local languages and dialects, they are exposed to a never-ending stream of violence, gore, sexual abuse, hate speech and other content that they must view and “action” quickly lest their performance fall below expected levels, leading to docked pay, the report says. For some that’s more than one per minute, meaning they view a minimum of around 500 such items a day. (In case you’re wondering where the AI is here — they are likely providing the training data.)

“It’s absolutely soul-crushing. I’ve watched the worst things one can imagine. I’m afraid that I will be scarred for life for doing this job,” said Rahel Gebrekirkos, one of the contractors interviewed.

Support personnel were “ill-equipped, unprofessional, and under-qualified,” and moderators frequently turned to drugs to cope, and complained of intrusive thoughts, depression, and other problems.

We’ve heard some of this before, but it is relevant to hear that it is happening still. There are several reports of this type, but others are more personal stories, or take different formats.

For instance, Yasser Yousef Alrayes is a data annotator in Syria, working to pay for his higher education. He and his roommate work together on visual annotation tasks like parsing images of text that, as he points out, are often poorly defined, with frustrating demands from clients.

He chose to document his work in the form of a short film that is well worth eight minutes of your time.

Workers like Yasser are often obscured behind many organizational layers, acting as sub-contractors to sub-contractors, so that lines of responsibility are obfuscated should there ever be a problem or lawsuit.

DAIR and TU Berlin’s Milagros Miceli, one of the leaders of the project, told me that they had not seen any comment or changes from the companies indicated in the report, but that it was still early. But the results seem strong enough for them to go back for more: “We’re planning to continue this work with a second cohort of data workers,” she wrote, “most probably from Brazil, Finland, China, and India.”

No doubt there are some who will discount these reports for the very quality that makes them valuable: their anecdotal nature. But while it’s easy to lie with statistics, anecdotes always carry at least some truth in them, for these stories are taken direct from the source. Even if these were the only dozen moderators in Kenya, or Syria, or Venezuela with these problems, what they say should concern anyone who relies on them — which is to say, just about everyone.

More TechCrunch

The essential labor of data work, like moderation and annotation, is systematically hidden from those who benefit from the fruits of that labor. A new project puts the lived experiences…

Data workers detail exploitation by tech industry in DAIR report
Image Credits: DAIR/TU Berlin/Data Workers Inquiry

Hello and welcome back to TechCrunch Space. I hope everyone had a great Independence Day. On to the news!

TechCrunch Space: SpaceX’s big plans for Starship in Florida

Featured Article

Valuations of startups have quietly rebounded to all-time highs. Some investors say the slump is over. 

Generative AI businesses aside, the last couple of years have been relatively difficult for venture-backed companies. Very few startups were able to raise funding at prices that exceeded their previous valuations.   Now, approximately two years after the venture slump began in early 2022, some investors, like IVP general partner Tom…

3 hours ago
Valuations of startups have quietly rebounded to all-time highs. Some investors say the slump is over. 

VPN makers report having received a notification from Apple that their apps have been removed from the App Store in Russia.

Apple removes VPN apps at request of Russian authorities, say app makers

Europe’s next-generation launch vehicle, the Ariane 6, is poised to lift off for the first time tomorrow, as the continent looks to build out sovereign access to space and ensure…

Ariane 6 is the future of European heavy-lift launch — for better or worse

Over the past few days, Ghost says it has achieved two major milestones in its move to become a federated service.

Substack rival Ghost federates its first newsletter

The Samsung event will feature updates to the Galaxy Z Fold, Galaxy Z Flip, as well as more details on the Galaxy Ring and Galaxy AI.

Samsung Unpacked 2024: What we expect and how to watch Wednesday’s hardware event

Amazon has released an all-new version of its Echo Spot ahead of Prime Day, the company announced on Monday. The 2024 version of the Alexa-enabled smart alarm clock costs $79.99,…

Amazon revives its Echo Spot with an upgraded look and improved audio

One of the vendors to benefit from the database boom is Tembo, a startup creating a platform that lets developers deploy different flavors of Postgres.

Tembo capitalizes on the database boom and lands new cash to expand

TechCrunch Disrupt 2024 is set to welcome an impressive lineup of judges for the Startup Battlefield 200 competition, presented this year by Google Cloud. These judges will decide which company…

Mayfield’s Navin Chaddha is coming to TechCrunch Disrupt 2024

Numerous concerns are weighing on the minds of many, whether it’s current global conflicts, climate change or the precarious state of the economy, it is no surprise that the world…

Art therapy app Scribble Journey lets you express emotions through doodles

Pestle addresses the common problem of finding recipes on the web.

Pestle’s app can now save recipes from Reels using on-device AI

These efforts have come as Lucid is looking to start building its Gravity SUV by the end of this year.

Lucid Motors sets new record for EV deliveries as it seeks ‘escape velocity’

Berlin-based food delivery giant Delivery Hero has warned investors it may “ultimately” face an antitrust fine of up to €400 million. The development, reported earlier by Reuters, follows unannounced raids…

Delivery Hero warns it could face €400M antitrust fine

Featured Article

Investors chase wealth tech startups in India as affluent class grows

The high-net-worth and ultra-high-net-worth segments are booming in India, prompting some wealth management firms to aggressively expand their relationship manager networks to capture this market.

21 hours ago
Investors chase wealth tech startups in India as affluent class grows

Featured Article

Seed VCs are turning to new ‘pro rata’ funds that help them compete with the big firms

Three companies with new funds deploy capital to support seed and Series A VCs looking to exercise their pro rata rights.

1 day ago
Seed VCs are turning to new ‘pro rata’ funds that help them compete with the big firms

Here are the latest companies venturing into the gaming scene and details about each offering, including pricing, examples of titles and supported devices. 

YouTube and LinkedIn have games now, and here’s how you can play them

Featured Article

CIOs’ concerns over generative AI echo those of the early days of cloud computing

CIOs trying to govern generative AI have the same concerns they had about cloud computing 15 years ago, but they’ve learned some things along the way.

1 day ago
CIOs’ concerns over generative AI echo those of the early days of cloud computing

It sounds like the latest dispute between Apple and Fortnite-maker Epic Games isn’t over. Epic has been fighting Apple for years over the company’s revenue-sharing requirements in the App Store.…

Epic Games CEO promises to ‘fight’ Apple over ‘absurd’ changes

As deep-pocketed companies like Amazon, Google and Walmart invest in and experiment with drone delivery, a phenomenon reflective of this modern era has emerged. Drones, carrying snacks and other sundries,…

What happens if you shoot down a delivery drone?

A police officer pulled over a self-driving Waymo vehicle in Phoenix after it ran a red light and pulled into a lane of oncoming traffic, according to dispatch records. The…

Waymo robotaxi pulled over by Phoenix police after driving into the wrong lane

Welcome back to TechCrunch’s Week in Review — TechCrunch’s newsletter recapping the week’s biggest news. Want it in your inbox every Saturday? Sign up here. This week, Figma CEO Dylan…

Figma pauses its new AI feature after Apple controversy

We’ve created this guide to help parents navigate the controls offered by popular social media companies.

How to set up parental controls on Facebook, Snapchat, TikTok and more popular sites

Featured Article

You could learn a lot from a CIO with a $17B IT budget

Lori Beer’s work is a case study for every CIO out there, most of whom will never come close to JP Morgan Chase’s scale, but who can still learn from how it goes about its business.

2 days ago
You could learn a lot from a CIO with a $17B IT budget

For the first time, Chinese government workers will be able to purchase Tesla’s Model Y for official use. Specifically, officials in eastern China’s Jiangsu province included the Model Y in…

Tesla makes it onto Chinese government purchase list

Generative AI models don’t process text the same way humans do. Understanding their “token”-based internal environments may help explain some of their strange behaviors — and stubborn limitations. Most models,…

Tokens are a big reason today’s generative AI falls short

After multiple rejections, Apple has approved Fortnite maker Epic Games’ third-party app marketplace for launch in the EU. As now permitted by the EU’s Digital Markets Act (DMA), Epic announced…

Apple approves Epic Games’ marketplace app after initial rejections

There’s no need to worry that your secret ChatGPT conversations were obtained in a recently reported breach of OpenAI’s systems. The hack itself, while troubling, appears to have been superficial…

OpenAI breach is a reminder that AI companies are treasure troves for hackers

Welcome to Startups Weekly — TechCrunch’s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Most…

Space for newcomers, biotech going mainstream, and more

Elon Musk’s X is exploring more ways to integrate xAI’s Grok into the social networking app. According to a series of recent discoveries, X is developing new features like the…

X plans to more deeply integrate Grok’s AI, app researcher finds