When the terms of service change to make way for AI training

By Eli Tan

AI
Thursday, 27 Jun 2024
12:30 PM MYT

Related News

South Korea 5h ago

ChatGPT hits 1.2 million daily users in S. Korea amid Ghibli-style AI-generated image trend

Technology 19h ago

New AI benchmarks test speed of running AI applications

Pakistan 2h ago

The AI Studio Ghibli trend is an insult to art and artists: Comment

Buried thousands of words into its document, Google tweaked the phrasing for how it used data for its products, adding that public information could be used to train its AI chatbot and other services. — Reuters

SAN FRANCISCO: Last July, Google made an eight-word change to its privacy policy that represented a significant step in its race to build the next generation of artificial intelligence.

The subtle change was not unique to Google. As companies look to train their AI models on data that is protected by privacy laws, they’re carefully rewriting their terms and conditions to include words like “artificial intelligence”, “machine learning” and “generative AI”.

Some changes to terms of service are as small as a few words. Others include the addition of entire sections to explain how generative AI models work, and the types of access they have to user data. Snap, for instance, warned its users not to share confidential information with its AI chatbot because it would be used in its training, and Meta alerted users in Europe that public posts on Facebook and Instagram would soon be used to train its large language model.

Those terms and conditions – which many people have long ignored – are now being contested by some users who are writers, illustrators and visual artists and worry that their work is being used to train the products that threaten to replace them.

“We’re being destroyed already left, right and center by inferior content that is basically trained on our stuff, and now we’re being discarded,” said Sasha Yanshin, a YouTube personality and co-founder of a travel recommendation site.

This month, Yanshin canceled his Adobe subscription over a change to its privacy policy. “The hardware store that sells you a paintbrush doesn’t get to own the painting that you make with it, right?” he said.

StarPicks

QSR Brands posts 25% revenue growth in 1Q25

To train generative AI, tech companies can draw from two pools of data – public and private. Public data is available on the web for anyone to see, while private data includes things like text messages, emails and social media posts made from private accounts.

Public data is a finite resource, and a number of companies are only a few years away from using all of it for their AI systems. But tech giants like Meta and Google are sitting on a trove of private data that could be 10 times the size of its public counterpart, said Tamay Besiroglu, an associate director at Epoch, an AI research institute.

That data could amount to “a substantial advantage” in the AI race, Besiroglu said. The problem is gaining access to it. Private data is mostly protected by a patchwork of federal and state privacy laws that give users some sort of licensing over the content they create online, and companies can’t use it for their own products without consent.

In February, the Federal Trade Commission warned tech companies that changing privacy policies to retroactively scrape old data could be “unfair or deceptive.”

AI training could eventually use the most personal kinds of data, like messages to friends and family. A Google spokesperson said a small test group of users, with permission, had allowed Google to train its AI on some aspects of their personal emails.

Some companies have struggled to balance their hunger for new data with users’ privacy concerns. In June, Adobe faced backlash on social media after it changed its privacy policy to include a phrase about automation that many of its customers interpreted as having to do with AI scraping.

The company explained the changes with a pair of blog posts, saying customers had misunderstood them. On June 18, Adobe added explanations to the top of some sections of its terms and conditions.

“We’ve never trained generative AI on customer content, taken ownership of a customer’s work or allowed access to customer content beyond legal requirements,” Dana Rao, Adobe’s general counsel and its chief trust officer, said in a statement.

This year, Snap updated its privacy policy about data collected by My AI, its AI chatbot that users can have conversations with.

A Snap spokesperson said the company gave “upfront notices” about how it used data to train its AI with the opt-in of its users.

In September, the social platform X added a single sentence to its privacy policy about machine learning and AI. The company did not return a request for comment.

Last month, Meta alerted its Facebook and Instagram users in Europe that it would use publicly available posts to train its AI starting Wednesday, inciting some backlash. It later paused the plans after the European Center for Digital Rights brought complaints against the company in 11 European countries.

In the United States, where privacy laws are less strict, Meta has been able to use public social media posts to train its AI without such an alert. The company announced in September that the new version of its large language model was trained on user data that its previous iteration had not been trained on.

Meta has said its AI did not read messages sent between friends and family on apps like Messenger and WhatsApp unless a user tagged its AI chatbot in a message.

“Using publicly available information to train AI models is an industrywide practice and not unique to our services,” a Meta spokesperson said in a statement.

Many companies are also adding language to their terms of use that protects their content from being scraped to train competing AI.

Yanshin said that he hoped regulators could act fast in creating protections for small businesses like his against AI companies, and that traffic to his travel website had fallen 95% since it began competing with AI aggregators.

“People are going to sit around debating the pros and cons of stealing data because it makes a nice chatbot," he said. “In three, four, five years’ time, there might not be entire segments of this creative industry because we’ll just be decimated.” – The New York Times

Topic:

AI Internet Technology

Is this article useful?

89% of our readers find this article useful

Report a mistake

What is the issue about?

Spelling and grammatical error

Factually incorrect

Story is irrelevant

Email (optional)

Thank you for your report!

Next In Tech News

Others Also Read

Tariffs deliver blow to dented Southeast Asian solar export

Business21m ago

Symbol	Open	High	Low	Last	Chg	%Chg	Vol ('00)
HSI-CWCM	0.080	0.080	0.055	0.065	-0.030	-31.58	1,536,447
HSI-CWCE	0.150	0.175	0.145	0.160	-0.040	-20.00	941,884
HSI-PWD2	0.085	0.090	0.080	0.080	0.010	14.29	852,434
VELOCITY-WB	0.045	0.045	0.045	0.045	0.010	28.57	612,000
TOPGLOV	0.800	0.870	0.795	0.845	0.040	4.97	578,478
SUPERMX	0.730	0.825	0.730	0.775	0.060	8.39	551,774
SAPNRG	0.045	0.050	0.045	0.045	-0.005	-10.00	478,618
T7GLOBAL	0.320	0.360	0.315	0.325	0.005	1.56	447,263
HSI-CWEO	0.100	0.100	0.085	0.095	-0.020	-17.39	420,674
HSI-CWEJ	0.120	0.120	0.100	0.110	-0.025	-18.52	369,073
VS	0.830	0.845	0.795	0.820	-0.025	-2.96	320,075
PERTAMA	0.095	0.105	0.075	0.090	-0.010	-10.00	310,593
HSI-PWD5	0.140	0.150	0.120	0.130	0.025	23.81	310,108
HARTA	1.830	2.080	1.820	2.020	0.160	8.60	257,145
T7GLOBAL-WD	0.055	0.080	0.055	0.060	0.000	0.00	255,059

When the terms of service change to make way for AI training

QSR Brands posts 25% revenue growth in 1Q25

Monthly Plan

Annual Plan

1 month

Next In Tech News

Others Also Read

Tariffs deliver blow to dented Southeast Asian solar export

INTERACTIVE: Time for a term limit?

Bursa Malaysia dips as Trump’s tariffs hit sentiment, last-minute buying cushions losses

Police investigate alleged hit-and-run in Menora Tunnel accident

Soccer-UK set to host 2035 Women's World Cup as sole bidder

'Mums' club nights' – where mothers can party and dance safely

Hold introduction of additional taxes this year, urges FMM after new US tariffs

Jail for father who sexually assaulted intellectually disabled daughter in Singapore

For Trump, tariff gamble brings political risk

Shares in sportswear brands Nike, Adidas and Puma slide after tariffs hit Vietnam

Australia sweats through hottest 12 months in more than 100 years

David Beckham celebrates 50th birthday early with lavish party in Miami

Market Summary

FBM KLCI

25,137,450

Market Movers

Want to listen to full audio?

Majlis SIRIM Industri 2024

Thank you for downloading.

When the terms of service change to make way for AI training

Related News

Save 30% and win Bosch appliances! More Info

Monthly Plan

Annual Plan

1 month

Related stories:

Related News

Next In Tech News

Others Also Read

Trending in Tech

Market Summary

FBM KLCI

25,137,450

Want to listen to full audio?

Majlis SIRIM Industri 2024

Thank you for downloading.