Copyright group takes down Dutch language AI dataset


AI (Artificial Intelligence) letters and robot hand miniature in this illustration, taken June 23, 2023. REUTERS/Dado Ruvic/Illustration/File Photo

AMSTERDAM (Reuters) - Dutch-based copyright enforcement group BREIN has taken down a large language dataset that was being offered for use in training AI models, the organization said on Tuesday.

The dataset included information collected without permission from tens of thousands of books, news sites, and Dutch language subtitles harvested from "countless" films and TV series, BREIN said in a statement.

Director Bastiaan van Ramshorst told Reuters it was not clear whether or how widely the dataset may already have been used by AI companies.

"It's very difficult to know, but we are trying to be on time" to avoid future lawsuits, he said.

He said the European Union's AI Act will require AI firms to disclose what datasets they have used to train their models.

In the U.S., Microsoft-backed OpenAI has been hit with several lawsuits including one by the New York Times for allegedly using copyright material to train its models without permission.

In Denmark, a copyright protection group called Danish Rights Alliance forced the take-down of a huge dataset known as "Books3" last year.

The person who had been offering the Dutch dataset agreed to the terms of a cease and desist order and removed it from a website where it was available for download, BREIN said. The organization did not disclose the person's identity, citing Dutch privacy rules.

(Reporting by Toby Sterling; editing by Jonathan Oatis)

Follow us on our official WhatsApp channel for breaking news alerts and key updates!

   

Next In Tech News

Nintendo Alarmo clock now available from local retailer, priced at a steep RM899
Cutting-edge AI to find missing relatives at ancient Kumbh Mela
French woman faces cyberbullying after falling for fake Brad Pitt
TikTok preparing for U.S. shut-off on Sunday, The Information reports
TikTok calls report of possible sale to Elon Musk’s X ‘pure fiction’
ChatGPT will soon be able to�remind you to walk the dog
Apple wants to keep diversity programmes disavowed by other US firms
Powerfoyle technology can keep small electronic devices running forever
US SEC sues Elon Musk over late disclosure of Twitter stake
Specially equipped drones for complex, high-risk missions

Others Also Read