CURRENT

Microsoft sued by authors over use of books in AI training

26 Jun 2025, 9:55 AM
Microsoft sued by authors over use of books in AI training

NEW YORK, June 26 — Microsoft has been hit with a suit by a group of authors who claim the company used their books without permission to train its Megatron artificial intelligence model.

Kai Bird, Jia Tolentino, Daniel Okrent and several others alleged Microsoft used pirated digital versions of their books to teach its AI to respond to human prompts. Their suit, filed in New York federal court on Tuesday, is one of several high-stakes cases brought by authors, news outlets and other copyright holders against tech companies including Meta Platforms, Anthropic and Microsoft-backed OpenAI over alleged misuse of their material in AI training.

The complaint against Microsoft came a day after a California federal judge ruled that Anthropic made fair use under United States copyright law of authors’ material to train its AI systems but may still be liable for pirating their books. It was the first US decision on the legality of using copyrighted materials without permission for generative AI training.

Microsoft spokespeople did not immediately respond to a request for comment on the suit. An attorney for the authors declined to comment.

The writers alleged in the complaint that Microsoft used a collection of nearly 200,000 pirated books to train Megatron, an algorithm that gives text responses to user prompts. The complaint said Microsoft used the pirated dataset to create a “computer model that is not only built on the work of thousands of creators and authors, but also built to generate a wide range of expression that mimics the syntax, voice, and themes of the copyrighted works on which it was trained”.

Tech companies have argued that they make fair use of copyrighted material to create new, transformative content, and that being forced to pay copyright holders for their work could hamstring the burgeoning AI industry.

The authors requested a court order blocking Microsoft’s infringement and statutory damages of up to US$150,000 (RM634,650) for each work that Microsoft allegedly misused.

— Reuters

What do you think?

Latest
MidRec
Media Selangor
About Us

Media Selangor Sdn Bhd (MSSB), a subsidiary of Menteri Besar Selangor Incorporated (MBI), is the official media agency of the Selangor State Government. In addition to the Media Selangor news portal (formerly known as Selangorkini & Selangor Journal), Media Selangor also publishes newspapers in Mandarin, Tamil, and English.

Properties
MS: f922288e558c3b7b1d99bd47484377b4
EN: cd68e718a4d41dc8ef70c9d27c60e1f1
ZH: 100cdec69db9bc7fd9f175cab704a072
TA: 7b60ca9b9b7a9838dc33c5db6fb6f38c
TV-MS: 5c53513d790774360d169f98c36ce619