HarperCollins reportedly pens deal with Microsoft to train AI on its books

Publishing giant HarperCollins has agreed to allow a technology company to use “select nonfiction” books to train its artificial intelligence (AI) models.

The company told 404 Media (18 November) that it made a deal with an unnamed “technology company” and that it will allow authors to opt in to the new venture.

Bloomberg reported yesterday (19 November) that Microsoft is the tech company that will team up with HarperCollins and use its nonfiction books to train a new AI model. Exact details about this AI model are currently unknown.

“HarperCollins has a long history of innovation and experimentation with new business models,” the company said in a statement.

“Part of our role is to present authors with opportunities for their consideration while simultaneously protecting the underlying value of their works and our shared revenue and royalty streams. This agreement, with its limited scope and clear guardrails around model output that respects authors’ rights, does that.”

Last week, writer Daniel Kibblesmith shared an email he received asking if he’d consent to include his novel Santa’s Husband in the training bundle. According to screenshots posted by Kibblesmith, the deal was worth $2,500 for each title for a three-year licensing agreement, and would include “certain protections concerning credit and limits of verbatim usage per AI response”. Kibblesmith refused the deal, calling it “abominable”.

In a response to his original post, he said: “Direct any outrage toward the incredibly doable action of purchasing physical books by living authors from local bookstores.”

In May of this year, News Corp, the parent company of HarperCollins, struck a deal with OpenAI to allow the ChatGPT creator to train its AI models on the company’s news content. The deal also allows OpenAI to display news content from several publications owned by News Corp, including The Wall Street Journal and The Sunday Times, in response to questions asked by users of its AI models.

While other news organisations have also struck deals with OpenAI, including The Atlantic and Vox Media, some news organisations and publishers have not been so welcoming of AI disruption. The New York Times is suing the AI giant for allegedly copying and using millions of copyrighted news articles, in-depth investigations and other journalistic work “without permission or payment”.

In October, The Guardian reported that UK ministers are facing a backlash over plans to allow AI companies to train their models on content from publishers and artists by default unless they opt out. Earlier that month, thousands of creatives around the world signed a statement warning AI companies that the unlicensed use of their work to train generative AI models is a “major, unjust threat” to their livelihoods.

When asked by SiliconRepublic.com for comment about the matter, a spokesperson for HarperCollins said: “HarperCollins has reached an agreement with an artificial intelligence technology company to allow limited use of select nonfiction backlist titles for training AI models to improve model quality and performance.

“While we believe this deal is attractive, we respect the various views of our authors, and they have the choice to opt in to the agreement or to pass on the opportunity.

“HarperCollins has a long history of innovation and experimentation with new business models: part of our role is to present authors with opportunities for their consideration while simultaneously protecting the underlying value of their works and our shared revenue and royalty streams.”

The spokesperson concluded: “This agreement, with its limited scope and clear guardrails around model output that respects authors’ rights, does that.”

Ciarán Mather
This article originally appeared on www.siliconrepublic.com.
 
