Encyclopedia Britannica Sues OpenAI Over Alleged Copyright Infringement

Encyclopedia Britannica and Merriam-Webster have filed a lawsuit against OpenAI, claiming the AI company used their copyrighted material to train its models and then generated responses that closely mirror their content. The complaint alleges that GPT‑4 "memorized" large portions of Britannica’s text and can reproduce near‑verbatim excerpts on demand, diverting traffic from the publishers’ sites. The case adds to a growing wave of legal actions by publishers seeking accountability for AI training practices, joining lawsuits from The New York Times and a settlement involving Anthropic.

Background

Encyclopedia Britannica, a long‑standing reference publisher, along with dictionary publisher Merriam‑Webster, have taken legal action against OpenAI. The lawsuit, filed on a Friday, alleges that OpenAI incorporated the publishers’ copyrighted content into its training data for its AI models, including GPT‑4, without permission.

Allegations of Unauthorized Use

The complaint asserts that OpenAI repeatedly copied Britannica’s material, stating that “GPT‑4 itself has ‘memorized’ much of Britannica’s copyrighted content and will output near‑verbatim copies of significant portions on demand.” The plaintiffs provide side‑by‑side examples showing entire passages from Britannica appearing word for word in AI‑generated responses. Similar claims are made by Merriam‑Webster regarding its dictionary entries.

Impact on Web Traffic

Britannica further argues that OpenAI’s AI responses are “cannibalizing” its web traffic. By delivering answers that directly compete with Britannica’s content, the AI model allegedly reduces the number of users who would otherwise visit the publisher’s website, undermining traditional search‑engine referral patterns.

Broader Legal Context

This lawsuit is part of an expanding series of copyright actions targeting AI developers. The New York Times has pursued similar claims against OpenAI, accusing the company of copying large amounts of its copyrighted material. In a separate case, Anthropic settled a class‑action lawsuit over the use of copyrighted books in its AI training, resulting in a $1.5 billion payout to authors.

Potential Implications

If the court finds in favor of Britannica and Merriam‑Webster, OpenAI could face injunctions requiring the removal of infringing content, monetary damages, and possibly new licensing requirements for future AI training. The outcome may also influence how other publishers approach legal strategies against AI firms, potentially reshaping the landscape of content licensing and AI development.

Current Status

The lawsuit is ongoing, with OpenAI yet to respond publicly to the allegations. Both publishers are seeking remedies that would prevent further unauthorized use of their material and address the alleged diversion of web traffic.