Wikimedia Foundation Calls on AI Companies to Pay for Wikipedia Content

Wikipedia Asks AI Companies to Stop Scraping Data and to Start Paying Up
CNET

Key Points

  • Wikimedia Foundation asks AI firms to stop free scraping of Wikipedia and pay for API access.
  • The foundation cites $179 million annual operating costs and reliance on donations.
  • AI models need high‑quality, human‑curated data, which Wikipedia provides.
  • Google previously signed a commercial access deal in 2022; other firms have not responded.
  • The move aligns with broader publisher actions demanding compensation for AI training data.

The Wikimedia Foundation, which operates Wikipedia, has asked major AI developers to stop scraping the encyclopedia’s data for free and instead pay to use its Enterprise API. The nonprofit argues that AI models need high‑quality, human‑curated information, but the massive traffic and server load caused by unsanctioned scraping threaten Wikipedia’s sustainability. While some companies, including Google, previously reached commercial agreements, many have not responded to the request. The foundation’s appeal adds to a broader movement by online publishers seeking compensation for the use of their content by AI systems.

Wikimedia’s Request for Fair Compensation

The Wikimedia Foundation, the nonprofit behind Wikipedia, announced that it wants artificial‑intelligence companies to halt the free scraping of Wikipedia’s content and to begin paying for access through its Enterprise API. In a recent blog post, the foundation emphasized that AI models depend on high‑quality, human‑curated information to remain effective, and that Wikipedia’s volunteer‑edited articles provide precisely that level of reliability and sourcing.

Operating Wikipedia is costly. The site, which ranks among the world’s most visited webpages, required $179 million to run during the most recent fiscal year. Funding for Wikipedia comes primarily from donations, and the organization does not display advertising. The foundation warned that when users turn to AI tools instead of visiting Wikipedia directly, they bypass the donation prompts that appear on the site’s homepage, potentially reducing the nonprofit’s revenue.

Proposed Enterprise API Solution

Wikimedia’s proposed solution is to grant AI firms access to its content via an Enterprise API. This approach would allow large‑scale use of Wikipedia data without overburdening the site’s servers and would generate revenue to support the nonprofit’s mission. According to the foundation, the API would let companies “use Wikipedia content at scale and sustainably without severely taxing Wikipedia’s servers, while also enabling them to support our nonprofit mission.”

Industry Response and Existing Agreements

The foundation reached out to representatives from several leading AI companies, including Google, OpenAI, Meta, Perplexity, Anthropic, Microsoft, DeepSeek and xAI. None of the companies immediately replied to requests for comment, and a Wikimedia spokesperson also did not respond right away. However, the foundation noted that Google entered into a commercial agreement to access Wikipedia content in 2022, indicating that some arrangements already exist.

Broader Context of Content Licensing and Lawsuits

Wikimedia’s request comes amid a wave of actions by online content creators demanding compensation for the use of their work by AI systems. Publishers such as Penske, The New York Times and News Corp have filed lawsuits alleging copyright infringement, while organizations like the Associated Press and Reuters have signed licensing deals with AI firms. The heightened scrutiny reflects growing concerns over how AI training data is sourced and whether creators receive fair remuneration.

Implications for the Future of AI and Open Knowledge

If AI companies choose to pay for Wikipedia’s data, the arrangement could set a precedent for other content providers seeking sustainable models for sharing their material with AI developers. It would also reinforce the principle that large‑scale data use should be balanced with the financial realities of maintaining free, volunteer‑driven platforms. Conversely, a refusal by AI firms to pay might push Wikipedia to implement stricter access controls or seek alternative funding mechanisms to ensure its long‑term viability.

#Wikimedia Foundation#Wikipedia#AI companies#OpenAI#Google#Microsoft#Meta#Anthropic#Enterprise API#copyright#licensing#content scraping
Generated with  News Factory -  Source: CNET

Also available in: