Close Menu
  • Home
  • News
  • Politics
  • Health
  • Business
  • Education
  • Opinion
  • Lifestyle
  • Entertainment
Facebook X (Twitter) Instagram
The Meridian Spy
  • Home
  • News
  • Politics
  • Health
  • Business
  • Education
  • Opinion
  • Lifestyle
  • Entertainment
The Meridian Spy
Home»News»Wikimedia Pushes to Open Wikidata for AI Training
News

Wikimedia Pushes to Open Wikidata for AI Training

meridianspyBy meridianspyOctober 2, 2025No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
A convoy of military trucks with anti aircraft missiles.
Share
Facebook Twitter LinkedIn Pinterest Email
Share
    

Share!

  • Share
  • Tweet

Wikimedia Pushes to Open Wikidata for AI Training

Wikimedia Deutschland has launched a new database designed to make Wikipedia’s vast knowledge base more accessible to artificial intelligence models.

 

The initiative, called the Wikidata Embedding Project, introduces a vector-based semantic search system capable of parsing more than 120 million articles across Wikipedia and its sister sites. The tool also supports the Model Context Protocol (MCP), a new standard that enables AI systems to query external data sources directly.

 

Developed in collaboration with neural search firm Jina and IBM-owned data provider DataStax, the project aims to give developers structured access to verified knowledge for retrieval-augmented generation (RAG) systems. Until now, Wikidata searches were limited to keywords or the specialised query language SPARQL.

 

“Powerful AI can be open and collaborative, rather than monopolised by large corporations,” said Philippe Saadé, project manager for Wikidata AI.

 

The new system organises information semantically. For example, a search for “scientist” will return nuclear physicists, Bell Labs alumni, multilingual translations, images, and related concepts such as “researcher” and “scholar.”

 

The database is available on Toolforge, and Wikimedia will hold a developer webinar on October 9. The launch comes amid rising demand for reliable training data in AI, as companies face legal and financial pressure over the use of copyrighted material. In February, Anthropic agreed to a $1.5 billion settlement with authors whose works had been used in its datasets

READ ALSO  UK will not yield to Trump pressure over Greenland, says PM Starmer

Share this:

  • Click to share on WhatsApp (Opens in new window) WhatsApp
  • Tweet

No related posts.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
meridianspy

Related Posts

Again, National Grid Collapses as Power Generation Drops to 24MW

January 23, 2026

$5bn Bonga S’West Project Gets Presidential Support

January 23, 2026

NYSC, SMEDAN Deepen Partnership on Youth Empowerment

January 23, 2026
Search
Recent Posts
  • Again, National Grid Collapses as Power Generation Drops to 24MW
  • $5bn Bonga S’West Project Gets Presidential Support
  • Tight Monetary Policy Slashes Inflation by 10% – CBN
  • NYSC, SMEDAN Deepen Partnership on Youth Empowerment
  • CCTV, Solar Street Lighting to Secure 1,068km Sokoto–Badagry Superhighway – FG
  • Fubara Survives Impeachment as Rivers CJ Declines to Set Up Panel
  • Nigerian Navy Expands Access to Primary Healthcare in Adamawa Communities
  • FG Pushes Local Power Components to Cut Forex Use
  • AFCON Names 3 Nigeria Players for 2025 Best XI team of the tournament
  • UK will not yield to Trump pressure over Greenland, says PM Starmer
  • FG Targets Capital Projects with ₦87.31bn Aviation Budget
  • DSS arrests Malami after release from Kuje prison
  • Anambra lawmakers set to review state law for enhanced devt.
  • 7 injured in Ondo as building collapses
  • NCC Unveils Spectrum Roadmap to Boost Digital Economy
Categories
  • Business
  • Education
  • Entertainment
  • Foreign
  • Health
  • Investigations
  • Lifestyle
  • News
  • Opinion
  • Politics
  • Sport
Access Bank DiamondXtra Season 16 Rewards
  • About us
  • Contact Us
  • News
  • Politics
  • Health
© 2026 All Right Reserved. Designed by Techjuno

Type above and press Enter to search. Press Esc to cancel.