Skip to content

Wikidata/Purdue-Data-Mine-2024

Repository files navigation

Project Banner

WMDE x The Data Mine

This repository contains the program materials and student work for Wikimedia Deutschland's project in the 2024 Purdue Data Mine. Students will focus on comparing data from Wikidata with external data sources and then derive and report mismatches for the Wikidata Mismatch Finder. The corrections of these mismatches by the Wikidata community will then serve to improve Wikidata's data and all downstream projects including Wikipedia.

Note: The final blogpost for the project can be found WIP.

Contents

  • mismatch_generation
    • Student work to derive mismatches between Wikidata and external sources
  • notebooks
    • Program materials to introduce Python, Jupyter, Wikidata data access and more

Mismatch Process

You can see the process to generate mismatches at the following Mermaid live editor link. The resulting diagram is further displayed below:

Mismatch generation process