Structured Data on Commons/Newsletter/2015-02-19
Structured Data on Commons update
editGreetings,
After a delay in updates to the Structured data on Commons project, I wanted to catch you up with what has been going on over the past three months. In short: The project is on hold, but that doesn't mean nothing is happening.
The meeting in Berlin in October provided the engineering teams with a lot to start on. Unfortunately the Structured Data on Commons project was put on hold not too long after this meeting. Development of the actual Structured data system for Commons will not begin until more resources can be allocated to it.
The Wikimedia Foundation and Wikimedia Germany have been working to improve the Wikidata query process on the back-end. This is designed to be a production-grade replacement of WikidataQuery integrated with search. The full project is described at Mediawiki.org.This will benefit the structured data project greatly since developing a high-level search for Commons is a desired goal of this project.
The Wikidata development team is working on the arbitrary access feature. Currently it's only possible to access items that are connected to the current page. So for example on Vincent van Gogh you can access the statements on Q5582, but you can't access these statements on c:Category:Vincent van Gogh or c:Creator:Vincent van Gogh. With arbitrary access enabled on Commons we no longer have this limitation. This opens up the possibility to use Wikidata data on Creator, Institution, Authority control and other templates instead of duplicating the data (what we do now). This will greatly enhance the usefulness of Wikidata for Commons.
To use the full potential of arbitrary access the Commons community needs to reimplement several templates in LUA. In LUA it's possible to use the local fields and fallback to Wikidata if it's not locally available. Help with this conversion is greatly appreciated. The different tasks are tracked in phabricator, see https://phabricator.wikimedia.org/T89594 .
Volunteers are continuing to add data about artworks to Wikidata. Sometimes an institution website is used and sometimes data is being transfered from Commons to Wikidata. Wikidata now has almost 35.000 items about paintings. This is done as part of the WikiProject sum of all paintings. This helps us to learn how to d:Wikidata:WikiProject Visual arts/Item structuremodel and refine metadata about artworks. Experience that will of course be very useful for Commons too.
Additionally, the metadata cleanup drive continues to produce results. The drive, which is intended to identify files missing {{information}} or the like structured data fields and to add such fields when absent, has reduced the number of files missing information by almost 100,000 on Commons. You can help by looking for files with similarly-formatted description pages, and listing them at Commons:Bots/Work requests so that a bot can add the {{information}} template on them.
At the Amsterdam Hackathon in November 2014, a couple of different models were developed about how artwork can be viewed on the web using structured data from Wikidata. You can browse two examples here and here. These examples can give you an idea of the kind of data that file pages have the potential to display on-wiki in the future.
The Structured Data project is a long-term one, and the volunteers and staff will continue working together to provide the structure and support in the back-end toward front-end development. There are still many things to do to help advance the project, and I hope to have more news for you in the near future. Contact me any time with questions, comments, concerns.