Community Wishlist Survey 2022/Multimedia and Commons/WikiCommons metadata analysis tool


WikiCommons metadata analysis tool

Problem: The metadata of images is constantly being improved through crowdsourcing, either in a GLAM's own database or on Wikimedia Commons itself. To keep the metadata up to date on all systems, a tool is needed that can compare the metadata and upload it to or download it from Wikimedia Commons.
Proposed solution: A general GLAM analysis tool that compares the metadata of the GLAM sources with metadata from other sources in Wikimedia Commons, ideally built with OpenRefine or Pattypan. Procedure/Rules:
1. Export the metadata from the GLAM (XLS/CSV).
2. Prepare the tables for the analysis tool.
3. Load the tables into the analysis tool.
4. Get/extract the Wikimedia Commons metadata.
5. Compare the metadata (i.e. highlight the differences) and decide:
   - No change -> ignore.
   - Changes by GLAM -> upload the update to WikiCommons via the analysis tool.
   - Changes by WikiCommons -> create a new CSV file for uploading to the GLAM.
Who would benefit: GLAM Institutions on WikiCommons
More comments: First draft @GLAMhack2021
Phabricator tickets:
Proposer: ETH-Bibliothek (talk) 10:32, 21 January 2022 (UTC)
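
The compare-and-decide step in the procedure can be sketched in a few lines of Python. This is only an illustration, not a proposed implementation, and it does not use OpenRefine or Pattypan. It assumes something the proposal does not state: that a baseline snapshot from the last synchronization is available, since without one there is no way to tell which side made a change. All names (`classify`, `compare`, `changes_for_glam_csv`, the record ids and field names) are hypothetical, and the extra "conflict" outcome for fields edited on both sides is an addition to the three rules above.

```python
import csv
import io


def classify(baseline, glam, commons):
    """Decide what to do with one field, given the last synced value (assumed available)."""
    if glam == commons:
        return "ignore"                 # both sides agree, nothing to do
    glam_changed = glam != baseline
    commons_changed = commons != baseline
    if glam_changed and commons_changed:
        return "conflict"               # edited on both sides, needs a human decision
    if glam_changed:
        return "upload_to_commons"      # change made by the GLAM
    return "export_to_glam"             # change made on Commons


def compare(baseline, glam, commons, fields):
    """Return one entry per differing field for records present on both sides."""
    results = []
    for record_id in sorted(glam.keys() & commons.keys()):
        base = baseline.get(record_id, {})
        for field in fields:
            g = glam[record_id].get(field, "")
            c = commons[record_id].get(field, "")
            action = classify(base.get(field, ""), g, c)
            if action != "ignore":
                results.append({"id": record_id, "field": field,
                                "action": action, "glam": g, "commons": c})
    return results


def changes_for_glam_csv(results):
    """Serialize the Commons-side changes as CSV text for re-import by the GLAM."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["id", "field", "new_value"])
    for r in results:
        if r["action"] == "export_to_glam":
            writer.writerow([r["id"], r["field"], r["commons"]])
    return buf.getvalue()


# Made-up example records, keyed by a shared identifier.
baseline = {"img1": {"title": "Bridge", "date": "1920"},
            "img2": {"title": "Lake", "date": "1931"}}
glam = {"img1": {"title": "Bridge over the Aare", "date": "1920"},
        "img2": {"title": "Lake", "date": "1931"}}
commons = {"img1": {"title": "Bridge", "date": "1920"},
           "img2": {"title": "Lake", "date": "1932"}}

results = compare(baseline, glam, commons, ["title", "date"])
glam_csv = changes_for_glam_csv(results)
```

In this made-up data, the title of `img1` was changed by the GLAM and the date of `img2` was changed on Commons, so the first would be queued for upload and the second would end up in the CSV for the GLAM.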

Discussion

As a Wikipedian working with different GLAMs, I strongly support this proposal. We should try to find ways to make use of the improved metadata wherever it is. (To be transparent: as a volunteer I also work with the ETH-library.) --Hadi (talk) 16:51, 21 January 2022 (UTC)
Point to Structured Data on Commons: A good addition and a great idea! What I would really appreciate is if such a tool also pointed to (more) metadata alongside Structured Data on Commons, because structured data contains so much additional knowledge that could be helpful for updating or reconciling with local data. In general, my opinion is that structured data is the bright and sustainable future of metadata on Commons. :-) --Mfchris84 (talk) 11:57, 21 January 2022 (UTC)
Sounds similar to Wikimedia Commons Data Roundtripping and Structured data for GLAM-Wiki/Roundtripping. Jean-Fred (talk) 12:03, 31 January 2022 (UTC)
What you're describing is multiple copies of the same data being kept in different places and getting out of sync, i.e. data redundancy (the bad kind). A real solution to this problem is to not have the redundancy in the first place. And if that's not possible for some reason, the process should be much more streamlined than you describe; the server should be able to pull metadata from an alternate source and display it on the page with an easy UI to select the changes to apply. As for exporting, XLS is completely unnecessary since it's a proprietary, convoluted format; CSV offers a minimum of structure, but I would hope systems out there are capable of reading a semantic structured data export (since Commons already supports structured data). Silver hr (talk) 16:48, 2 February 2022 (UTC)

Voting

Support —The Editor's Apprentice (talk) 18:57, 28 January 2022 (UTC)
Support Hadi (talk) 18:10, 29 January 2022 (UTC)
Support Nicole Graf (talk) 07:16, 31 January 2022 (UTC)
Support Sentenzius (talk) 08:12, 31 January 2022 (UTC)
Support — The preceding unsigned comment was added by Rjl724 (talk) 08:33, 31 January 2022 (UTC)
Support Kleiner T-Rex (talk) 13:36, 31 January 2022 (UTC)
Support FluraFlu (talk) 15:29, 31 January 2022 (UTC)
Support SBB Historic (talk) 17:04, 31 January 2022 (UTC)
Support Carla Jung (talk) 08:02, 1 February 2022 (UTC)
Support Mianga (talk) 08:02, 1 February 2022 (UTC)
Support FJohner64 (talk) 09:46, 1 February 2022 (UTC)
Support Thingofme (talk) 10:06, 1 February 2022 (UTC)
Support Little-creator (talk) 10:48, 1 February 2022 (UTC)
Support Swiss National Library (talk) 11:22, 1 February 2022 (UTC)
Support Si. Leitner (talk) 12:51, 1 February 2022 (UTC)
Support Julihi (talk) 07:26, 2 February 2022 (UTC)
Support El ribi (talk) 08:24, 2 February 2022 (UTC)
Support HHeike (talk) 14:54, 2 February 2022 (UTC)
Support HouseBlaster (talk) 01:09, 3 February 2022 (UTC)
Support Hedger z Castleton (talk) 15:54, 4 February 2022 (UTC)
Support Poacea (talk) 15:32, 5 February 2022 (UTC)
Support--Vulp❯❯❯here! 07:38, 6 February 2022 (UTC)
Support Ayumu Ozaki (talk) 08:10, 6 February 2022 (UTC)
Support —— Eric Liu_（Talk） 09:41, 6 February 2022 (UTC)
Support paul2520 (talk) 16:57, 7 February 2022 (UTC)
Support Tom Ja (talk) 17:45, 7 February 2022 (UTC)
Support Talmoryair (talk) 09:36, 8 February 2022 (UTC)
Support Marcok (talk) 07:51, 10 February 2022 (UTC)
Support AnanasL (talk) 07:55, 10 February 2022 (UTC)