Community Wishlist Survey 2021/Wikidata/Duplicates and merge candidates
Problem : There is an increasing number of items that are empty or possible duplicates
Who would benefit : Wikidata editors
Proposed solution : Improve on prior art like Projectmerge to detect duplicates not only by labels but also by comparing properties and links with other items; migrate the WD:DNM "do not merge" lists to something more usable (for example, as suggested on the discussion page, migrate them to P1889 statements).
More comments :
Phabricator tickets :
Proposer : Sabas88 (talk) 12:38, 20 November 2020 (UTC)
Discussion
Removed the Phabricator task as it's not relevant. --Matěj Suchánek (talk) 15:54, 20 November 2020 (UTC)
@Sabas88: Thanks for your proposal. Is there code for the mentioned projects that we can take a look at? We'd like to have a better understanding of how the projects detect duplicates. Thanks again! Harumi Monroy 19:35, 23 November 2020 (UTC)
Sorry, I can't find it... Help:Merge has a list of tools but I didn't see a relevant git repository. --Sabas88 (talk) 12:45, 25 November 2020 (UTC)
A good idea. Improve on existing tools so that they can better predict whether two items are duplicates. Simplistic example: same name, different description, but both are populated places (or have a similar property: city, village) with very similar geographic locations (within 2 km of each other). --FocalPoint (talk) 05:58, 24 November 2020 (UTC)
Or, if not the same name, perhaps using some other string metric and comparing properties. --Sabas88 (talk) 12:45, 25 November 2020 (UTC)
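The heuristic discussed above (similar label, same kind of place, coordinates within 2 km, and no existing "different from" statement) could be sketched roughly as follows. This is only an illustrative sketch, not how Projectmerge or any existing tool works: the `Item` class, its fields, the function names, the 0.85 similarity threshold, and the choice of `difflib.SequenceMatcher` as the string metric are all assumptions made for the example.

```python
import math
from dataclasses import dataclass, field
from difflib import SequenceMatcher


@dataclass
class Item:
    """Hypothetical, simplified stand-in for a Wikidata item."""
    qid: str
    label: str
    instance_of: str   # illustrative value such as "city" or "village"
    lat: float
    lon: float
    # QIDs already declared distinct from this item (cf. P1889 "different from")
    different_from: set = field(default_factory=set)


def distance_km(a, b):
    """Great-circle distance between two items (haversine formula)."""
    radius = 6371.0  # mean Earth radius in km
    phi1, phi2 = math.radians(a.lat), math.radians(b.lat)
    dphi = phi2 - phi1
    dlmb = math.radians(b.lon - a.lon)
    h = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * radius * math.asin(math.sqrt(h))


def label_similarity(a, b):
    """Case-insensitive similarity ratio in [0, 1]; one possible string metric."""
    return SequenceMatcher(None, a.label.lower(), b.label.lower()).ratio()


def is_merge_candidate(a, b, max_km=2.0, min_similarity=0.85):
    """Return True if the pair looks like a possible duplicate (assumed thresholds)."""
    # Respect explicit "do not merge" information first.
    if b.qid in a.different_from or a.qid in b.different_from:
        return False
    # Only compare items of the same kind.
    if a.instance_of != b.instance_of:
        return False
    # Require the coordinates to be close together.
    if distance_km(a, b) > max_km:
        return False
    # Finally require similar (not necessarily identical) labels.
    return label_similarity(a, b) >= min_similarity
```

A pair flagged by such a check would still need human review before merging; the thresholds here are guesses and would have to be tuned against real data.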
Voting
Support Movses (talk) 19:38, 8 December 2020 (UTC)
Support マイキ (talk) 19:39, 8 December 2020 (UTC)
Support Imz (talk) 20:13, 8 December 2020 (UTC)
Support Ferdi2005 [Mail] 20:44, 8 December 2020 (UTC)
Support Mcampany (talk) 21:40, 8 December 2020 (UTC)
Support YFdyh000 (talk) 22:28, 8 December 2020 (UTC)
Support RXerself (talk) 23:53, 8 December 2020 (UTC)
Support BALA. R Talk 01:44, 9 December 2020 (UTC)
Support Tarnumg (talk) 02:12, 9 December 2020 (UTC)
Support NMaia (talk) 03:09, 9 December 2020 (UTC)
Support Chrisaliv (talk) 05:35, 9 December 2020 (UTC)
Support Example: many location-related duplicates from svwiki and cebwiki, with the actual source seemingly being GeoNames. katpatuka (talk) 06:06, 9 December 2020 (UTC)
Support Omda4wady (talk) 07:32, 9 December 2020 (UTC)
Support Avron (talk) 07:36, 9 December 2020 (UTC)
Support Kpjas (talk) 11:30, 9 December 2020 (UTC)
Support Akela (talk) 12:57, 9 December 2020 (UTC)
Support Delpha (talk) 13:07, 9 December 2020 (UTC)
Support Bietels (talk) 14:13, 9 December 2020 (UTC)
Support Nehaoua (talk) 16:45, 9 December 2020 (UTC)
Support Петър Петров (talk) 17:31, 9 December 2020 (UTC)
Support JAn Dudík (talk) 20:29, 9 December 2020 (UTC)
Support - Darwin Ahoy! 02:02, 10 December 2020 (UTC)
Support - yona B. (D) 08:23, 10 December 2020 (UTC)
Support Susanna Ånäs (Susannaanas) (talk) 11:02, 10 December 2020 (UTC)
Support Euro know (talk) 11:26, 10 December 2020 (UTC)
Support Sasuke Sarutobi (talk) 23:34, 10 December 2020 (UTC)
Support Higa4 (talk) 04:58, 11 December 2020 (UTC)
Support Paucabot (talk) 12:12, 11 December 2020 (UTC)
Support Watty62 (talk) 14:44, 11 December 2020 (UTC)
Support Husky (talk) 16:13, 11 December 2020 (UTC)
Support Bencemac (talk) 16:16, 11 December 2020 (UTC)
Support Poslovitch (talk) 16:45, 11 December 2020 (UTC)
Support Susanna Giaccai (talk) 16:50, 11 December 2020 (UTC)
Support Theklan (talk) 18:20, 11 December 2020 (UTC)
Support BoldLuis (talk) 18:27, 11 December 2020 (UTC)
Support Francois-Pier (talk) 08:35, 12 December 2020 (UTC)
Support Tom Ja (talk) 09:54, 12 December 2020 (UTC)
Support Klaas `Z4␟` V : 15:00, 12 December 2020 (UTC)
Neutral I support the improvements to the merge tools, but for me the most needed change would be a merge tool with an "Unmerge" option, as I see a lot of bad merges made with easy-to-use merge tools by users with very little experience. This proposal does not mention any Unmerge option. --Jarekt (talk) 15:29, 12 December 2020 (UTC)
Support It could be useful, and could encourage use of the "different from" property on similar items. Luca.favorido (talk) 19:42, 12 December 2020 (UTC)
Support . Meiræ 22:00, 12 December 2020 (UTC)
Support Gelli1742 (talk) 20:21, 13 December 2020 (UTC)
Support C. crispus (talk) 07:46, 14 December 2020 (UTC)
Support --Mosbatho (talk) 21:58, 14 December 2020 (UTC)
Support Nurtenge (talk) 06:59, 15 December 2020 (UTC)
Support — SMcCandlish ☺ ☏ ¢ >ʌ ⱷ҅ᴥ ⱷʌ < 08:42, 15 December 2020 (UTC)
Support MTheiler (talk) 15:13, 15 December 2020 (UTC)
Support Utopes (talk) 19:26, 15 December 2020 (UTC)
Support TemboUngwe (talk) 15:28, 16 December 2020 (UTC)
Support --Luan (discussão) 19:50, 16 December 2020 (UTC)
Support F. Riedelio (talk) 10:25, 17 December 2020 (UTC)
Support GiFontenelle (talk) 00:41, 18 December 2020 (UTC)
Support Nashona (talk) 01:51, 19 December 2020 (UTC)
Support Patsagorn Y. (Talk) 04:53, 19 December 2020 (UTC)
Support Iva (talk) 16:03, 20 December 2020 (UTC)
Support — Baidax 💬 17:15, 21 December 2020 (UTC)