User talk:Aborruso/Archive 14
Latest comment: 4 years ago by Mohammed Sadat (WMDE) in topic Wikidata weekly summary #430
This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Wikidata weekly summary #426
Here's your quick overview of what has been happening around Wikidata over the last week.
- Events
- The Wikidata track of the LD4 conference on Linked Data in Libraries takes place on Thursday 30 and Friday 31 July. Free, upon registration.
- Seeking feedback on a possible WikiCite@Wikidata's 8th Birthday online conference; and what content/format/timing preferences you have. Please respond to this short survey. [Privacy policy: https://w.wiki/XfN]
- Upcoming: Next Linked Data for Libraries LD4 Wikidata Affinity Group call: Liam Wyatt on WikiCite and its future plans, ways to get involved, and discussions that are happening in the community, 28 July. Agenda
- Past: Wikidata and Wikibase office hour with a focus on the Query Service, July 21st. Notes of the discussions
- Upcoming video: Wikipedia Weekly Network - LIVE Wikidata editing #14, August 1 Facebook YouTube
- Upcoming: Online Wikidata editathon in Swedish #23, August 2
- Press, articles, blog posts, videos
- Croiser des données avec OpenRefine by Ash_Crow (in French, how to match data from inside and outside Wikidata with Open Refine)
- "OBA: An Ontology-Based Framework for Creating REST APIs for Knowledge Graphs"
- Library’s linked-data project gets new grant. "Known as Linked Data for Production, the project is part of a long-term collaboration among Cornell University Library, Stanford Libraries and the School of Library and Information Science at the University of Iowa. Through linked data, information about books and other items in library records will be enhanced by related information from external online sources". By Jose Beduya
- Wikidata Training Workshop 1, by Canadian Arts Presenting Association
- Video: Wikidata Lab XXIV on relative digital positioning (in Portuguese). YouTube
- Video: Women Writers in Review: Integrating special collections into Wikidata. YouTube
- Video: Wikipedia Weekly Network - Entity Schemas and Shape Expressions (ShEx) Facebook YouTube
- Video: Wikipedia Weekly Network - LIVE Wikidata editing #13 Facebook YouTube
- Tool of the week
- We would love suggestions for tools to include in this section of the weekly summary. Please add your suggestions directly under Status updates/Next#Backlog after checking that the tool isn't already listed.
- Other Noteworthy Stuff
- Wikimedia Commons Query Service (WCQS) launches in Beta. This SPARQL endpoint for Structured Data on Commons can federate with Wikidata's Query Service. (Announcement & discussion)
- A new OpenRefine reconciliation service for Wikidata is available. Add it in OpenRefine with
https://wikidata.reconci.link/en/api
or by replacingen
by any other Wikimedia language code.
- Did you know?
- Newest properties:
- General datatypes: pertainym, BTI Governance Index, BTI Status Index, Ofsted inspection rating, distribution map of taxon
- External identifiers: Wiki-Rennes ID, Queensland Biota ID, Australian Weed ID, Encyclopedie berbere ID, Japanese magazine code, Lobbywatch.ch ID of a member of parliament, IVS ID, Svenska Akademiens Ordbok ID, National Registry of Exonerations Case ID, Described and Captioned Media Program producer ID, Český hudební slovník osob a institucí ID, PM20 geo code, PM20 subject code, DATAtourisme ID, OpenCritic critic ID, ASCCEG 2019 ID, SmallGroup ID, ANZSIC 2006 ID, AHECC 2017 ID
- New property proposals to review:
- General datatypes: booking URL, member of lexicon, screen (display) resolution, in pixels, Size comparison diagram, Butcher tableau, related laws and regulations, Water area
- External identifiers: Archive Site Trinity College Cambridge ID, WISAARD ID, Jisho word id, UK Research and Innovation organisation ID, Opta football player ID, Opta football team ID, Opta football competition ID, Power plant operating licence (Turkey), Árvore de Interesse Público ID, MinDat taxon ID, Dizionario Biografico dei Protestanti in Italia ID, Twitter topics numeric ID, People Australia ID
- Query examples:
- Newest properties:
- Development
- Changed the size of image previews to 1024 in the gallery view of the query service to avoid some images not loading sometimes (phabricator:T258241)
- Added an actual space between the entity title and the name of the fallback language (if any), so that the fallback language isn't selected anymore when double-clicking the entity title for copying (phabricator:T256857)
- Fixed the directionality of text pieces in placeholders that mix LTR and RTL (phabricator:T253812)
- Continued work on first pieces of design system to make coding new features easier in the future
- Continued untangling the code of Wikibase Client and Wikibase Repo to make it easier to develop on them
- Finished first piece of research on how to make it easier to access Wikidata's data for programmers - more work to be done
- Preparing to start coding on the Query Builder to make it easier to create queries without having to know SPARQL
- Finished running the scraper that gets potential new references for unreferenced statements and preparing it for publishing
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
Wikidata weekly summary #427
Here's your quick overview of what has been happening around Wikidata over the last week.
- Discussions
- Open request for adminship: Wiki13
- Events
- Upcoming: Search Platform Office Hours—August 5th, 2020. This event will be an occasion to talk about the Query Service.
- Upcoming: Video: Wikipedia Weekly Network - LIVE Wikidata editing #15, August 8 Facebook, YouTube
- Upcoming: Online Wikidata editathon in Swedish #24, August 9
- Press, articles, blog posts, videos
- Wikidata track at the 2020 LD4 Conference on Linked Data in Libraries:
- (30 July) Wikidata Tutorial: Intro to the Basics (by User:Gamaliel)
- (30 July) Advanced Wikidata Tools and Concepts: More Than Just P's and Q's (by User:Mahir256)
- (30 July) Developing a Wikidata Project (by User:Will (Wiki Ed))
- (31 July) VanderBot: Using a Python script to create and update researcher items in Wikidata (by User:Baskaufs)
- (31 July) No bricks without clay: outcomes from the Stanford Wikidata Working Group (by User:Arcadialib)
- (31 July) LD4 Wikidata Affinity Group Wikidata Working Hour: Adding References to Wikidata (by User:Chicagohil)
- Video: Wikipedia Weekly Network - LIVE Wikidata editing #14 Facebook YouTube
- Video: Lexemes in Wikidata - structured lexicographical data for everyone (by Lydia Pintscher), YouTube
- Video: Wikidata presentation (in Turkish), YouTube
- Why You Should Do NLP Beyond English - Nice article giving some context about why it matters to have Lexemes in Wikidata in many different languages
- Wikidata track at the 2020 LD4 Conference on Linked Data in Libraries:
- Tool of the week
- SQID allows you to analyse, browse and query Wikidata. SQID is inspired by Magnus Manske's Reasonator, but focuses on prominently featuring information about Wikidata classes and properties.
- Did you know?
- Newest properties:
- General datatypes: height of center of mass, road number formatter, Vietnamese middle name, heraldic attitude, traffic sign template image
- External identifiers: Tree of Public Interest ID, Denkmaldatenbank Thurgau ID, DSSTOX compound identifier, South Africa EMIS code, Archive Site Trinity College Cambridge ID, WISAARD resource ID, Gateway to Research organisation ID, SÚKL code, Science Fiction Awards Database author ID, Power plant operating licence (Turkey)
- New property proposals to review:
- General datatypes: certified as, number of stages, convergence rate, step count, Alternative form, view, version type for works, advertisement copy
- External identifiers: LibraryThing series identifier, Swiss Industrial Heritage ID, TOPCMB ID, Library of Congress Medium of Performance Thesaurus ID, Sports-Reference.com college basketball school ID, SAT-matrikulo, Signal number, BHF author ID, BHF magazine ID, SPLC Group ID, SPLC Individual ID, Open Civic Data Division Identifiers, RKD thesaurus ID, TCLF ID, Presence compositrices ID of composer, Presence compositrices ID of work
- Query examples:
- Properties and the number of constraint definition statements on them - there are quite a few with 0 constraint definitions
- a graph of MPs and parties in the Swedish Parliament and with whom they worked together with to create motions 2018 SPOILER: >95% is just with people in the same party
- Wealthiest queer people on Wikidata (Source)
- Bubble chart showing the winners of the FA Cup (Source)
- Map of parks in Oslo missing images on Wikidata (Source)
- Commons queries:
- Newest properties:
- Development
- The last week was our quarterly prototyping week. We worked on the following projects. None of them are ready for prime-time yet but we'll continue with them.
- Slices: We've had a lot of requests for accessing dumps of a smaller part of Wikidata's data since rarely anyone needs the complete data in Wikidata. The tricky part is figuring out which part is needed and if any of that can be generalized. We looked into for example how to make dump generation faster so we could potentially produce more smaller dumps that only cover a part of Wikidata's data, either thematically (e.g. humans) or by type of data (e.g. only statements and English labels and aliases but not sitelinks or descriptions).
- REST API: As part of our effort to make it easier to access Wikidata's data for programmers we looked into a REST API. We tried to see if we could cover the existing action API modules in a REST API. We could. We'll take this as input for our ongoing API work now.
- Improving quality ratings through ORES: ORES can judge the quality of an Item automatically. It is currently not very good at it however. We tried a few things to make it more accurate and found some easy wins we'll probably make happen in the next weeks.
- Query manipulator: One of the ways we could potentially improve the load situation of the Wikidata Query Service is by automatically analyzing and then redirecting a bunch of queries to other systems that are more suitable for that particular type of query. The nice thing about that would be that the person/program sending the query wouldn't have to care about it but it'd be done automagically for them. We tried to build such a system and the results look very promising but more work/experimenting is needed, especially together with the WMF Search team.
- The last week was our quarterly prototyping week. We worked on the following projects. None of them are ready for prime-time yet but we'll continue with them.
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
Wikidata weekly summary #428
Here's your quick overview of what has been happening around Wikidata over the last week.
- Discussions
- Open request for adminship: Hazard-SJ, Gnoeee, Wagino 20100516
- Closed request for adminship: Wiki13 (successful)
- Events
- Past: Wikibase Live Session - August 2020. This session had a few people present on some of their work with modeling GLAM data in Wikibase or Wikidata. (replay)
- Upcoming: Next Linked Data for Libraries LD4 Wikidata Affinity Group call: Daniel Mietchen and Lane Rasberry about Scholia, a project to present bibliographic information and scholarly profiles of authors and institutions, 11 August. [Agenda]
- Upcoming: Online Wikidata editathon in Swedish #25, August 16
- Press, articles, blog posts, videos
- Video: Wikipedia Weekly Network - QuickStatements and Distributed Wikidata games Facebook, YouTube
- Video: Collaboration, contribution and use of Wikidata and Wikipedia by academic libraries (in Greek). YouTube
- Librarians work to broaden Vanderbilt’s research reputation with Wikidata tools. "To speed up the creation of metadata about faculty and their publications, Steven Baskauf, data science and data curation specialist for libraries, developed “VanderBot,” a set of scripts that can read and write to Wikidata, greatly improving the efficiency by which Vanderbilt’s faculty are discoverable through Wikidata".
- Tool of the week
- OSM ↔ Wikidata matcher links Wikidata entries to places in OpenStreetMap.
- Other Noteworthy Stuff
- wikidata2df, a Python package for easily turning a Wikidata SPARQL query into a pandas dataframe
- With Wikidata Concept Tree Generator, you can enter any concept and instantly see a visualization of its extended relations. (Source)
- Did you know?
- Newest properties:
- General datatypes: size comparison diagram, view
- External identifiers: Legacy.com newspaper ID, ChemSynthesis ID, Dizionario Biografico dei Protestanti in Italia ID, Maitron des fusillés ID, Fototeka person ID, LibraryThing series ID, TOPCMB ID, Swiss Industrial Heritage ID, Library of Congress Medium of Performance Thesaurus ID
- New property proposals to review:
- General datatypes: image of entrance, extinction date, notation writer, raga, tala, SMARTS, Editio princeps, recording location
- External identifiers: Sochy a města ID osoby, Sochy a města ID sochy, podvignaroda, Have I Been Pwned breach ID, European Investment Bank project ID, WordNet 3.1 Synset Id, cadastral municipality ID, NPR station ID, NYARC Discovery ID, Nasjonalt skoleregister-ID, ERIC Thesaurus ID, American Folklore Society Ethnographic Thesaurus ID, Trismegistos Texts ID
- Query examples:
- Commons queries:
- Newest properties:
- Development
- finalized designs for the query builder to start coding at the beginning of September
- wrapping up the initial work on the design system so that we can start using the first pieces of it in the query builder development
- working on properly linking redirects in recent changes, watchlist and co (phabricator:T255387)
- addressed remaining security review comments about the Wikidata Bridge so that we can deploy it finally on the first Wikipedia
- fixed a bug where string values had the wrong length limit (phabricator:T259440)
- finishing the work of untangling Wikibase Client and Wikibase Repository extensions to make development easier
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
Wikidata weekly summary #429
Here's your quick overview of what has been happening around Wikidata over the last week.
- Discussions
- Closed requests for adminship: Hazard-SJ, Gnoeee, Wagino 20100516 (all successful)
- New request for comments: How (un)important is preserving the historic character of an item?
- Events
- Upcoming: Online Wikidata editathon in Swedish #26, August 23
- Press, articles, blog posts, videos
- Video: Editing Wikidata with information from Son jarocho (in Spanish). YouTube
- Tool of the week
- Entity Explosion: a new multilingual Chrome browser extension. "Taking the power of Wikidata with me wherever I go across the web!". Uses API calls to the Wikidata Query Service to match the URL you are browsing on to a Wikidata item, and then displays data and links to other sites about the same entity. (Video)
- Other Noteworthy Stuff
- Wikidata Bridge v1 will be deployed on Catalan Wikipedia on August 18th
- New description and screenshots for the Simple Query Builder project, feedback welcome
- Help:Dataset sizing
- Did you know?
- Newest properties:
- General datatypes: alternative form
- External identifiers: Henrik Ibsen writings ID, RKD thesaurus ID, TCLF ID, Sculptures and cities database ID for sculptures, Manioc book ID, Presence compositrices ID of composer, Offizielle Deutsche Charts song ID, ToS;DR service numerical identifier, Have I Been Pwned breach ID, Unsplash User ID, EIB project ID, ANZSRC 2020 FoR ID
- New property proposals to review:
- General datatypes: latest start date, earliest end date, Number of votes after transferring, map URL, type of archaeological site, record number, front and back matter
- External identifiers: Sports-Reference.com college football school ID, Panteono de edukado.net, LSG local body code, MobyGames attribute ID, Dictionary of Occupational Titles ID, Frauen in Bewegung 1848–1938, Anais do Museu Paulista article ID, Dictionary of Occupational Titles Code (fourth edition, revised), Proballers player ID, Linked Open Data Cloud identifier, Chrome Webstore extension ID, CTHS publication ID, LBS Physical ID, GBIF occurence ID
- Query examples:
- Largest collections of Picasso content (Source). The National Gallery of Art holds the most with 303 objects
- Bubble chart of snooker world champions (Source)
- Map of streets in Tillydrone named after WW2 military leaders (Source)
- Newest properties:
- Development
- Fixed a bug where a length limit for strings seems to have reverted itself back from 1500 to its default 400 (phabricator:T259440)
- Fixed a bug that Wikibase is not always adding &redirect=no in situations when MediaWiki usually does (phabricator:T255387)
- Wrapping up the initial work on the design system so it is ready for use in the first new feature (Query Builder)
- Fixed the serialization of statements on Forms and Senses not containing the datatype (phabricator:T249206)
- Wrapping up work on the first version of Federated Properties so that other Wikibase installations can use Wikidata's Properties instead of having to maintain their own
- Worked on ensuring the data from the linked data interface at Special:EntityData is always up to date after an edit has been made (phabricator:T128486)
- Enabling clients to use Lua to request labels, descriptions and aliases in some (often minority) languages even when they are not content languages (phabricator:T259340, phabricator:T260118)
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
Wikidata weekly summary #430
Here's your quick overview of what has been happening around Wikidata over the last week.
- Events
- Upcoming: Wikidata distributed birthday in October. You can organize an (online) event with your local group or favorite WikiProject! See also: calendar in progress, information for organizers, 24-hours online meetup
- Upcoming: Next Linked Data for Libraries LD4 Wikidata Affinity Group call: We will talk about gadgets and user scripts. 25 August. Agenda
- Upcoming: Online Wikidata editathon in Swedish #27, August 30
- Upcoming video: Wikipedia Weekly Network - LIVE Wikidata editing #17 Facebook, YouTube
- Press, articles, blog posts, videos
- Commonsense Knowledge in Wikidata (via ArXiv)
- Wikidata-focused presentations at the Workshop "Data Science in Climate and Climate Impact Research" taking place on 20-21 August 2020 in Zurich and online.
- Sarasua, Cristina, & Mietchen, Daniel. (2020, August). Multilingual Structured Climate Research Data in Wikidata - The Community Perspective. Zenodo. http://doi.org/10.5281/zenodo.3994272
- Mietchen, Daniel, & Sarasua, Cristina. (2020, August). Multilingual Structured Climate Research Data in Wikidata - The Data Perspective. Zenodo. http://doi.org/10.5281/zenodo.3994266 (also on YouTube)
- Video: How to add missing descriptions to Wikidata using QuickStatments tool (in Arabic) - YouTube
- Video: Wikipedia Weekly Network - LIVE Wikidata editing #16 Facebook, YouTube
- Video: Introduction to Wikidata (in Malayalam) - YouTube
- Video: Wikidata editing basics (in Chinese) - YouTube
- Tool of the week
- Sophox allows for SPARQL querying of Wikidata and OpenStreetMap in a single query
- Other Noteworthy Stuff
- Two new grant programs from WikiCite, in support of open citations and linked bibliographic data.
- Full documentation, eligibility requirements, selection criteria, program design principles, and contacts at the links. Apply by 1 October.
- Project & events [$2-10k]
- e-Scholarships [per-diem calculated on your city; 1-5 people (single, or as a 'remote group') for 2-4 days, for COVID-era "stay at home" projects. Paid in advance living allowance, no expense report required.]
- Did you know?
- Newest properties:
- General datatypes: SMARTS notation, tala, raga, recording location, law or regulation identifying number, earliest end date, latest start date, extinction date
- External identifiers: Filmstarts title ID, Trismegistos text ID, SPLC Individual ID, ERIC Thesaurus ID, American Folklore Society Ethnographic Thesaurus ID, BHF magazine ID, Macedonian cadastral municipality ID, Monumentbrowser ID, Frauen in Bewegung 1848–1938 ID, Nasjonalt skoleregister ID, BHF author ID, Proballers ID, Opera Online work ID, Opera Online composer ID, Opera Online opera house ID, CTHS publication ID
- New property proposals to review:
- General datatypes: Namesakes, Engineer's Line Reference, bibliography, external auditor, ITF-identificatiecode voor speler 2020, rotated image, mirrored image, combined from, overlaid from, number of deaths in senior care homes, number of cases intensive care, negotiated by, is solution to, Commons category for the exterior of the item, Number of taxpayers, medium
- External identifiers: Abbreviations related to ancient authors and works and to academic works regarding classical antiquity, YIVO Encyclopedia of Jews in Eastern Europe ID, Emporis company ID, Sports-Reference.com college basketball box score ID, FBref.com squad ID, Operator licence number, Jewish Virtual Library person ID, Spanish Artists from the Fourth to the Twentieth Century ID, JSTOR publisher ID, Istrapedia ID, monumenta.ch ID, Microsoft MVP profile ID, Art Bonus ID, The Living New Deal ID, TracesOfWar person ID, Hrvatski biografski leksikon ID, UG Digital Collections, jewishencyclopedia.com ID, Fancyclopedia 3 ID, Hrvatska tehnička enciklopedija ID, Grandterrier.net ID
- Query examples:
- British MPs with sons- or daughters-in-law who were also MPs (source)
- MPs with the largest number of children or childen-in-law who became MPs (source)
- Descendants of Robert Emett (born 1729), with counts of sitelinks and external IDs for them and their spouses (source)
- Programming languages written by women (Source)
- Bar chart showing the number of research output (articles, etc) annotated with a SARSCoV2 proteins as 'main subject' (Source)
- Female soccer players who have a (known) social media account (Source)
- Largest cities in France by population (Source)
- Commons queries:
- Newest WikiProjects: WikiProject Gazetteer
- Newest properties:
- Development
- Deployed the first version of the Wikidata Bridge to Catalan Wikipedia
- Creating Grafana Dashboards for the new Wikidata Bridge so we have some data to help us determine which datatypes to support next for example (phabricator:T260532)
- Finished working on ensuring Labels of Items in some unusual, often minority, languages are still available on Wikipedia and other clients (phabricator: T259340)
- Fixed error messages for API modules that will not work with the first version of Federated Properties (phabricator:T258558)
- Working on improving how ORES judges the quality of an Item to make it more accurate
- Started coding on Automated Configuration Discovery to make it easier for tool builders to make their tools work for other Wikibase instances as well
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!