Wikisource Community User Group/November 2017 Hangout
November 2017 Wikisource Hangout
Google Hangouts: join meeting
Etherpad: open etherpad
To talk about various Wikisource things, such as:
- Wikisource and Wikidata (and some infos about the WikidataCon in Berlin last weekend)
- a second Wikisource conference in 2018 ?
- various issues or accomplishments you want to share or need help from others
- the Wikisource Community User Group
- Add thing here
Notes will be recorded on an etherpad.
If anyone has any trouble joining the Hangout, get on #wikisourceconnect on IRC and we'll try to sort it out.
Attendees
edit- Samwilson
- VIGNERON
- Anika
Notes
edit2016 WSUG report
editThe 2016 Report is done; let's not forget the 2017 one.
Connections with Wikidata
editMore connection, between Wikisources projects and Wikidata, is desirable.
Problem with the Authors, not always persons (institutions, organisation, etc.), linking problems. Maybe only a en.ws problem. No portals on de.ws (no Portal: namespace, nor Author: namespace ). Need to figure out the cross-Wikisource situation better.
For example s:Portal:United States is d:Q5365167 (correction: https://www.wikidata.org/w/index.php?title=Q5365167&diff=480636521&oldid=475102085 ) but works are instead authored on Wikdiata by d:Q30 (although there is also e.g. s:it:Autore:Stati Uniti d'America).
A lots of mistakes to fix on Wikidata... Data not as decided by the group. But the work/edition structure (via d:Property:P629) is agreed on.
New mediawiki extension called mw:Extension:Wikisource! Sam had the idea of a Lua module in there to make easier to pull data from Wikidata, so that all Wikisources use the same tools and don't have to make their own.
Working draft: s:Template:Author-list (using s:Module:Edition ) makes the jump from edition to work, via P629. Also has structured HTML, for easier scraping/re-use by 3rd parties.
d:Q42372837 (with P31 = d:Q42396623 digital representation) Problem to link to Commons and to scans. P953
Other question: P757 vs. P50... Authors vs. Contributors
- editor (P98 ?) – P98 is a person, never an organisation
- author (P50)
- illustrator (P110), photographer
- contributor (P767)
- author of foreword (P2679)
- possible creator P1179
- publisher/editor (organisation, group of people – P???)
- printer P872; sometimes the same as P123 publishing house, publishing company)
- (creator P170)
- P655 translator
Perhaps generalised as:
{{wikidata-list |property=P767 |separator=, |last_separator=, and |intro=Authors: |outtro=. }}
Authors:
<a href="/wiki/Author:Jane_Austen" title=""><span itemprop="author" itemscope="" itemtype="http://schema.org/Person"><span itemprop="name">Jane Austen</span></span></a> and
<a href="/wiki/Author:Jim_Austen" title=""><span itemprop="author" itemscope="" itemtype="http://schema.org/Person"><span itemprop="name">Jim Austen</span></span></a>.
2018 Conference
editwhat about a second WIkisource conference.
Could be good for visibility, in May with WikiCite ? (https://meta.wikimedia.org/wiki/WikiCite_2018 may 2018, probably too soon), we should invite people from library to have the others side.
Big issues:
- program !!! (work on this before the others)
- finding venu, accomodation/hotel
- visa...
Wishlist Survey
editIMPORTANT the 2017 Community Wishlist Survey starts very soon : 6th November
- Wikisourcerers should answer it!! the more the better !!
- Anika mentions ABBYY OCR and related problem like fraktur script
- Sam mentions the Google OCR, e.g.: https://tools.wmflabs.org/ws-google-ocr/index.php?image=https%3A%2F%2Fupload.wikimedia.org%2Fwikipedia%2Fcommons%2Fthumb%2F0%2F0a%2FGeorg_Muck_-_Geschichte_von_Kloster_Heilsbronn_%2528Band_2%2529.pdf%2Fpage453-575px-Georg_Muck_-_Geschichte_von_Kloster_Heilsbronn_%2528Band_2%2529.pdf.jpg&lang=de comparison with other OCR, is it better or worse? implemented in Wikisource : https://wikisource.org/wiki/Wikisource:Google_OCR (asked and used by indian langages, but can be used by any if required).
re Anika: While Muck is an example for better OCR: https://tools.wmflabs.org/ws-google-ocr/index.php?image=https%3A%2F%2Fupload.wikimedia.org%2Fwikipedia%2Fcommons%2Fthumb%2F0%2F0b%2FAlbum_der_S%25C3%25A4chsischen_Industrie_Band_1.pdf%2Fpage369-2083px-Album_der_S%25C3%25A4chsischen_Industrie_Band_1.pdf.jpg&lang=de is an example for worse... The line "Bon ber Eifenbahn fiülhrt ein eigenes Sbienengleis bis in ben Babrithof, auf weldem bie Giüter
birelt von ober ju ber gabrit geführt werben fönnen." shall be "Von der Einsenbahn führt ein eigenes Schienengleis bis in den Fabrikhof, auf welchem die Güter direkt von oder zu der Fabrik geführt werden können." The Titel "Dechanifche Baumwoll-Beberei von Bimmer
inann & Co. in tctschkan." is "Mechanische Baumwoll-Weberei von Zimmer-
mann & Co. in Netzschkau." (big Issue Scan Quality?)
Next hangout
editMail the list about a datetime for the next hangout.