主題標籤
主題標籤是一種用於監控和搜索維基媒體編輯摘要中主題標籤使用情況的工具。該工具會監控所有的維基媒體项目(維基數據除外,詳見下文)的編輯,並允許用户搜索它們。其最初創建於2016年,但於2018年重寫。
- 项目聯絡人:Samwalton9 (WMF)
- 源代碼:Github
- Phabricator:主題標籤
用法
該工具目前托管在雲端虛擬專用伺服器上,实例上,网址是https://hashtags.wmflabs.org/。右側的「趨勢標籤」則羅列了過去30天內最常用的主題標籤。
To search for a specific hashtag, simply enter it in the Hashtag field and click Submit. Results can be further limited by Project (e.g. `fr.wikisource.org`) and within a date range with one or both of a start and end date. To search for multiple hashtags within the same search, simply separate them with commas, e.g. 1lib1ref, 1bib1ref
. Edits with multiple searched hashtags in the same edit summary are only returned once.
URLs take the form https://hashtags.wmflabs.org/?query=<hashtag>&project=<project>&startdate=<YYYY-MM-DD>&enddate=<YYYY-MM-DD>.
Any parameter except query can be omitted.
To get into more details for a particular search, click on the Show statistics
button. Currently, three graphs will be displayed – Top projects, Top users and Edits over time. The Top projects and Top users graphs show the top 10 Wikimedia projects and top 10 users respectively, both sorted by decreasing number of edits. To view the full list of projects and users click on View full stats
button. The Edits over time graph shows edits over days if the range of search results is less than 90 days, edits over months if the range is more than 90 days but less than 3 years, and edits per year in other cases. You can also download each of these statistics as a CSV file by clicking the Download CSV
button.
Until 2018 the tool was hosted at https://tools.wmflabs.org/hashtags. Due to high resource usage and a very large database, the tool was rewritten and moved to a dedicated Cloud VPS instance. Most URLs from the old tool now redirect to the new one, including any search queries.
主題標籤
主題標籤工具通過最近更改事件流,來監控除「維基數據」之外的所有維基媒體項目的主題標籤使用情況。其是通過使用正則表達式(?:^|\s)[##]{1}(\w+)
來尋找匹配主題標籤的編輯,而該正則表達式所能匹配的不僅僅只有全數字的主題標籤。
Wikidata is currently excluded from data collection due to the large volume of data it would generate. As an example, nearly 7 million monthly edits are tagged with #quickstatements there. See T207029 for details.
Bot edits are also excluded from data collection due to high edit rates. Individual bot edits can easily be tracked by looking at the contribution history of those individual bot accounts.
From 2016 to 2018 data was collected from various language Wikipedia projects, as each was requested by editors. Therefore data earlier than August 2018 does not include other Wikimedia projects or many Wikipedia languages. There is a gap in the data from August 8th to early September 2018 as a result of the time between the old tool being taken down and the new one starting up.
Due to technical issues there is also a gap from September 30th to November 13th 2021.
貢獻
歡迎各位對「主題標籤」工具的研發做出貢獻。源代碼可於Github上找到,公開任務列在Phabricator上。該工具通過Docker软件貨櫃中的Django上運行。有關本地設置的說明,請參閱Github的自述文件(即「README」)。
See also
- User:Shizhao/hashtagslink.js, a user script for linking to the tool from edit summary hashtags