Research:Daily unique anonymous editors
n >= 1 edits
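-- Day to measure (UTC, YYYYMMDD) and minimum number of edits per editor
SET @date = "20140101";
SET @n = 1;
SELECT
    COUNT(*)
FROM (
    -- Total edits per anonymous editor across live and archived revisions
    SELECT
        rev_user_text,
        SUM(revisions) AS revisions
    FROM (
        -- Edits to pages that currently exist
        SELECT
            rev_user_text,
            COUNT(*) AS revisions
        FROM revision
        WHERE
            -- Half-open interval: midnight of the next day is excluded
            rev_timestamp >= @date AND
            rev_timestamp < DATE_FORMAT(DATE_ADD(@date, INTERVAL 1 DAY), "%Y%m%d%H%i%S") AND
            rev_user = 0
        GROUP BY rev_user_text
        -- UNION ALL keeps both rows when an editor has the same count in
        -- both tables; a plain UNION would drop one of them as a duplicate
        UNION ALL
        -- Edits to pages that have since been deleted
        SELECT
            ar_user_text AS rev_user_text,
            COUNT(*) AS revisions
        FROM archive
        WHERE
            ar_timestamp >= @date AND
            ar_timestamp < DATE_FORMAT(DATE_ADD(@date, INTERVAL 1 DAY), "%Y%m%d%H%i%S") AND
            ar_user = 0
        GROUP BY ar_user_text
    ) AS user_revisions
    GROUP BY rev_user_text
) AS editors
WHERE revisions >= @n;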
Daily unique anonymous editors is a standardized metric used to measure the number of logged-out editors who save edits to a wiki on a given day. It's used as a proxy for editing population size.
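The definition can also be sketched outside the database. The following is a minimal illustration rather than the canonical implementation: the `Edit` record, the `daily_unique_anonymous_editors` function, and the sample data are hypothetical, and a `user_id` of 0 is assumed to mark a logged-out edit, as in the query.

```python
from collections import Counter
from datetime import date, datetime, timedelta, timezone
from typing import NamedTuple


class Edit(NamedTuple):
    user_id: int         # assumption: 0 marks a logged-out (anonymous) edit
    user_text: str       # IP address for anonymous edits
    timestamp: datetime  # timezone-aware, UTC


def daily_unique_anonymous_editors(edits, day, n=1):
    """Count distinct anonymous user_text values with at least n edits on the UTC day."""
    start = datetime(day.year, day.month, day.day, tzinfo=timezone.utc)
    end = start + timedelta(days=1)  # exclusive upper bound
    per_editor = Counter(
        e.user_text
        for e in edits
        if e.user_id == 0 and start <= e.timestamp < end
    )
    return sum(1 for count in per_editor.values() if count >= n)


def utc(*args):
    """Build a UTC datetime (helper for the sample data only)."""
    return datetime(*args, tzinfo=timezone.utc)


# Hypothetical sample data using documentation-reserved IP ranges.
SAMPLE_EDITS = [
    Edit(0, "192.0.2.1", utc(2014, 1, 1, 0, 5)),
    Edit(0, "192.0.2.1", utc(2014, 1, 1, 13, 0)),
    Edit(0, "198.51.100.7", utc(2014, 1, 1, 23, 59, 59)),
    Edit(42, "RegisteredUser", utc(2014, 1, 1, 12, 0)),  # logged in, ignored
    Edit(0, "203.0.113.9", utc(2014, 1, 2, 0, 0)),       # next day, ignored
]

print(daily_unique_anonymous_editors(SAMPLE_EDITS, date(2014, 1, 1)))       # 2
print(daily_unique_anonymous_editors(SAMPLE_EDITS, date(2014, 1, 1), n=2))  # 1
```

Raising `n` restricts the count to more active anonymous editors; with `n=2` only the address that saved two edits on that day is counted.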
Discussion
Using IP as an identifier
The current metric counts distinct IP addresses within the specified period as a proxy for distinct anonymous editors. A unique IP address doesn't necessarily identify a unique user, because of IP rotation, IP addresses shared among multiple editors, proxies, and similar factors.
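A small worked example may make the direction of the error clearer. The data below is entirely invented: real edit records never carry a `person` column, which is exactly why the IP address is used as a stand-in.

```python
# Hypothetical ground truth: (person, ip) for each anonymous edit in one day.
edits = [
    ("alice", "192.0.2.10"),    # Alice's address rotates during the day...
    ("alice", "192.0.2.11"),    # ...so she is counted as two editors
    ("bob", "198.51.100.5"),    # Bob, Carol and Dave share one address...
    ("carol", "198.51.100.5"),  # ...so together they are counted once
    ("dave", "198.51.100.5"),
]

true_editors = len({person for person, _ in edits})
counted_editors = len({ip for _, ip in edits})

print(true_editors)     # 4
print(counted_editors)  # 3
```

Rotation inflates the count while sharing deflates it, so the metric can err in either direction and the two effects do not necessarily cancel.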
Time lag
Because this is a daily metric, a full 24 hours must elapse after the start of the date (UTC) before a complete (uncensored) value can be calculated.
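A reporting job could guard against computing a partial value with a check along these lines. This is a sketch only; the function name `is_day_complete` is hypothetical and it ignores any additional delay such as database replication lag.

```python
from datetime import date, datetime, timedelta, timezone


def is_day_complete(day, now=None):
    """Return True once the full UTC day has elapsed, so no further edits can fall within it."""
    if now is None:
        now = datetime.now(timezone.utc)
    end_of_day = (
        datetime(day.year, day.month, day.day, tzinfo=timezone.utc)
        + timedelta(days=1)
    )
    return now >= end_of_day


# One second before midnight UTC the day is still open; at midnight it is complete.
print(is_day_complete(date(2014, 1, 1),
                      now=datetime(2014, 1, 1, 23, 59, 59, tzinfo=timezone.utc)))  # False
print(is_day_complete(date(2014, 1, 1),
                      now=datetime(2014, 1, 2, 0, 0, 0, tzinfo=timezone.utc)))     # True
```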
Edits on deleted pages
This metric includes edits to existing pages as well as to pages that have been, or will later be, deleted. This makes the metric stateless: historical values will not change in the future depending on the status of a page (existing, deleted, or moved) at the time the metric is computed. Deletion-related activity is tracked via a separate set of metrics.
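The stateless property can be illustrated with a toy example. The function `count_editors` and the row lists are hypothetical: each list entry stands for the user text of one anonymous edit on the day in question, held either with live revisions or with archived (deleted) ones.

```python
from collections import Counter


def count_editors(revision_rows, archive_rows, n=1):
    """Count distinct user_text values with >= n edits across live and archived revisions."""
    totals = Counter(revision_rows)
    totals.update(archive_rows)  # add archived edits to each editor's total
    return sum(1 for count in totals.values() if count >= n)


# Before any deletion: all three edits are live.
before_revision = ["192.0.2.1", "192.0.2.1", "198.51.100.7"]
before_archive = []

# Later, a page holding two of those edits is deleted, moving them to the archive.
after_revision = ["192.0.2.1"]
after_archive = ["192.0.2.1", "198.51.100.7"]

print(count_editors(before_revision, before_archive))  # 2
print(count_editors(after_revision, after_archive))    # 2
print(count_editors(after_revision, []))               # 1 (live revisions only)
```

Counting live revisions alone would let a later deletion change the historical value from 2 to 1, whereas summing each editor's edits from both sources keeps it stable, including for thresholds above one edit.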