Research:Wiki Participation Challenge

Contact
Diederik van Liere
This page documents a completed research project.


Key Personnel

  • Diederik van Liere
  • Howie Fung

Project Summary

This competition challenges data-mining experts to build a model that predicts the number of edits an editor will make in the five months after the end date of the training dataset. The dataset is randomly sampled from English Wikipedia data covering January 2001 to August 2010.

The objective of this competition is to understand quantitatively which factors determine editing behavior. We hope these predictive models will help answer questions such as why people stop editing or increase their pace of editing.

Contestants are expected to build a predictive model that the Wikimedia Foundation can reuse to forecast long-term trends in the number of edits we can expect.

Methods

Participants are free to use any econometric / statistical method or machine learning approach.
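
As an illustration of the prediction task only, here is a minimal sketch of a naive persistence baseline: it forecasts that each editor will make as many edits in the next five months as in the last five months of the training window. The toy edit log, the exclusive cutoff of 1 September 2010, the helper names, and the use of root mean squared logarithmic error as a score are all assumptions made for this example; they are not taken from the competition rules or from any contestant's entry.

```python
import math
from datetime import date

# Hypothetical toy edit log (editor id -> edit dates); not real Wikipedia data.
EDITS = {
    "editor_a": [date(2010, 4, 2), date(2010, 7, 15), date(2010, 8, 20)],
    "editor_b": [date(2009, 1, 5)],
    "editor_c": [],
}

# Assumed exclusive cutoff: the training data runs through August 2010.
TRAIN_END = date(2010, 9, 1)


def shift_months(d, months):
    """Return the first day of the month that lies `months` months from d's month."""
    index = d.year * 12 + (d.month - 1) + months
    return date(index // 12, index % 12 + 1, 1)


def persistence_forecast(edit_dates, train_end, horizon_months=5):
    """Forecast future edits as the count of edits in the last `horizon_months`."""
    window_start = shift_months(train_end, -horizon_months)
    return sum(window_start <= d < train_end for d in edit_dates)


def rmsle(predicted, actual):
    """Root mean squared logarithmic error, one possible score for count forecasts."""
    squared = [(math.log1p(p) - math.log1p(a)) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared) / len(squared))


forecasts = {
    editor: persistence_forecast(dates, TRAIN_END) for editor, dates in EDITS.items()
}
```

A real entry would replace `persistence_forecast` with a fitted statistical or machine learning model; a baseline of this kind is mainly useful as a reference point that any such model should beat.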

Dissemination

The explanations of the algorithms can be found here:

Wikimedia Policies, Ethics, and Human Subjects Protection

Benefits for the Wikimedia community

Output will consist of algorithms and source code that will predict future editing behaviour.

Time Line

The competition ran from Tuesday 28 June 2011 until Tuesday 20 September 2011, a total of 84 days.

References


http://www.kaggle.com/c/wikichallenge

Contacts
