User:King ChristLike/Page Collection Report

Page Collection Report

edit

This tool helps you to generate a clear and comprehensive report on specific collections of Wikimedia pages. It compiles and displays data from chosen pages so that users can efficiently examine and interpret information.

Project status: actively maintained

Basic functionality

edit

Page Collection Report is intended for use by Wikimedia editors, researchers or anyalysts. It is meant to help combine information from several Wikimedia pages into an organized report and helps them ensure that important pages are not missing.

Page Collection Report uses a list of target pages to extract important information, and organizes it in a way that is easy to read. This makes it easier to analyze contents, revisions and essential metrics across several pages. For more details about the technical implementation, see the developer documentation.

What Page Collection Report does not do

edit

This tool cannot edit Wikimedia pages or automatically complete missing pages, it does not have real-time data updates and it cannot process pages with restrictive access like those requiring authentication unless proper credentials are provided. Also, it does not work as a search engine and cannot handle non-wikimedia content.

Prerequisites

edit

Before using this tool, you should be familiar with the basics of Wikimedia projects and have access to a web browser, using common text editors, spreadsheet software, and Mac or Windows operating system. Users should have basic understanding of page collections and have access to the tool’s web interface.

How to use Page Collection Report

edit

Access the tool

edit
  1. Navigate to the tool's interface.
  2. Enter the URL or titles of your target Wikimedia pages.
  3. Select filters or specific parameters of your choice to meet your need(s).

Generate report

edit
  1. Initiate the report generation process.
  2. After generating, you can view directly in your working browser or download for offline access.

Troubleshooting

edit

Report fails to generate

  • Ensure that the page titles/URLs entered are correct and that the pages exist on Wikimedia projects.

Incomplete data in report

  • Verify the content of the target pages for restrictions or limited information. Ensure pages do not block scraping.

How to get help and report issues

edit
  • Report issues at the project issue tracker on GitLab.
  • Ask questions or get help at the project's discussion page on GitLab. You can expect a response in 5 business days.

Developer documentation

edit

Technical implementation

edit

This tool uses the Wikimedia Action API to retrieve and analyze page collections. It depends on Wikimedia and its database because it uses structured queries and API calls for fetching, organizing, and reporting on page data. Organisation and structuring: We start by defining a clear and logical architecture for the documents. For example, we can classify them by type, by project, by département. Tools and technologies: We choose the right tools to store, manage, and distribute documents. This can be a simple shared folder, document management software (DMS), or a more complex system.

Organisation and structuring:

edit

We start by defining a clear and logical architecture for the documents. For example, we can classify them by type, by project, by departement. The right tools are chosen to store, manage, and distribute documents. This can be a simple shared folder, document management software (DMS), or a more complex system.

Code structure

edit

The articles module does text processing, word count, and formatting for articles, module does storage and retrieval. The HTML, CSS, and JavaScript files for the user interface directory contains node.js Api endpoint for handling request.

Local development

edit

Set up

edit

How to set up development environment:

  1. Clone the repository.
  2. Navigate to the project directory
  3. Install dependencies.

Install

edit

How to install:

  1. Create the virtual environment.
  2. Activate the virtual environment.
  3. Install the requirements.

Configure

edit

Modify the config.ini file to set any necessary configurations, such as API endpoints or authentication tokens.

Build and test

edit

How to build and run locally:

   python app.py

How to run tests:

   pytest tests/

Debugging

edit
  • Error: API Limit Exceeded
    • Reduce the number of pages per report or try to limit rate.
  • Error: Invalid Page Title
    • Verify the accuracy of the entered page titles.

How to contribute

edit

The page collection report maintainers welcome contributions, either bug reports or feature requests.

Contribution process

edit

Before contributing, read the Code of Conduct that outlines community guidelines and expectations.

  • Fork the repository on GitLab.
  • Create a feature branch
   git checkout -b feature-name
  • Commit changes
   git commit -m "Description of changes"
  • Push branch
   git push origin feature-name
  • Submit merge request

Credits

edit

Developer: Santhosh Thottingal

License

edit

This project is licensed under the MIT License. See the license file for details.