Grants:IEG/Public Domain Textbook Import

status: withdrawn

Individual Engagement Grants
Individual Engagement Grants
Review grant submissions
review
grant submissions
Visit IdeaLab submissions
visit
IdeaLab submissions
eligibility and selection criteria

project:

Public Domain Textbook Import


project contact:

taosubmarines(_AT_)mail.ru

participants:


grantees: User:Herper.gr


summary:

Working on https://en.wikibooks.org/w/index.php?title=Snakes_of_Europe, this makes me wonder how to work with a single scanned pdf page, how to import the OCR text from archive.org, extract plates, tables or diagrams as an image, and ensure approximate layout is the same on a heavily modified pdf/odt output (printed ebook)





2014 round 1

Project idea

edit

What is the problem you're trying to solve?

edit

The book, The Snakes Of Europe is a 100 year old public domain text. There is no modern day comparable open access/source field-guide/reference/text-book, that has photos to assist species identification of the subject matter, and information. My idea is to enable easy cross-referencing of external scans (pdf/image/...) (ie from archive.org) and of external OCR sources, and to look at ways to improve import of OCR (https://en.wikibooks.org/wiki/Snakes_of_Europe/Definition_and_Classification has various tables incorporated that have not been pasted legibly, whereas https://en.wikibooks.org/wiki/Snakes_of_Europe/Habits is a fairly simple copy and paste, fully legible, even without correct formatting).

What is your solution?

edit

I would like to develop an extension for firefox and chrome to have external pages (a pdf page from archive.org) on a top side panel and be able to work with OCR text on a lower side panel, so as to be able to import (copy) a piece of the pdf with say an image or a table of interest. With this concept I would be able to paste selected tables to the 'Definition_and_Classification' article above, and have them appear as images. I could also copy in image plates, and taxobox type features, of individual species easily. In this extension i could also look at incorporating addtional utilities such as GOCR, to work in only this extension.

Project goals

edit


Ready to create the rest of your proposal?
Use the button below just once to create the remaining sections you'll need!


Part 2: The Project Plan

edit

Project plan

edit

Scope

edit

Activities

edit

Budget

edit

Total amount requested

edit

Budget breakdown

edit

Intended impact

edit

Target audience

edit

Community engagement

edit

Fit with strategy

edit

Sustainability

edit

Measures of success

edit

Need target-setting tips?

Participant(s)

edit

Discussion

edit

Community Notification

edit

Please paste a link below to where the relevant communities have been notified of this proposal, and to any other relevant community discussions. Need notification tips?

Endorsements

edit

Do you think this project should be selected for an Individual Engagement Grant? Please add your name and rationale for endorsing this project in the list below. Other feedback, questions or concerns from community members are also highly valued, but please post them on the talk page of this proposal.

  • Community member: add your name and rationale here.