Article

Title: Small Scale Retrodigitization (English)
Author: Doob, Michael
Language: English
Journal: Towards Digital Mathematics Library. Birmingham, United Kingdom, July 27th, 2008
Volume:
Issue: 2008
Year:
Pages: 103-113
Category: math
Summary: The digitization of papers born in the print-only era is vital for the health of the mathematical record. Many large scale retrodigitization projects are underway and, at this point, probably more than half of the mathematical history has been finished. Many smaller journals and books remain to be done. This paper gives a framework within which these may also be completed. It uses the digitization of the Canadian Journal of Mathematics (53,000 pages), completed as a one-man project over a few months, as the working example. The project described herein not only may be used as a model for similar efforts but also indicates some interesting problems yet to be solved. (English)
Keyword: home retrodigitization
Keyword: NUMDAM
MSC: 68P99
MSC: 68U10
MSC: 68U15
idZBL: Zbl 1170.68486
Date available: 2011-07-18T09:24:18Z
Last updated: 2012-08-27
Stable URL: http://hdl.handle.net/10338.dmlcz/702534
Reference: 1. The home web site for ArXiv is http://arxiv.org/ and is hosted by the Cornell University Library. The history of ArXiv is given in the article at http://en.wikipedia.org/wiki/ArXiv.
Reference: 2. http://www.ceic.math.ca/Publications/retro_bestpractices.pdf.
Reference: 3. Dennis, Keith: has had some encouraging results using Perl scripts developed by his working group at Cornell. His software has only been circulated informally. Zbl 0527.16007
Reference: 4. Dennis, K., Michler, G. O., Schneider, G., Suzuki, M.: Automatic reference linking in distributed digital libraries. CVPRW 2003, Conference on Computer Vision and Pattern Recognition Workshop, paper #26, Volume 3 (Workshop on Document Image Analysis and Retrieval), 5 pp. (2003).
Reference: 5. Ewing, J.: Measuring Journals. Notices of the AMS, 1049–1053 (2006). Zbl 1142.00304
Reference: 6. The project location is http://code.google.com/p/tesseract-ocr.
Reference: 7. Described at http://en.wikipedia.org/wiki/Tesseract_(software) and announced at http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-ocr.html.
Reference: 8. The main site is at http://www.imagemagick.org/script/index.php.
Reference: 9. A full description of this project is at http://minidml.mathdoc.fr/.
Reference: 10. NUMDAM.
Reference: 11. See http://en.wikipedia.org/wiki/OCRopus.
Reference: 12. The home page for this software is http://www.pdfhacks.com/pdftk/.
Reference: 13. Documentation for the hyperref package can be found both at http://en.wikibooks.org/wiki/LaTeX/Packages/Hyperref and at http://www.tug.org/applications/hyperref/.
Reference: 14. http://www-sop.inria.fr/apics/tralics specifically translates LaTeX to XML.
Reference: 15. http://www.unicode.org/charts contains a list of the standard character names.

File: DML_001-2008-1_13.pdf (861.2Kb, application/pdf)