Previous |  Up |  Next


Title: Data Enhancements in a Digital Mathematical Library (English)
Author: Růžička, Michal
Author: Sojka, Petr
Language: English
Journal: Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010
Issue: 2010
Pages: 69-76
Category: math
Summary: The quality of digital mathematical library depends on the formats and quality of data it offers. We show several enhancements of (meta)data of the Czech Digital Mathematics Library DML-CZ. We discuss possible minimalist modification of regular LaTeX documents that would simplify generating basic metadata that describes the article in an XML/MathML format. We also show a proof of concept of a method that enables us to include LaTeX source code of mathematical expressions into pdfTeX-generated PDFs in such a way that the reader can Copy & Paste the code from his PDF viewer. This code, hidden in the PDF file, can also be used for LaTeX math indexing. (English)
Keyword: DML-CZ
Keyword: metadata generation
Keyword: XML
Keyword: MathML
Keyword: PDF
Keyword: copy-math
Keyword: metadata generation
Keyword: Tralics
MSC: 68-06
MSC: 68U10
MSC: 68U15
MSC: 68U99
Date available: 2011-07-18T09:47:06Z
Last updated: 2012-08-27
Stable URL:
Reference: 1. : Archivum Mathematicum.[online],, Masaryk University, Brno, Czech Republic. Last modified December 18, 2009. [cit. 2010-04-25].
Reference: 2. : Centre de diffusion de revues académiques mathématiques.[online],, [Center for diffusion of mathematic journals]. [cit. 2008-05-25].
Reference: 3. : Czech Digital Mathematics Library.[online],, [cit. 2010-04-24]. Zbl 1170.68487
Reference: 4. : EuDML: The European Digital Mathematics Library.[online],, This page was last modified on 20 January 2010, at 08:09. [cit. 2010-04-25].
Reference: 5. Hatlapatka, R., Sojka, P.: PDF Enhancements Tools for a Digital Library.In: Sojka, P. (ed.) Proceedings of DML 2010, pp. 69–76. Masaryk University Press, Paris, France (Jul 2010).
Reference: 6. : Infty Project: Research Project on Mathematical Information Processing.[online],, [cit. 2010-06-02].
Reference: 7. : Tralics: a LaTeX to XML translator., Last modified $Date: 2009/11/24 17:17:03 $ [cit. 2010-04-24].
Reference: 8. Bouche, T.: A pdfLaTeX-based automated journal production system.TUGboat 27(1), 45–50 (2006), In Proceedings of EuroTeX 2006.
Reference: 9. Grimm, J.: Tralics, a LaTeX to XML Translator.TUGboat 24(3), 377–388 (2003), In Proceedings of EuroTeX.
Reference: 10. Růžička, M.: Automated Processing of TeX-Typeset Articles for a Digital Library.In: Sojka, P. (ed.) DML 2008 – Towards Digital Mathematics Library. pp. 167–176 (2008), Birmingham, UK, July 27th, 2008.
Reference: 11. Suzuki, M., Kanahori, T., Ohtake, N., Yamaguchi, K.: An Integrated OCR Software for mathematical Documents and Its Output with Accessibility.In: Computers Helping people with Special Needs. Lecture Notes in Computer Sciences, vol. 3119, pp. 648–655. Springer (2004), 9th International Conference ICCHP 2004, Paris, July 2004.


Files Size Format View
DML_003-2010-1_11.pdf 336.7Kb application/pdf View/Open
Back to standard record
Partner of
EuDML logo