Previous |  Up |  Next

Article

Keywords:
University of Western Ontario; XML
Summary:
Publishing in Mathematics and theoretical areas in Computer Science and Physics has been predominantly using TeX/LaTeX as a formatting language in the last two decades. This large corpus of born-digital material is both a boon — LaTeX is semi-semantic format where the source often contains indications of the author’s intentions — and a problem — TeX is Turing-complete and authors use this freedom to use thousands of styles and millions of user macros. Several tools have been developed to convert TeX/LaTeX documents to XML-based — i.e. Web and DML-compatible formats. Different DML Projects use different tools, and the selection seems largely accidental. To put the choice of converters for DML projects onto a more solid footing and to encourage competition and feature convergence we survey the market. In this paper we investigate and compare five LaTeX-to-XML transformers in three dimensions: $a$) ergonomic factors like documentation, ease of installation, $b$) coverage, and $c$) quality of the resulting documents (in particular the MathML parts).
References:
ABC+03. Ausbrooks, Ron: Mathematical Markup Language (MathML) version 2.0 (second edition). W3C recommendation, World Wide Web Consortium, 2003.
Ang09a. Anghelache, Romeo: Hermes discontinued. project page at http://humanist.roua.org/2009/01/01/hermes-paused/, seen May 2009.
Ang09b. Anghelache, Romeo: Hermes website. project page at http://hermes.roua.org/, seen May 2009.
arX. arxmliv build system. http://arxmliv.kwarc.info
ArX07. arXiv.org e-Print archive. seen December2007. web page at http://www.arxiv.org
Bou08. Thierry, Bouche: Cedrics: When CEDRAM meets Tralics. In Sojka, Petr, editor, Towards Digital Mathematics Library, Proceedings of the DML 2008 workshop, pages 153–165. Masaryk University, Brno, 2008.
CeC09. Cecill license. http://www.cecill.info/, seen May 2009.
DLM09. Digital library of mathematical functions. project page at http://dlmf.nist.gov/, seen May 2009. Zbl 1130.65045
Gri03. Grimm, Jose: Tralics, a latex to xml translator. 2003.
KŞ06. Kohlhase, Michael, Şucan, Ioan: A search engine for mathematical formulae. In Ida, Tetsuo, Calmet, Jacques, and Wang, Dongming, editors, Proceedings of Artificial Intelligence and Symbolic Computation, AISC’2006, number 4120 in LNAI, pages 241–253. Springer Verlag, 2006. Zbl 1156.68306
Mil09. Miller, Bruce: LaTeXML website. http://dlmf.nist.gov/LaTeXML/, seen May 2009.
MM06. Munavalli, Rajesh, Miner, Robert: Mathfind: a math-aware search engine. In SIGIR ’06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 735–735, New York, NY, USA, 2006. ACM Press.
PH. Plaice and Yannis Haralambous: Omega website.
Sci09. EDP Sciences: lxir website. http://www.lxir-latex.org/, seen May 2009.
SGD+09. Stamerjohanns, Heinrich, Ginev, Deyan, David, Catalin, Misev, Dimitar, Zamdzhiev, Vladimir, Kohlhase, Michael: A comparison study of mathml-aware LaTeX converters. Kwarc report, Jacobs University Bremen, 2009.
SK08. Stamerjohanns, Heinrich, Kohlhase, Michael: Transforming the ar$\chi $iv to XML. In Autexier, Serge et al., editors, Intelligent Computer Mathematics, 9th International Conference, MKM 2008 Birmingham, UK, July 28 – August 1, 2008, Proceedings, number 5144 in LNAI, pages 574–582. Springer Verlag, 2008. Zbl 1166.68364
Tex09. TeX4HT website. http://www.cse.ohio-state.edu/~gurari/TeX4ht/, seen May 2009.
Tra09. Tralics website. http://www-sop.inria.fr/miaou/tralics/, seen May 2009.
TtM09. TtM website. project page at http://hutchinson.belmont.ma.us/tth/mml/, seen May 2009.
Val09. Validator website. http://homepage.mac.com/rcrews/software/validator/, seen May 2009.
Wat09. Watt, Stephen: MathML at ORCCA. project page at http://www.orcca.on.ca/MathML/, seen May 2009.
WG09. W3C Math WG: MathML software – converters. http://www.w3.org/Math/Software/mathml_software_cat_converters.html, seen May 2009.
Partner of
EuDML logo