Previous |  Up |  Next

Article

Title: Web Interface and Collection for Mathematical Retrieval : WebMIaS and MREC (English)
Author: Líška, Martin
Author: Sojka, Petr
Author: Růžička, Michal
Author: Mravec, Petr
Language: English
Journal: Towards a Digital Mathematics Library. Bertinoro, Italy, July 20-21st, 2011
Volume:
Issue: 2011
Year:
Pages: 77-84
.
Category: math
.
Summary: We demonstrate searching of mathematical expressions in technical digital libraries on a MREC collection of 439,423 real scientific documents with more than 158 million mathematical formulae. Our solution—the WebMIaS system—allows the retrieval of mathematical expressions written in TeX or MathML. TeX queries are converted on-the-fly into tree representations of Presentation MathML, which is used for indexing. WebMIaS allows complex queries composed of plain text and mathematical formulae, using MIaS (Math Indexer and Searcher), a math aware search engine based on the state-of-the-art system Lucene. MIaS implements proximity math indexing with a subformulae similarity search. (English)
Keyword: math indexing and retrieval
Keyword: mathematical digital libraries
Keyword: information systems
Keyword: information retrieval
Keyword: mathematical content search
Keyword: document ranking of mathematical papers
Keyword: math text mining
Keyword: WebMIaS
Keyword: MIaS
Keyword: Tralics
Keyword: TEX
Keyword: UMCL
Keyword: Lucene
MSC: 68-06
MSC: 68U10
MSC: 68U15
MSC: 68U99
.
Date available: 2011-07-15T09:29:24Z
Last updated: 2012-08-27
Stable URL: http://hdl.handle.net/10338.dmlcz/702604
.
Reference: 1. Archambault, D., Moço, V.: Canonical MathML to Simplify Conversion of MathML to Braille Mathematical Notations.In: Miesenberger, K., Klaus, J., Zagler, W., Karshmer, A. (eds.) Computers Helping People with Special Needs, Lecture Notes in Computer Science, vol. 4061, pp. 1191–1198. Springer Berlin / Heidelberg (2006), http://dx.doi.org/10.1007/11788713_172
Reference: 2. Grimm, J.: Producing MathML with Tralics.In: Sojka, P. (ed.): Towards a Digital Mathematics Library. Masaryk University, Paris, France (Jul 2010), http://www.fi.muni.cz/~sojka/dml-2010-program.html, pp. 105–117, http://dml.cz/dmlcz/702579
Reference: 3. Kováčik, O., Rákosník, J.: On spaces $L^{p(x)}$ and $W^{k,p(x)}$.Czechoslovak Mathematical Journal 41, 592–618 (1991), http://dml.cz/dmlcz/102493 MR 1134951
Reference: 4. : MREC—Mathematical REtrieval Collection.http://nlp.fi.muni.cz/projekty/eudml/MREC/index.html
Reference: 5. Sojka, P.: Towards a Digital Mathematics Library.Masaryk University, Paris, France (Jul 2010), http://www.fi.muni.cz/~sojka/dml-2010-program.html
Reference: 6. Sojka, P., Líška, M.: Indexing and Searching Mathematics in Digital Libraries – Architecture, Design and Scalability Issues.In: Davenport, J.H., Farmer, W., Rabe, F., Urban, J. (eds.) Proceedings of CICM Conference 2011 (Calculemus/MKM). Lecture Notes in Artificial Intelligence, LNAI, vol. 6824, pp. 228–243. Springer-Verlag, Berlin, Germany (Jul 2011)
Reference: 7. Stamerjohanns, H., Ginev, D., David, C., Misev, D., Zamdzhiev, V., Kohlhase, M.: MathML-aware Article Conversion from LaTeX.In: Sojka, P. (ed.) Proceedings of DML 2009. pp. 109–120. Masaryk University, Grand Bend, Ontario, CA (Jul 2009), http://dml.cz/dmlcz/702561
Reference: 8. Stamerjohanns, H., Kohlhase, M., Ginev, D., David, C., Miller, B.: Transforming Large Collections of Scientific Publications to XML.Mathematics in Computer Science 3, 299–307 (2010), http://dx.doi.org/10.1007/s11786-010-0024-7 Zbl 1205.68490
Reference: 9. Sylwestrzak, W., Borbinha, J., Bouche, T., Nowiński, A., Sojka, P.: EuDML—Towards the European Digital Mathematics Library.In: Sojka, P. (ed.): Towards a Digital Mathematics Library. Masaryk University, Paris, France (Jul 2010), http://www.fi.muni.cz/~sojka/dml-2010-program.html, pp. 11–24, http://dml.cz/ dmlcz/702569
.

Files

Files Size Format View
DML_004-2011-1_11.pdf 449.3Kb application/pdf View/Open
Back to standard record
Partner of
EuDML logo