Previous |  Up |  Next

Article

Title: Designing a Semantic Ground Truth for Mathematical Formulae (English)
Author: Sexton, Alan
Author: Sorge, Volker
Author: Suzuki, Masakazu
Language: English
Journal: Towards a Digital Mathematics Library. Paris, France, July 7-8th, 2010
Volume:
Issue: 2010
Year:
Pages: 37-42
.
Category: math
.
Summary: We report on a new project to design a semantic ground truth set for mathematical document analysis. The ground truth set will be generated by annotating recognised mathematical symbols with respect to both their global meaning in the context of the considered documents and their local function within the particular mathematical formula they occur. The aim of our work is to have a reliable database available for semantic classification during the formula recognition process with the aim of enabling correct interpretations of mathematical formulae and generating semantic markup such as Content MathML. (English)
Keyword: Content MathML
Keyword: OCR
MSC: 68-06
MSC: 68U10
MSC: 68U15
MSC: 68U99
.
Date available: 2011-07-18T09:43:54Z
Last updated: 2012-08-27
Stable URL: http://hdl.handle.net/10338.dmlcz/702571
.
Reference: 1. Aly, W., Uchida, S., Fujiyoshi, A., Suzuki, M.: Statistical classification of spatial relationships among mathematical symbols.In: Proceedings of ICDAR 2009, pages 1350–1354. IEEE Society Press, 2009.
Reference: 2. Baker, J., Sexton, A., Sorge, V.: A linear grammar approach to mathematical formula recognition from PDF.In: Proceedings of Intelligent Computer Mathematics, LNAI. Springer Verlag, Germany, 2009.
Reference: 3. Baker, J., Sexton, A., Sorge, V.: Faithful mathematical formula recognition from PDF documents.In: Proceedings of DAS 2010, 2010. Forthcoming.
Reference: 4. Buswell, S., Caprotti, O., Carlisle, D. P., Dewar, M. C., Gaëtano, M., Kohlhase, M.: The OpenMath Standard.The OpenMath Society, June 2004.
Reference: 5. Suzuki, M., Tamari, F., Fukuda, R., Uchida, S., Kanahori, T.: Infty—an integrated OCR system for mathematical documents.In: Proceedings of ACM Symposium on Document Engineering, pages 95–104. ACM Press, 2003.
Reference: 6. Suzuki, M., Uchida, S., Nomura, A.: A ground-truthed mathematical character and symbol image database.In: Proceedings of ICDAR 2005, pages 675–679. IEEE Society Press, 2005.
Reference: 7. The American Mathematical Society: 2000 Mathematics Subject Classification.2000. http://www.ams.org/msc/.
Reference: 8. Beusekom, J. van, Shafait, F., Breuel, T. M.: Automated OCR ground truth generation.In: Proceedings of DAS 2008, Sep 2008.
.

Files

Files Size Format View
DML_003-2010-1_7.pdf 214.4Kb application/pdf View/Open
Back to standard record
Partner of
EuDML logo