Previous |  Up |  Next

Article

Title: An Online Repository of Mathematical Samples (English)
Author: Baker, Josef B.
Author: Sexton, Alan P.
Author: Sorge, Volker
Language: English
Journal: Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009
Volume:
Issue: 2009
Year:
Pages: 49-57
.
Category: math
.
Summary: With a growing community of researchers working on the recognition, parsing and digital exploitation of mathematical formulae, a need has arisen for a set of samples or benchmarks which can be used to compare, evaluate and help to develop different implementations and algorithms. The benchmark set would have to cover a wide range of mathematics, contain enough information to be able to search for specific samples and be accessible to the whole community. In this paper, we propose an on-line system and repository where researchers may upload samples of mathematics in various formats such as scanned images, images directly rendered from born-digital documents, or born-digital document extracts. The system will support community tagging of these samples with attributes about their syntactic structure, semantic origin, image quality and source. Each sample in the database may then be searched for by any of its associated attributes, and users could download sets of sorted or random formulae to meet their own requirements. Associated with the system will be freely downloadable tools to assist in extracting and clipping mathematical samples from various kinds of documents to prepare them for uploading. Additionally, the system will allow users to annotate each sample with their own files, in LaTeX, MathML, OpenMath and other formats. The intention here is that these annotation files will correspond either to the recognition results of the users’ own systems on the samples, or manually constructed results. We believe that this facility will help to build a community verified ground truth set, available to anyone accessing the system. (English)
Keyword: MathML
Keyword: PDF
MSC: 68U10
MSC: 68U15
idZBL: Zbl 1176.68080
.
Date available: 2011-07-18T09:33:47Z
Last updated: 2012-08-27
Stable URL: http://hdl.handle.net/10338.dmlcz/702559
.
Reference: 1. Hoos, H.H., Stutzle, T.: SATLIB: An online resource for research on SAT.. In: Proceedings of the Third Workshop on Satisfiability (SAT 2000), IOS Press (2000) 283–292 http://www.satlib.org.
Reference: 2. Sutcliffe, G., Suttner, C.: The TPTP Problem Library: CNF Release v1.2.1.. Journal of Automated Reasoning 21(2) (1998) 177–203 Zbl 0910.68197, MR 1646570
Reference: 3. Suzuki, M., Uchida, S., Nomura, A.: A ground-truthed mathematical character and symbol image database.. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), IEEE Society Press (2005) 675–679 http://www.inftyproject.org/en/database.html.
Reference: 4. W3C: Ink markup language (InkML).. (2006) http://www.w3.org/TR/InkML/.
Reference: 5. Crockford, D.: JavaScript Object Notation.. (2006) http://www.json.org/.
Reference: 6. The American Mathematical Society: 2000 Mathematics Subject Classification.(2000) http://www.ams.org/msc/.
Reference: 7. Sternberg, S.: Semi-riemann geometry and general relativity.(2003) http://www.math.harvard.edu/~shlomo/docs/semi_riemannian_geometry.pdf.
Reference: 8. Judson, T.: Abstract algebra — theory and applications.(2009) http://abstract.ups.edu/download.html.
.

Files

Files Size Format View
DML_002-2009-1_7.pdf 299.6Kb application/pdf View/Open
Back to standard record
Partner of
EuDML logo