
Article

Keywords:
question answering; natural language processing
Summary:
This paper describes UIO, a multi-domain question-answering system for the Czech language that looks for answers on the web. UIO draws on two fields: natural language interfaces to databases and question answering. In its current version, UIO can answer questions about train and coach timetables, cinema and theatre performances, currency exchange rates, name days, and entries in the Diderot Encyclopaedia. Much effort has been put into making the addition of a new domain very easy. UIO sets no limits on the vocabulary or the form of a question: users can ask syntactically correct or incorrect questions, or simply use keywords. A Czech morphological analyser and a bottom-up chart parser are employed to analyse the question. The database of multiword expressions is updated automatically whenever a new item is found on the web. For all domains, UIO has an accuracy rate of about 80