Design of natural-language interfaces for reference systems
Abstract
This paper describes the technology behind modern natural language interfaces for intelligent systems, including question-answering systems, together with methods and principles for their design. It analyzes intelligent systems with natural language interfaces used in several areas: medicine, smart home technology, education, industry, and fast adaptation to new technologies. A list of the most popular services with natural language interfaces is presented; each service can be used as a ready-to-use personal assistant or as a core for developing a new customized natural language interface. Natural language interfaces were studied from the point of view of natural language as a means of interaction between a user and the machine. The main problems here are the bias in natural language and the difficulty of designing natural language interfaces that meet user expectations. The main principles of modeling natural language interfaces are considered. As an intelligent system, the interface consists of a database, a knowledge machine, and a user interface. Speech recognition and speech synthesis components make natural language interfaces more convenient to use.
About the Authors
Yu. S. Hetsevich
Belarus
Cand. Sci. (Eng.), Head of the Speech Recognition and Synthesis Laboratory
V. A. Zhitko
Belarus
Cand. Sci. (Eng.)
S. A. Hetsevich
Belarus
Master of Science, Junior Researcher
L. I. Kaigorodova
Belarus
Master of Science, Postgraduate Student, Junior Researcher
K. A. Nikalaenka
Belarus
Postgraduate Student
For citations:
Hetsevich Yu.S., Zhitko V.A., Hetsevich S.A., Kaigorodova L.I., Nikalaenka K.A. Design of natural-language interfaces for reference systems. Informatics. 2019;16(3):37-47. (In Russ.)