Informatics

A model of homographs automatic identification for the Belarusian language

https://doi.org/10.37661/1816-0301-2023-20-4-87-100

Abstract

Objectives. A prototype system for automated homonymy resolution in Belarusian and Russian electronic texts is described. The work is motivated by the pressing problem of automatic text processing at the morphological level, which is complicated by the inflectional nature of Belarusian and its rich, diverse system of morphological characteristics across parts of speech.

Methods. The work uses regular homograph identification methods and knowledge-based methods.

Results. Methods and approaches for designing systems for automatic homograph detection are proposed. An algorithm for identifying homographs using a knowledge-based method has been developed. An effective and fast prototype for resolving homographs in Russian and Belarusian texts has been implemented.

Conclusion. A working prototype for homograph search is presented; it is the first open-access resource for resolving ambiguity in the Belarusian language.
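
The abstract does not spell out the knowledge-based algorithm, so the following is only a minimal illustrative sketch of one common way such identification can work: look up each token in a lexicon of stress-marked word forms and flag tokens that map to more than one entry. The toy lexicon, the names `LEXICON`, `strip_stress`, `build_index`, and `find_homographs`, and the choice of a stress-based lexicon are all assumptions made for illustration, not the authors' implementation.

```python
import re
import unicodedata
from collections import defaultdict

COMBINING_ACUTE = "\u0301"  # stress mark placed after the stressed vowel

# Hypothetical toy lexicon: (stressed form, language code, gloss).
LEXICON = [
    ("за\u0301мок", "ru", "castle"),
    ("замо\u0301к", "ru", "lock"),
    ("му\u0301зыка", "be", "music"),
    ("музы\u0301ка", "be", "musician"),
    ("кніга", "be", "book"),
]


def strip_stress(word: str) -> str:
    """Drop stress marks and lowercase, keeping letters such as й, ё, ў intact."""
    decomposed = unicodedata.normalize("NFD", word)
    plain = decomposed.replace(COMBINING_ACUTE, "")
    return unicodedata.normalize("NFC", plain).lower()


def build_index(lexicon):
    """Map (language, unstressed form) to every stressed variant in the lexicon."""
    index = defaultdict(list)
    for stressed, lang, gloss in lexicon:
        index[(lang, strip_stress(stressed))].append((stressed, gloss))
    return index


def find_homographs(text: str, lang: str, index=None):
    """Return (token, variants) pairs for tokens with more than one reading."""
    if index is None:
        index = build_index(LEXICON)
    found = []
    for token in re.findall(r"\w+", text):
        variants = index.get((lang, strip_stress(token)), [])
        if len(variants) > 1:
            found.append((token, variants))
    return found
```

In this sketch, calling `find_homographs("старый замок", "ru")` flags only the token `замок`, because the toy lexicon lists two stress variants for it. Choosing the correct reading in context would require an additional disambiguation step that is not shown here.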

For citations:


Hetsevich Yu.S., Zianouka Ya.S., Latyshevich D.I., Bakunovich A.A., Drahun A.Ya., Kazlova M.A. A model of homographs automatic identification for the Belarusian language. Informatics. 2023;20(4):87-100. (In Bel.) https://doi.org/10.37661/1816-0301-2023-20-4-87-100



This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1816-0301 (Print)
ISSN 2617-6963 (Online)