<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">inform</journal-id><journal-title-group><journal-title xml:lang="ru">Информатика</journal-title><trans-title-group xml:lang="en"><trans-title>Informatics</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1816-0301</issn><issn pub-type="epub">2617-6963</issn><publisher><publisher-name>UIIP NASB</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.37661/1816-0301-2024-21-3-48-62</article-id><article-id custom-type="elpub" pub-id-type="custom">inform-1306</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ИНТЕЛЛЕКТУАЛЬНЫЕ СИСТЕМЫ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>INTELLIGENT SYSTEMS</subject></subj-group></article-categories><title-group><article-title>Разработка метода подражательного обучения  для нейросетевой системы управления движением  мобильного робота на примере задачи поиска выхода  из лабиринта</article-title><trans-title-group xml:lang="en"><trans-title>Development of an imitation learning method for a neural  network system of mobile robot’s movement on example  of the maze solving</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-4126-6572</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Ким</surname><given-names>Т. Ю.</given-names></name><name name-style="western" xml:lang="en"><surname>Kim</surname><given-names>T. Yu.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Ким Татьяна Юрьевна, младший научный сотрудник, лаборатория робототехнических систем № 116</p><p>ул. Сурганова, 6, Минск, 220012</p></bio><bio xml:lang="en"><p>Tatyana Yu. Kim, Junior Researcher, Laboratory of Robotic Systems No. 116</p><p>st. Surganova, 6, Minsk, 220012</p></bio><email xlink:type="simple">tatyana_kim92@mail.ru</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-3412-9174</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Прокопович</surname><given-names>Г. А.</given-names></name><name name-style="western" xml:lang="en"><surname>Prakapovich</surname><given-names>R. A.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Прокопович Григорий Александрович, кандидат технических наук, доцент</p><p>ул. Сурганова, 6, Минск, 220012</p></bio><bio xml:lang="en"><p>Ryhor A. Prakapovich, Ph. D. (Eng.), Assoc. Prof.</p><p>st. Surganova, 6, Minsk, 220012</p></bio><email xlink:type="simple">rprakapovich@robotics.by</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Объединенный институт проблем информатики Национальной академии наук Беларуси</institution></aff><aff xml:lang="en"><institution>The United Institute of Informatics Problems of the National Academy of Sciences of Belarus</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2024</year></pub-date><pub-date pub-type="epub"><day>30</day><month>09</month><year>2024</year></pub-date><volume>21</volume><issue>3</issue><fpage>48</fpage><lpage>62</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Ким Т.Ю., Прокопович Г.А., 2024</copyright-statement><copyright-year>2024</copyright-year><copyright-holder xml:lang="ru">Ким Т.Ю., Прокопович Г.А.</copyright-holder><copyright-holder xml:lang="en">Kim T.Y., Prakapovich R.A.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://inf.grid.by/jour/article/view/1306">https://inf.grid.by/jour/article/view/1306</self-uri><abstract><sec><title>Цели</title><p>Цели. Поставлена цель разработать новый метод обучения системы управления мобильным роботом поиску выхода из лабиринта на основе обучения с подкреплением и алгоритма правой руки.</p></sec><sec><title>Методы</title><p>Методы. В работе применен метод компьютерного моделирования в среде MATLAB/Simulink.</p></sec><sec><title>Результаты</title><p>Результаты. Предложен новый метод обучения системы управления мобильным роботом, способный реализовывать алгоритм правой руки для поиска выхода из лабиринта. Данный метод основан на работе двух агентов, взаимодействующих между собой: первый непосредственно реализует поисковый алгоритм и ищет выход из лабиринта, а второй, следуя за ним, с помощью метода подражательного обучения пытается научиться находить выход из лабиринта. Агент-эксперт, реализуя дискретный алгоритм движения по лабиринту, совершает точные дискретные шаги и движется почти независимо от второго агента. Единственным ограничением является скорость его движения, которая прямо пропорционально зависит от расстояния между агентами. Второй агент, агент-ученик, методом проб и ошибок старается сократить расстояние до первого. Для реализации процесса обучения использовался метод обучения с подкреплением в режиме подражания, для которого была разработана соответствующая функция вознаграждения, позволяющая удерживать центр масс робота в центре коридора и при необходимости поворачивать, следуя за агентом-экспертом. Агенты передвигаются по виртуальному полигону, состоящему из разветвленных коридоров, достаточно широких для реализации различных маневров движений.</p></sec><sec><title>Заключение</title><p>Заключение. Было доказано, что благодаря предложенному методу подражательного обучения агентученик способен не только перенимать от агента-эксперта требуемые паттерны поведения (искать в ранее неизвестном лабиринте выход по алгоритму правой руки), но и самостоятельно приобретать новые (изменять скорость на повороте, обходить небольшие коридоры-тупики), которые положительным образом влияют на выполнение поставленной задачи.</p></sec></abstract><trans-abstract xml:lang="en"><sec><title>Objectives</title><p>Objectives. To develop a new method for training a mobile robot control system to use a maze solver algorithm based on reinforcement learning and the right-hand algorithm.</p></sec><sec><title>Methods</title><p>Methods. The work uses the method of computer modeling in the MATLAB/Simulink environment.</p></sec><sec><title>Results</title><p>Results. A new method for training a mobile robot control system capable of implementing the right-hand algorithm for finding an exit from a maze is proposed. The proposed method is based on the work of two agents interacting with each other: the first directly implements the search algorithm and searches for an exit from the maze, and the second, following it, tries to learn using the imitation learning method. The expert agent, implementing a discrete algorithm for moving through the maze, makes precise discrete steps and moves almost independently of the second agent. The only limitation is its speed, which is directly proportional to the distance between the agents. The second agent, the student agent, tries to reduce the distance to the first agent by trial and error. The learning process was implemented using the reinforcement learning method, which was used in the imitation mode and for which a corresponding reward function was developed, allowing the robot's center of mass to be kept in the center of the corridor and, if necessary, to turn, following the expert agent. The agents move along a virtual polygon consisting of branched corridors wide enough to implement various movement maneuvers.</p></sec><sec><title>Conclusion</title><p>Conclusion. It was proven that, thanks to the proposed method of imitative learning, the student agent is able not only to adopt the required behavior patterns from the expert agent – to search for an exit in a previously unknown labyrinth using the right-hand algorithm, but also to independently acquire new ones (changing speed on a turn, bypassing small dead-end corridors), which positively influence the performance of the assigned task.</p></sec></trans-abstract><kwd-group xml:lang="ru"><kwd>мобильный робот</kwd><kwd>агент</kwd><kwd>обучение с подкреплением</kwd><kwd>алгоритм правой руки</kwd><kwd>лабиринт</kwd><kwd>подражательное обучение</kwd></kwd-group><kwd-group xml:lang="en"><kwd>mobile robot</kwd><kwd>agent</kwd><kwd>reinforcement learning</kwd><kwd>right-hand algorithm</kwd><kwd>maze</kwd><kwd>imitative learning</kwd></kwd-group><funding-group><funding-statement xml:lang="ru">Работа была выполнена при поддержке гранта БРФФИ Ф22КИТГ-002 и задания Т31  ГПНИ «Цифровые и космические технологии, безопасность человека, общества и государства» (2021–2025).</funding-statement><funding-statement xml:lang="en">The work was supported by the BRFFR grant F22KITG-002 and the task T31 of the State Program for Scientific Research "Digital and Space Technologies, Security of Man, Society and the State" (2021–2025).</funding-statement></funding-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Towards continuous control for mobile robot navigation: A reinforcement learning and slam based approach / K. A. A. Mustafa [et al.] // Intern. Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences. – 2019. – Vol. 42. – Р. 857–863. https://doi.org/10.5194/isprs-archives-XLII-2-W13-857-2019</mixed-citation><mixed-citation xml:lang="en">Mustafa K. A. A., Botteghi N., Sirmacek B., Poel M., Stramigioli S. Towards continuous control for mobile robot navigation: A reinforcement learning and slam based approach. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2019, vol. 42, рр. 857–863. https://doi.org/10.5194/isprs-archives-XLII-2-W13-857-2019</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Truong, X. T. Toward socially aware robot navigation in dynamic and crowded environments: A proactive social motion model / X. T. Truong, T. D. Ngo // IEEE Transactions on Automation Science and Engineering. – 2017. – Vol. 14, no. 4. – P. 1743–1760. https://doi.org/10.1109/TASE.2017.2731371</mixed-citation><mixed-citation xml:lang="en">Truong, X. T., Ngo T. D. Toward socially aware robot navigation in dynamic and crowded environments: A proactive social motion model. IEEE Transactions on Automation Science and Engineering, 2017, vol. 14, no. 4, рр. 1743–1760. https://doi.org/10.1109/TASE.2017.2731371</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Playing Atari with Deep Reinforcement Learning [Electronic resource] / V. Mhin [et al.]. – 2013. – Mode of access: https://doi.org/10.48550/arXiv.1312.5602. – Date of access: 20.06.2024.</mixed-citation><mixed-citation xml:lang="en">Mhin V., Kavukcuoglu K., Silver D., Graves A., Antonoglou I., …, Riedmiller M. Playing Atari with Deep Reinforcement Learning, 2013. Available at: https://doi.org/10.48550/arXiv.1312.5602 (accessed 20.06.2024).</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Mastering the game of Go with deep neural networks and tree search / D. Silver [et al.] // Nature. – 2016. – Vol. 529, no. 7587. – Р. 484–489.</mixed-citation><mixed-citation xml:lang="en">Silver D., Huang A., Maddison C. J., Guez A., Sifre L., …, Ha abi D. Mastering the game of Go with deep neural networks and tree search. Nature, 2016, vol. 529, no. 7587, рр. 484–489.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Learning dexterous in-hand manipulation / M. Andrychowicz [et al.] // The Intern. J. of Robotics Research. – 2020. – Vol. 39, no. 1. – Р. 3–20. https://doi.org/10.1177/0278364919887447</mixed-citation><mixed-citation xml:lang="en">Andrychowicz M., Baker B., Chociej M., Józefowicz R., McGrew B., …, Zaremba W. Learning dexterous in-hand manipulation. The International Journal of Robotics Research, 2020, vol. 39, no. 1, рр. 3–20. https://doi.org/10.1177/0278364919887447</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Emergence of Locomotion Behaviours in Rich Environments [Electronic resource] / N. Heess [et al.]. – 2017. – Mode of access: https://doi.org/10.48550/arXiv.1707.02286. – Date of access: 20.06.2024.</mixed-citation><mixed-citation xml:lang="en">Heess N., Dhruva T. B., Sriram S., Lemmon J., Merel J., …, Silver D. Emergence of Locomotion Behaviours in Rich Environments, 2017. Available at: https://doi.org/10.48550/arXiv.1707.02286 (accessed 20.06.2024).</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Autonomous vehicle perception: The technology of today and tomorrow / J. V. Brummelen [et al.] // Transportation Research Part C: Emerging Technologies. – 2018. – No. 86. – P. 384–406. https://doi.org/10.1016/j.trc.2018.02.012</mixed-citation><mixed-citation xml:lang="en">Brummelen J. V., O'Brien M., Gruyer D., Najjaran H. Autonomous vehicle perception: The technology of today and tomorrow. Transportation Research Part C: Emerging Technologies, 2018, no. 86, рр. 384–406. https://doi.org/10.1016/j.trc.2018.02.012</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Huang, W. Learning to drive via Apprenticeship Learning and Deep Reinforcement Learning [Electronic resource] / W. Huang, F. Braghin, Z. Wang. – 2020. – P. 1–7. – Mode of access: https://doi.org/10.48550/arXiv.2001.03864. – Date of access: 20.06.2024.</mixed-citation><mixed-citation xml:lang="en">Huang W., Braghin F., Wang Z. Learning to drive via Apprenticeship Learning and Deep Reinforcement Learning, 2020, рр. 1–7. Available at: https://doi.org/10.48550/arXiv.2001.03864 (accessed 20.06.2024).</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Robust AI driving strategy for autonomous vehicles / S. Nageshrao [et al.] // AI-enabled Technologies for Autonomous and Connected Vehicles. – Springer, 2022. – Р. 161–212.</mixed-citation><mixed-citation xml:lang="en">Nageshrao S., Rahman Y., Ivanovic V., Jankovic M., T eng E., …, Filev D. Robu t AI driving trategy for autonomous vehicles. AI-enabled Technologies for Autonomous and Connected Vehicles. Springer, 2022, рр. 161–212.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Sensor and sensor fusion technology in autonomous vehicles: A review / D. J. Yeong [et al.] // Sensors. – 2021. – Vol. 21, iss. 6. – Р. 2140. https://doi.org/10.3390/s21062140</mixed-citation><mixed-citation xml:lang="en">Yeong D. J., Velasco-Hernandez G., Barry J., Walsh J. Sensor and sensor fusion technology in autonomous vehicles: A review. Sensors, 2021, vol. 21, iss. 6, р. 2140. https://doi.org/10.3390/s21062140</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Kweon, J. Deep reinforcement learning for guidewire navigation in coronary artery phantom / J. Kweon, K. Kim, Ch. Lee // IEEE Access. – 2021. – Vol. 9. – P. 166409–166422. https://doi.org/10.1109/ACCESS.2021.3135277</mixed-citation><mixed-citation xml:lang="en">Kweon J., Kim K., Lee Ch. Deep reinforcement learning for guidewire navigation in coronary artery phantom. IEEE Access, 2021, vol. 9, рр. 166409–166422. https://doi.org/10.1109/ACCESS.2021.3135277</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">An Algorithmic Perspective on Imitation Learning / T. Osa [et al.]. – Bo ton : Now publishers Inc., 2018. – 188 p.</mixed-citation><mixed-citation xml:lang="en">Osa T., Pajarinen J., Neumann G., Bagnell J. A., Abbeel P., Peters J. An Algorithmic Perspective on Imitation Learning. Bo ton, Now publishers Inc., 2018, 188 p.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Лонца, A. Алгоритмы обучения с подкреплением на Python / A. Лонца ; пер. с англ. А. А. Слинкина. – М. : ДМК Пресс, 2020. – 285 с.</mixed-citation><mixed-citation xml:lang="en">Lonza, A. Reinforcement Learning Algorithms with Python. Packt Publishing, 2019, 366 р.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Chella, А. Imitation learning and anchoring through conceptual spaces / А. Chella // Applied Artificial Intelligence. – 2007. – No. 21. – P. 343–359.</mixed-citation><mixed-citation xml:lang="en">Chella, А imitation learning and anchoring through conceptual spaces. Applied Artificial Intelligence, 2007, no. 21, рр. 343–359.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Kim, T. Automatic tuning of the motion control system of a mobile robot along a trajectory based on the reinforcement learning method / T. Kim, R. Prakapovich // Communications in Computer and Information Science. – Springer, Cham, 2022. – Vol. 1562. – P. 234–244. https://doi.org/10.1007/978-3-030-98883-8_17</mixed-citation><mixed-citation xml:lang="en">Kim T., Prakapovich R. Automatic tuning of the motion control system of a mobile robot along a trajectory based on the reinforcement learning method. Communications in Computer and Information Science. Springer, Cham, 2022, vol. 1562, рр. 234–244. https://doi.org/10.1007/978-3-030-98883-8_17</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Sutton, R. S. Reinforcement Learning: An Introduction / R. S. Sutton, A. G. Barto. – 2nd ed. – London, England : The MIT Press, 2014. – 352 р.</mixed-citation><mixed-citation xml:lang="en">Sutton R. S., Barto A. G. Reinforcement Learning: An Introduction, 2nd edition. London, England, The MIT Press, 2014, 352 р.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Watkin , C. Q-learning / C. Watkin , P. Dayan // Machine Learning. – 1992. – Vol. 8, i . 3–4. – Р. 279–292.</mixed-citation><mixed-citation xml:lang="en">Watkin C., Dayan P. Q-learning. Machine Learning, 1992, vol. 8, i . 3–4, рр. 279–292.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Duan, J. M. Prior knowledge ba ed Q-learning path planning algorithm / J. M. Duan, Q. L. Chen // Electronic Optic &amp; Control. – 2019. – Vol. 26, i . 9. – Р. 29–33.</mixed-citation><mixed-citation xml:lang="en">Duan J. M., Chen Q. L. Prior knowledge ba ed Q-learning path planning algorithm. Electronics Optics &amp; Control, 2019, vol. 26, i . 9, рр. 29–33.</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Sutton, R. S. Reinforcement Learning: An Introduction / R. S. Sutton, A. G. Barto. – 2nd ed. – London, England : The MIT Pre , 2014. – 338 р.</mixed-citation><mixed-citation xml:lang="en">Sutton R. S., Barto A. G. Reinforcement Learning: An Introduction, 2nd edition. London, England, The MIT Pre , 2014, 338 р.</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Rossi, F. Horizontal and vertical scaling of container-based applications using reinforcement learning / F. Rossi, M. Nardelli, V. Cardellini // 2019 IEEE 12th Intern. Conf. on Cloud Computing (CLOUD), Milan, Italy, 8–13 July 2019. – Milan, 2019. – P. 329–338. https://doi.org/10.1109/CLOUD.2019.00061</mixed-citation><mixed-citation xml:lang="en">Rossi F., Nardelli M., Cardellini V. Horizontal and vertical scaling of container-based applications using reinforcement learning. 2019 IEEE 12th International Conference on Cloud Computing (CLOUD), Milan, Italy, 8–13 July 2019. Milan, 2019, рр. 329–338. https://doi.org/10.1109/CLOUD.2019.00061</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">PAC model-free reinforcement learning / A. L. Strehl [et al.] // ICML’06: Proc. of the 23th Intern. Conf. on Machine Learning, Pittsburgh, Pennsylvania, USA, 25–29 June 2006. – Pittsburgh, 2006. – P. 881–888. https://doi.org/10.1145/1143844.114395</mixed-citation><mixed-citation xml:lang="en">Strehl A. L., Li L., Wiewiora E., Langford J., Littman M. L. PAC model-free reinforcement learning. ICML’06: Proceeding of the 23th International Conference on Machine Learning. Pittsburgh, Pennsylvania, USA, 25–29 June 2006. Pittsburgh, 2006, рр. 881–888. https://doi.org/10.1145/1143844.114395</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">Ravichandiran, S. Deep Reinforcement Learning with Python / S. Ravichandiran. – 2nd ed. – Packt Publishing, 2020. – 760 p.</mixed-citation><mixed-citation xml:lang="en">Ravichandiran S. Deep Reinforcement Learning with Python, 2nd edition. Packt Publishing, 2020, 760 p.</mixed-citation></citation-alternatives></ref><ref id="cit23"><label>23</label><citation-alternatives><mixed-citation xml:lang="ru">Yu, Ch. Supervised-actor-critic reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units / Ch. Yu, G. Ren // BMC Medical Informatics and Decision Making. – 2020. – No. 20 (S3). – P. 1–8. https://doi.org/10.1186/s12911-020-1120-5</mixed-citation><mixed-citation xml:lang="en">Yu Ch., Ren G. Supervised-actor-critic reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units. BMC Medical Informatics and Decision Making, 2020, no. 20 (S3), рр. 1–8. https://doi.org/10.1186/s12911-020-1120-5</mixed-citation></citation-alternatives></ref><ref id="cit24"><label>24</label><citation-alternatives><mixed-citation xml:lang="ru">Imitation learning: progress, taxonomies and challenges [Electronic resource] / B. Zheng [et al.] // IEEE Transactions on Neural Networks and Learning Systems. – 2022. – P. 1–22. – Mode of access: https://arxiv.org/abs/2106.12177. – Date of access: 20.06.2024.</mixed-citation><mixed-citation xml:lang="en">Zheng B., Verma S., Zhou J., Tsang I., Chen F. Imitation learning: progress, taxonomies and challenges. IEEE Transactions on Neural Networks and Learning Systems, 2022, рр. 1–22. Available at: https://arxiv.org/abs/2106.12177 (accessed 20.06.2024).</mixed-citation></citation-alternatives></ref><ref id="cit25"><label>25</label><citation-alternatives><mixed-citation xml:lang="ru">Ким, Т. Ю. Форсированное управление движением мобильного робота / Т. Ю. Ким, Г. А. Прокопович, А. А. Лобатый // Информатика. – 2022. − Т. 19, № 3. – С. 86–100. https://doi.org/10.37661/1816-0301-2022-19-3-86-100</mixed-citation><mixed-citation xml:lang="en">Kim T. Yu., Prakapovich R. A. Lobatiy A. A. Forced motion control of a mobile robot. Informatika [Informatics], 2022, vol. 19, no. 3, pp. 86−100 (In Russ.). https://doi.org/10.37661/1816-0301-2022-19-3-86-100</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
