<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">inform</journal-id><journal-title-group><journal-title xml:lang="ru">Информатика</journal-title><trans-title-group xml:lang="en"><trans-title>Informatics</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1816-0301</issn><issn pub-type="epub">2617-6963</issn><publisher><publisher-name>UIIP NASB</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.37661/1816-0301-2022-19-4-53-68</article-id><article-id custom-type="elpub" pub-id-type="custom">inform-1211</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ОБРАБОТКА СИГНАЛОВ, ИЗОБРАЖЕНИЙ, РЕЧИ, ТЕКСТА И РАСПОЗНАВАНИЕ ОБРАЗОВ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>SIGNAL, IMAGE, SPEECH, TEXT PROCESSING AND PATTERN RECOGNITION</subject></subj-group></article-categories><title-group><article-title>Разработка алгоритма распознавания эмоций человека с использованием сверточной нейронной сети на основе аудиоданных</article-title><trans-title-group xml:lang="en"><trans-title>Algorithm development for recognizing human emotions using a convolutional neural network based on audio data</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0001-6531-1895</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Семенюк</surname><given-names>В. В.</given-names></name><name name-style="western" xml:lang="en"><surname>Semenuk</surname><given-names>V. V.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Семенюк Виктория Валерьевна, магистр техниче-ских наук, преподаватель специальных дисциплин</p><p>ул. Горького, 163, Донецк, 83000</p></bio><bio xml:lang="en"><p>Viktoriya V. Semenuk, M. Sc. (Eng.), Teacher of Special Disciplines</p><p>st. Gorkogo, 163, Donetsk, 83000</p></bio><email xlink:type="simple">semenuk.viktoriya@gmail.com</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-3070-6656</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Складчиков</surname><given-names>М. В.</given-names></name><name name-style="western" xml:lang="en"><surname>Skladchikov</surname><given-names>M. V.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Складчиков Максим Владимирович, магистр техни-ческих наук, преподаватель специальных дисциплин</p><p>ул. Горького, 163, Донецк, 83000</p></bio><bio xml:lang="en"><p>Maxim V. Skladchikov, M. Sc. (Eng.), Teacher of Special Disciplines</p><p>st. Gorkogo, 163, Donetsk, 83000</p></bio><email xlink:type="simple">maxsklad19981@yandex.ru</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Донецкий техникум промышленной автоматики имени А. В. Захарченко</institution></aff><aff xml:lang="en"><institution>Donetsk Technical School of Industrial Automation after A. V. Zakharchenko</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2022</year></pub-date><pub-date pub-type="epub"><day>08</day><month>09</month><year>2022</year></pub-date><volume>19</volume><issue>4</issue><fpage>53</fpage><lpage>68</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Семенюк В.В., Складчиков М.В., 2022</copyright-statement><copyright-year>2022</copyright-year><copyright-holder xml:lang="ru">Семенюк В.В., Складчиков М.В.</copyright-holder><copyright-holder xml:lang="en">Semenuk V.V., Skladchikov M.V.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://inf.grid.by/jour/article/view/1211">https://inf.grid.by/jour/article/view/1211</self-uri><abstract><p>Цели. Приведено описание и рассмотрен опыт создания алгоритма распознавания эмоционального состояния субъекта.Методы. Использованы методы обработки изображений.Результаты. Предложенный алгоритм позволяет распознавать эмоциональные состояния субъекта на основании звукового набора данных. Благодаря проведенному исследованию удалось улучшить точность работы алгоритма путем изменения подаваемого на вход нейронной сети набора данных.Описаны этапы обучения сверточной нейронной сети на заранее заготовленном наборе звуковых данных, а также структура алгоритма. Для валидации нейронной сети был отобран иной, не участвующийв тренировке, набор аудиоданных. В результате проведения исследования построены графики, демонстрирующие точность работы предлагаемого метода.После получения первоначальных данных сделан анализ возможностей улучшения алгоритма с точки зрения эргономики и точности его работы. Разработана стратегия, позволяющая добиться лучшего результата и получить более точный алгоритм. На основании заключений, изложенных в статье, приводится обоснование выбора представления набора данных и программного комплекса, необходимого для реализации программной части алгоритма.Заключение. Предложенный алгоритм обладает высокой точностью и не требует больших вычислительных затрат.</p></abstract><trans-abstract xml:lang="en"><p>Objectives. This article provides a description and experience of creating the algorithm for recognizing the emotional state of the subject.Methods. Image processing methods are used.Results. The proposed algorithm makes it possible to recognize the emotional states of the subject on the basis of an audio data set. It was possible to improve the accuracy of the algorithm by changing the data set supplied to the input of the neural network.The stages of training convolutional neural network on a pre-prepared set of audio data are described, and the structure of the algorithm is described. To validate the neural network different set of audio data, not participating in the training, was selected. As a result of the study, graphs were constructed demonstrating the accuracy of the proposed method.After receiving the initial data of the study, the analysis of the possibilities for improving the algorithm in terms of ergonomics and accuracy of operation was also carried out. The strategy was developed to achieve a better result and obtain a more accurate algorithm. Based on the conclusions presented in the article, the rationale for choosing the representation of the data set and the software package necessary for the implementation of the software part of the algorithm is given.Conclusion. The proposed algorithm has a high accuracy of operation and does not require large computational costs.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>нейронная сеть</kwd><kwd>распознавание эмоций человека</kwd><kwd>сверточная нейронная сеть</kwd><kwd>дактилоскопия звука</kwd><kwd>программная библиотека TensorFlow</kwd><kwd>нейросетевая библиотека Keras</kwd><kwd>пакет программ Matlab</kwd></kwd-group><kwd-group xml:lang="en"><kwd>neural network</kwd><kwd>human emotion recognition</kwd><kwd>convolutional neural network</kwd><kwd>sound fingerprinting</kwd><kwd>TensоrFlow software library</kwd><kwd>Keras neural network library</kwd><kwd>Matlab software package</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Mesaros, A. Acoustic scene classification: Overviews of DCASE 2017 challenge entries / A. Mesaros, T. Heittola, T. Virtanen // 16th Intern. Workshop on Acoustic Signal Enhancement (IWAENC 2018), Tokyo, Japan, 17–20 Sept. 2018. – Tokyo, 2018. – Р. 411–415.</mixed-citation><mixed-citation xml:lang="en">Mesaros A., Heittola T., Virtanen T. Acoustic scene classification: Overviews of DCASE 2017 challenge entries. 16th International Workshop on Acoustic Signal Enhancement (IWAENC 2018), Tokyo, Japan, 17–20 September 2018. Tokyo, 2018, рр. 411–415.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Haitsma, J. A highly robust audio fingerprinting system / J. Haitsma, T. Kalker // 3rd Intern. Conf. on Music Information Retrieval, Paris, France, 13–17 Oct. 2002. – Paris, 2002. – Р. 107–115.</mixed-citation><mixed-citation xml:lang="en">Haitsma J., Kalker T. A highly robust audio fingerprinting system. 3rd International Conference on Music Information Retrieval, Paris, France, 13–17 Octоber 2002. Paris, 2002, рр. 107–115.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Ильин, Е. П. Эмоции и чувства / Е. П. Ильин. – СПб. : Питер, 2001. – 752 с.</mixed-citation><mixed-citation xml:lang="en">Ilin E. P. Jemocii i chuvstva. Emotions and Feelings. Saint Petersburg, Piter, 2001, 752 p. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Изард, К. Э. Психология эмоций / К. Э. Изард. – СПб. : Питер, 2012. – 464 с.</mixed-citation><mixed-citation xml:lang="en">Izard K. E. Psihologija jemocij. Psychology of Emotions. Saint Petersburg, Piter, 2012, 464 p. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Карелина, И. О. Развитие понимания эмоций в период дошкольного детства: психологический ракурс : монография / И. О. Карелина. – Прага : Vědecko vydavatelské centrum «Sociosféra-CZ», 2017. – 178 с.</mixed-citation><mixed-citation xml:lang="en">Karelina I. O. Razvitie ponimanija jemocij v period doshkol'nogo detstva: psihologicheskij rakurs. Developing an Understanding of Emotions during Preschool Childhood: A Psychological Perspective, Prague, Vědecko vydavatelské centrum "Sociosféra-CZ", 2017, 178 p. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Орехова, О. А. Цветовая диагностика эмоций. Типология развития : монография / О. А. Орехова. – СПб. : Речь; М. : Сфера, 2008. – 176 с.</mixed-citation><mixed-citation xml:lang="en">Orehova O. A. Cvetovaja diagnostika jemocij. Tipologija razvitija. Monografija. Color Diagnostics of Emotions. Typology of Development. Monograph. Saint Petersburg, Sphere, 2008, 176 p. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Шаповал, Ж. Я. Распознавание эмоций человека по изображению как часть автоматизированного переводчика языка жестов / Ж. Я. Шаповал // Молодежный научно-технический вестник. – 2017. – № 7. – С. 55.</mixed-citation><mixed-citation xml:lang="en">Shapoval J. A. Recognition of Human Emotions by image as part of an automated sign language translator. Molodezhnyj nauchno-tekhnicheskij vestnik [Youth Scientific and Technical Bulletin], 2017, no. 7, p. 55 (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Голубинский, А. Н. Выявление эмоционального состояния человека по речевому сигналу на основе вейвлет-анализа / А. Н. Голубинский // Вестник Воронежского института МВД России. – 2011. – № 3. – С. 144–153.</mixed-citation><mixed-citation xml:lang="en">Golubinskij A. N. Identification of a person's emotional state by a speech signal based on a Wavelet analysis. Vestnik Voronezhskogo instituta Ministerstva vnutrennih del Rossii [Bulletin of the Voronezh Institute of the Ministry of Internal Affairs of Russia], 2011, no. 3, pp. 144–153 (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Сидоров, К. И. Автоматическое распознавание эмоций человека на основе реконструкций аттракторов образцов речи / К. И. Сидоров, Н. Н. Филатова // Программные системы и вычислительные методы. – 2012. – № 1. – С. 67–79.</mixed-citation><mixed-citation xml:lang="en">Sidorov K. I., Filatova N. N. Automatic recognition of human emotions based on reconstructions of attractors of speech samples. Programmnye sistemy i vychislitel'nye metody [Software systems and computational methods], 2012, no. 1, pp. 67–79 (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Галичий, Д. А. Распознавание эмоций человека при помощи современных методов глубокого обучения / Д. А. Галичий, Г. И. Афанасьев, Ю. Г. Нестеров // E-SCIO. – 2021. – Т. 5, № 56. – С. 316–329.</mixed-citation><mixed-citation xml:lang="en">Galichij D. A., Afanaciev G. I., Nesterov U. G. Recognition of human emotions using modern methods of deep learning. E-SCIO, 2021, vol. 5, no. 56, pp. 316–329 (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Бредихин, А. И. Применение вейвлетов в задаче распознавания эмоций человека по его речи / А. И. Бредихин // Сборник избранных статей научной сессии ТУСУР. – 2018. – № 1–3. – С. 115–119.</mixed-citation><mixed-citation xml:lang="en">Bredihin A. I. The use of wavelets in the task of recognizing a person's emotions by his speech. Sbornik izbrannyh statej nauchnoj sessii Tomskogo gosudarstvennogo universiteta sistem upravlenija i radiojelektroniki [Collection of selected articles of the scientific session of Tomsk State University of Control Systems and Radioelectronics], 2018, no. 1–3, pp. 115–119 (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Рюмина, Е. В. Аналитический обзор методов распознавания эмоций по выражениям лица человека / Е. В. Рюмина, А. А. Карпов // Научно-технический вестник информационных технологий, механики и оптики. – 2020. – Т. 20, № 2. – С. 163–176.</mixed-citation><mixed-citation xml:lang="en">Rumina E. V., Karpov A. A. Analytical review of emotion recognition methods based on human facial expressions. Nauchno-tekhnicheskij vestnik informacionnyh tekhnologij, mekhaniki i optiki [Scientific and Technical Bulletin of Information Technologies, Mechanics and Optics], 2020, vol. 20, no. 2, pp. 163–176 (In Russ.). https://doi.org/10.17586/2226-1494-2020-20-2-163-176</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Dvoinikova, A. Emotion recognition and sentiment analysis of extemporaneus speech transcriptions in Russian / A. Dvoinikova, O. Verkholyak, A. Karpov // Lectures notes in computer science. – 2020. – Vol. 12335. – P. 136–144. https://doi.org/10.1007/978-3-030-60276-5_14</mixed-citation><mixed-citation xml:lang="en">Dvoinikova A., Verkholyak O., Karpov A. Emotion recognition and sentiment analysis of extemporaneus speech transcriptions in Russian. Lectures Notes in Computer Science, 2020, vol. 12335, pp. 136–144. https://doi.org/10.1007/978-3-030-60276-5_14</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Devi, J. S. Speaker emotion recognition based on speech feateres and classification techniques / J. S. Devi, S. Yarrammelle, S. P. Nandyala // Intern. J. of Image, Graphics, and Signal Processing. – 2014. – Vol. 6, no. 7. – P. 61–77. https://doi.org/10.5815/ijigsp.2014.07.08</mixed-citation><mixed-citation xml:lang="en">Devi J. S., Yarrammelle S., Nandyala S. P. Speaker emotion recognition based on speech feateres and classification techniques. International Journal of Image, Graphics, and Signal Processing, 2014, vol. 6, no. 7, pp. 61–77. https://doi.org/10.5815/ijigsp.2014.07.08</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Speech emotion recognition based on an improved brain emotion learning model / Z. I. Liu [et al.] // Neurocomputing. – 2018. – Vol. 309. – P. 145–156. https://doi.org/10.1016/j.neucom.2018.05.005</mixed-citation><mixed-citation xml:lang="en">Liu Z. I., Xie Q., Wu M., Cao W. H., Mao J. W., Mei Y. Speech emotion recognition based on an improved brain emotion learning model. Neurocomputing, 2018, vol. 309, pp. 145–156. https://doi.org/10.1016/j.neucom.2018.05.005</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Shirami, A. Speech emotion recognition based on SVM as both features selector and classifier / A. Shirami, A. R. N. Nilchi // Intern. J. of Image, Graphics, and Signal Processing. – 2016. – Vol. 8, no. 4. – P. 39–45. https://doi.org/10.5815/ijigsp.2016.04.05</mixed-citation><mixed-citation xml:lang="en">Shirami A., Nilchi A. R. N. Speech emotion recognition based on SVM as both features selector and classifier. International Journal of Image, Graphics, and Signal Processing, 2016, vol. 8, no. 4, pp. 39–45. https://doi.org/10.5815/ijigsp.2016.04.05</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Assuncao, G. Intermediary fuzzyfication in speech emotion recognition / G. Assuncao, P. Menezes // IEEE Intern. Conf. on Fuzzy System, Glasgow, United Kingdom, 19–24 July 2020. – Glasgow, 2020. – P. 9177699. https://doi.org/10.1109/FUZZ48607.2020.9177699</mixed-citation><mixed-citation xml:lang="en">Assuncao G., Menezes P. Intermediary fuzzyfication in speech emotion recognition. IEEE International Conference on Fuzzy System, Glasgow, United Kingdom, 19–24 July 2020. Glasgow, 2020, p. 9177699. https://doi.org/10.1109/FUZZ48607.2020.9177699</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Zisad, S. N. Speech emotion recognition in neurological disorders using convolutional neural network / S. N. Zisad, M. S. Hossain, K. Andersson // Lecture Notes in Computer Science. – 2020. – Vol. 12241. – P. 287–296. https://doi.org/10.1007/978-3-030-59277-6_26</mixed-citation><mixed-citation xml:lang="en">Zisad S. N., Hossain M. S., Andersson K. Speech emotion recognition in neurological disorders using convolutional neural network. Lecture Notes in Computer Science, 2020, vol. 12241, pp. 287–296. https://doi.org/10.1007/978-3-030-59277-6_26</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Werner, S. Speech emotion recognition: hemans vs machines / S. Werner, G. K. Petrenko // Discourse. – 2019. – Vol. 5, no. 5. – P. 136–152. https://doi.org/10.32603/2412-8562-2019-5-5-136-152</mixed-citation><mixed-citation xml:lang="en">Werner S., Petrenko G. K. Speech emotion recognition: hemans vs machines. Discourse, 2019, vol. 5, no. 5, pp. 136–152. https://doi.org/10.32603/2412-8562-2019-5-5-136-152</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Muppidi, A. Speech emotion recognition using quaternion convolutional neural networks / A. Muppidi, M. Radfar // IEEE Intern. Conf. of Acoustics, Speech and Signal Processing-Proceedings, Toronto, ON, Canada, 6–11 June 2021. – Toronto, 2021. – P. 6309–6313. https://doi.org/10.1109/ICASSP39728.2021.9414248</mixed-citation><mixed-citation xml:lang="en">Muppidi A., Radfar M. Speech emotion recognition using quaternion convolutional neural networks. IEEE International Conference of Acoustics, Speech and Signal Processing-Proceedings, Toronto, ON, Canada, 6–11 June 2021. Toronto, 2021, pp. 6309–6313. https://doi.org/10.1109/ICASSP39728.2021.9414248</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Zheng, W. Multi-scale discrepancy adversarial network for crosscorpus speech emotion recognition / W. Zheng, Y. Zong // Virtual Reality and Intelligent Hardware. – 2021. – Vol. 3, no. 1. – P. 65–75. https://doi.org/10.1016/j.vrih.2020.11.006</mixed-citation><mixed-citation xml:lang="en">Zheng W., Zong Y. Multi-scale discrepancy adversarial network for crosscorpus speech emotion recognition. Virtual Reality and Intelligent Hardware, 2021, vol. 3, no. 1, pp. 65–75. https://doi.org/10.1016/j.vrih.2020.11.006</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">Hazjan, V. Context-independent multilingual emotion recognition from speech signals / V. Hazjan, Z. Kacic // Intern. J. of Speech Technology. – 2003. – Vol. 6, no. 3. – P. 311–320.</mixed-citation><mixed-citation xml:lang="en">Hazjan V., Kacic Z. Context-independent multilingual emotion recognition from speech signals. International Journal of Speech Technology, 2003, vol. 6, no. 3, pp. 311–320.</mixed-citation></citation-alternatives></ref><ref id="cit23"><label>23</label><citation-alternatives><mixed-citation xml:lang="ru">Zhang, C. Autoencoder with emotion embedding for speech emotion recognition / C. Zhang, L. Xue // IEEE Access. – 2021. – Vol. 9. – P. 51231–51241. https://doi.org/10.1109/ACCESS.2021.3069818</mixed-citation><mixed-citation xml:lang="en">Zhang C., Xue L. Autoencoder with emotion embedding for speech emotion recognition. IEEE Access, 2021, vol. 9, pp. 51231–51241. https://doi.org/10.1109/ACCESS.2021.3069818</mixed-citation></citation-alternatives></ref><ref id="cit24"><label>24</label><citation-alternatives><mixed-citation xml:lang="ru">Kanwal, S. Speech emotion recognition using clustering based GA-optimized feature set / S. Kanwal, S. Asghar // IEEE Access. – 2021. – Vol. 9. – P. 125830–125842. https://doi.org/10.1109/ACCESS.2021.3111659</mixed-citation><mixed-citation xml:lang="en">Kanwal S., Asghar S. Speech emotion recognition using clustering based GA-optimized feature set. IEEE Access, 2021, vol. 9, pp. 125830–125842. https://doi.org/10.1109/ACCESS.2021.3111659</mixed-citation></citation-alternatives></ref><ref id="cit25"><label>25</label><citation-alternatives><mixed-citation xml:lang="ru">Byoung, C. K. A brief review of facial emotion recognition based on visual information / C. K. Byoung // Sensors. – 2018. – Vol. 18, iss. 2. – Р. 401. https://doi.org/10.3390/s18020401</mixed-citation><mixed-citation xml:lang="en">Byoung C. K. A brief review of facial emotion recognition based on visual information. Sensors, 2018, vol. 18, iss. 2, рр. 401. https://doi.org/10.3390/s18020401</mixed-citation></citation-alternatives></ref><ref id="cit26"><label>26</label><citation-alternatives><mixed-citation xml:lang="ru">Audio-visual emotion recognition using deep transfer learning and multiple temporal models / X. Ouyang [et al.] // ICMI '17 : Proc. of the 19th ACM Intern. Conf. on Multimodal Interaction, Glasgow, United Kingdom, 13–17 November 2017. – Glasgow, 2017. – P. 577–582. https://doi.org/10.1145/3136755.3143012</mixed-citation><mixed-citation xml:lang="en">Ouyang X., Kawaai S., Goh E. G. H., Shen S., Ding W., …, D.-Y. Huang. Audio-visual emotion recognition using deep transfer learning and multiple temporal models. ICMI '17 : Proceedings of the 19th ACM International Conference on Multimodal Interaction, Glasgow, United Kingdom, 13–17 November 2017. Glasgow, 2017, pp. 577–582. https://doi.org/10.1145/3136755. 3143012</mixed-citation></citation-alternatives></ref><ref id="cit27"><label>27</label><citation-alternatives><mixed-citation xml:lang="ru">Hassani, B. Facial expression recognition using enhanced deep 3D convolutional neural networks / B. Hassani, M. H. Mahoor // 2017 IEEE Conf. on Computer Vision and Pattern Recognition Workshops (CVPRW), Honolulu, HI, USA, 21–26 July 2017. – Honolulu, 2017. – Р. 1955–1962. https://doi.org/10.1109/CVPRW.2017.282</mixed-citation><mixed-citation xml:lang="en">Hassani B., Mahoor M. H. Facial expression recognition using enhanced deep 3D convolutional neural networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Honolulu, HI, USA, 21–26 July 2017. Honolulu, 2017, рр. 1955–1962. https://doi.org/10.1109/CVPRW.2017.282</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
