<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">inform</journal-id><journal-title-group><journal-title xml:lang="ru">Информатика</journal-title><trans-title-group xml:lang="en"><trans-title>Informatics</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1816-0301</issn><issn pub-type="epub">2617-6963</issn><publisher><publisher-name>UIIP NASB</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.37661/1816-0301-2023-20-1-40-54</article-id><article-id custom-type="elpub" pub-id-type="custom">inform-1225</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ИНТЕЛЛЕКТУАЛЬНЫЕ СИСТЕМЫ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>INTELLIGENT SYSTEMS</subject></subj-group></article-categories><title-group><article-title>Увеличение точности реидентификации людей на основе двухэтапного обучения сверточных нейронных сетей и аугментации</article-title><trans-title-group xml:lang="en"><trans-title>Improving person re-identification based on two-stage training of convolutional neural networks  and augmentation</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-9780-5731</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Игнатьева</surname><given-names>С. А.</given-names></name><name name-style="western" xml:lang="en"><surname>Ihnatsyeva</surname><given-names>S. A.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Игнатьева Светлана Александровна, магистр тех- нических наук, аспирант кафедры вычислительных систем и сетей</p><p>ул. Блохина, 29, Новополоцк, 211440</p></bio><bio xml:lang="en"><p>Sviatlana A. Ihnatsyeva, M. Sc. (Eng.), Postgraduate Student of the Department of Computing Systems and Networks</p><p>st. Blokhina, 29, Novopolotsk, 211440</p></bio><email xlink:type="simple">s.ignatieva@psu.by</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-6609-5810</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Богуш</surname><given-names>Р. П.</given-names></name><name name-style="western" xml:lang="en"><surname>Bohush</surname><given-names>R. P.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Богуш Рихард Петрович, доктор технических наук, доцент, заведующий кафедрой вычислительных систем и сетей</p><p>ул. Блохина, 29, Новополоцк, 211440</p></bio><bio xml:lang="en"><p>Rykhard P. Bohush, D. Sc. (Eng.), Assoc. Prof., Head of the Department of Computing Systems and Networks</p><p>st. Blokhina, 29, Novopolotsk, 211440</p></bio><email xlink:type="simple">r.bogush@psu.by</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Полоцкий государственный университет имени Евфросинии Полоцкой</institution></aff><aff xml:lang="en"><institution>Euphrosyne Polotskaya State University of Polotsk</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2023</year></pub-date><pub-date pub-type="epub"><day>29</day><month>03</month><year>2023</year></pub-date><volume>20</volume><issue>1</issue><fpage>40</fpage><lpage>54</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Игнатьева С.А., Богуш Р.П., 2023</copyright-statement><copyright-year>2023</copyright-year><copyright-holder xml:lang="ru">Игнатьева С.А., Богуш Р.П.</copyright-holder><copyright-holder xml:lang="en">Ihnatsyeva S.A., Bohush R.P.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://inf.grid.by/jour/article/view/1225">https://inf.grid.by/jour/article/view/1225</self-uri><abstract><sec><title>Цели</title><p>Цели. Основной целью является повышение точности повторной идентификации людей в распределенных системах видеонаблюдения.</p></sec><sec><title>Методы</title><p>Методы. Используются методы машинного обучения.</p></sec><sec><title>Результаты</title><p>Результаты. Представлена технология двухэтапного обучения сверточных нейронных сетей (СНС), отличающаяся использованием аугментации изображений для предварительного этапа и точной настройки весовых коэффициентов на основе исходного набора изображений. На первом этапе обучение осуществляется на аугментированных данных, затем выполняется точная настройка СНС на исходных изображениях, что способствует повышению эффективности ре-идентификации за счет уменьшения потерь при обучении. Использование на двух этапах разных данных не позволяет СНС запоминать тренировочные примеры, тем самым предотвращая переобучение.</p><p>Предложенный метод расширения набора данных для обучения отличается тем, что совмещает циклический сдвиг пикселей изображения, исключение цветности и замещение фрагмента уменьшенной копией другого из пакета, подаваемого на вход СНС. Данный метод аугментации позволяет увеличить разнообразие обучающих данных, что повышает робастность СНС ко многим факторам: перекрытию людей, изменению освещенности, уменьшению разрешения изображения, зависимости от местоположения отличительных особенностей объекта интереса.</p></sec><sec><title>Заключение</title><p>Заключение. Применение технологии двухэтапного обучения и предложенного метода аугментации данных позволило повысить точность повторной идентификации людей для разных СНС и наборов данных в метриках: Rank1 на 4% – 21%; mAP на 10% – 31%; mINP на 39% – 60%.</p></sec></abstract><trans-abstract xml:lang="en"><sec><title>Objectives</title><p>Objectives. The main goal is to improve person re-identification accuracy in distributed video surveillance systems.</p></sec><sec><title>Methods</title><p>Methods. Machine learning methods are applied.</p></sec><sec><title>Result</title><p>Result. A technology for two-stage training of convolutional neural networks (CNN) is presented, characterized by the use of image augmentation for the preliminary stage and fine tuning of weight coefficients based on the original images set for training. At the first stage, training is carried out on augmented data, at the second stage, fine tuning of the CNN is performed on the original images, which allows minimizing the losses and increasing model efficiency. The use of different data at different training stages does not allow the CNN to remember training examples, thereby preventing overfitting.</p><p>Proposed method as expanding the training sample differs as it combines an image pixels cyclic shift, color  exclusion and fragment replacement with a reduced copy of another image. This augmentation method allows to get a wide variety of training data, which increases the CNN robustness to occlusions, illumination, low image resolution, dependence on the location of features.</p></sec><sec><title>Conclusion</title><p>Conclusion. The use of two-stage learning technology and the proposed data augmentation method made it possible to increase the person re-identification accuracy for different CNNs and datasets: in the Rank1 metric  by 4–21 %; in the mAP by 10–31 %; in the mINP by 39–60 %. </p></sec></trans-abstract><kwd-group xml:lang="ru"><kwd>Ре-идентификация людей</kwd><kwd>сверточные нейронные сети</kwd><kwd>предварительное обучение</kwd><kwd>точная настройка</kwd><kwd>расширение обучающей выборки</kwd></kwd-group><kwd-group xml:lang="en"><kwd>Person re-identification</kwd><kwd>convolutional neural network</kwd><kwd>pre-train</kwd><kwd>fine tuning</kwd><kwd>augmentation</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">ImageNet: A large-scale hierarchical image database / J. Deng [et al.] // 2009 IEEE Conf. on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009. – Miami, 2009. – P. 248–255. https://doi.org/10.1109/CVPR.2009.5206848</mixed-citation><mixed-citation xml:lang="en">Deng J., Dong W., Socher R., Li L., …, Fei-Fei L. ImageNet: A large-scale hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009. Miami, 2009, pр. 248–255. https://doi.org/10.1109/CVPR.2009.5206848</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Unsupervised pre-training for person re-identification / D. Fu [et al.] // 2021 IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021. – Nashville, 2021. – P. 14745–14754. https://doi.org/10.1109/CVPR46437.2021.01451</mixed-citation><mixed-citation xml:lang="en">Fu D., Chen D., Bao J., Yang H., Yuan L., …, Chen D. Unsupervised pre-training for person re-identification. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021. Nashville, 2021, pр. 14745–14754. https://doi.org/10.1109/CVPR46437.2021.01451</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Богатырева, А. А. Исследование способности к transfer learning сверточных нейронных сетей, обученных на ImageNet / А. А. Богатырева, А. Р. Виноградова, С. А. Тихомирова // Междунар. журнал прикладных и фундаментальных исследований. – 2019. – № 7. – С. 106–111.</mixed-citation><mixed-citation xml:lang="en">Bogatyreva A. A., Vinogradova A. R., Tikhomirova S. A. Study of the ability of convolution neural networks pretrained on ImageNet to transfer learning. Mezhdunarodnyj zhurnal prikladnyh i fundamental'nyh issledovanij [International Journal of Applied and Fundamentall Research], 2019, no. 7, pp. 106–111 (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Конарев, Д. И. Повышение точности предварительно обученных нейронных сетей путем тонкой настройки / Д. И. Конарев, А. А. Гуламов // Материалы конф. «Информационные технологии в управлении», Санкт-Петербург, 6–8 окт. 2020 г. – СПб., 2020. – С. 200–212.</mixed-citation><mixed-citation xml:lang="en">Konarev D. I., Gulamov A. A. Improving the accuracy of pretrained neural networks by fine-tuning. Materialy konferencii "Informacionnye tehnologii v upravlenii", Sankt-Peterburg, 6–8 oktjabrja 2020 g. [Proceedings of the Conference "Information Technologies in Management", Saint Petersburg, 6–8 October 2020]. Saint Petersburg, 2020, рр. 200–212 (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">DeVries, T. Improved Regularization of Convolutional Neural Networks with CutOut / T. DeVries, G. W. Taylor. – 2017. – Mode of access: https://doi.org/10.48550/arXiv.1708.04552. – Date of access: 09.08.2022.</mixed-citation><mixed-citation xml:lang="en">DeVries T., Taylor G. W. Improved Regularization of Convolutional Neural Networks with CutOut, 2017. Available at: https://doi.org/10.48550/arXiv.1708.04552 (accessed 09.08.2022).</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Dropout: A simple way to prevent neural networks from overfitting / N. Srivastava [et al.] // J. of Machine Learning Research. – 2014. – No. 15. – P. 1929–1958. https://doi.org/10.5555/2627435.2670313</mixed-citation><mixed-citation xml:lang="en">Srivastava N., Hinton G. E., Krizhevsky A., Sutskever I., Salakhutdinov R. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 2014, no. 15, pp. 1929–1958. https://doi.org/10.5555/2627435.2670313</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Choice of activation function in convolution neural network for person re-identification in video surveillance systems / H. Chen [et al.] // Programming and Computer Software. – 2022. – Vol. 48, no. 5. – P. 312–321. http://doi.org/10.1134/S0361768822050036</mixed-citation><mixed-citation xml:lang="en">Chen H., Ihnatsyeva S., Bohush R., Ablameyko S. Choice of activation function in convolution neural network for person re-identification in video surveillance systems. Programming and Computer Software, 2022, vol. 48, no. 5, pp. 312–321. http://doi.org/10.1134/S0361768822050036</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Random Erasing Data Augmentation / Z. Zhong [et al.]. – 2020. – Mode of access: https://doi.org/10.1609/AAAI.V34I07.7000. – Date of access: 09.08.2022.</mixed-citation><mixed-citation xml:lang="en">Zhong Z., Zheng L., Kang G., Li S., Yang Y. Random Erasing Data Augmentation, 2020. Available at: https://doi.org/10.1609/AAAI.V34I07.7000 (accessed 09.08.2022).</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">CutMix: Regularization strategy to train strong classifiers with localizable features / S. Yun [et al.] // 2019 IEEE/CVF Intern. Conf. on Computer Vision (ICCV), Seoul, Korea (South), 27 Oct. – 2 Nov. 2019. – Seoul, 2019. – P. 6022–6031. https://doi.org/10.1109/ICCV.2019.00612</mixed-citation><mixed-citation xml:lang="en">Yun S., Han D., Oh S., Chun S., Choe J., Yoo Y. J. CutMix: Regularization strategy to train strong classifiers with localizable features. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), 27 October – 2 November 2019. Seoul, 2019, pр. 6022–6031. https://doi.org/10.1109/ICCV.2019.00612</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Cut-thumbnail: A novel data augmentation for convolutional neural network / T. Xie [et al.] // Proc. of the 29th ACM Intern. Conf. on Multimedia, Virtual Event, China, 20–24 Oct. 2021. – Virtual Event, China, 2021. – Р. 1627–1635. https://doi.org/10.1145/3474085.3475302</mixed-citation><mixed-citation xml:lang="en">Xie T., Cheng X., Wang X., Liu M., Deng J., …, Liu M. Cut-thumbnail: A novel data augmentation for convolutional neural network. Proceedings of the 29th ACM International Conference on Multimedia, Virtual Event, China, 20–24 October 2021. Virtual Event, China, 2021. https://doi.org/10.1145/3474085.3475302</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Mixup: Beyond Empirical Risk Minimization / H. Zhang [et al.]. – 2018. – Mode of access: https://doi.org/10.48550/arXiv.1710.09412. – Date of access: 09.08.2022.</mixed-citation><mixed-citation xml:lang="en">Zhang H., Cissé M., Dauphin Y., Lopez-Paz D. Mixup: Beyond Empirical Risk Minimization, 2018. Available at: https://doi.org/10.48550/arXiv.1710.09412 (accessed 09.08.2022).</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">ImageNet-Trained CNNs are Biased Towards Texture; Increasing Shape Bias Improves Accuracy and Robustness / R. Geirhos [et al.]. – 2019. – Mode of access: https://doi.org/10.48550/arXiv.1811.12231. – Date of access: 09.08.2022.</mixed-citation><mixed-citation xml:lang="en">Geirhos R., Rubisch P., Michaelis C., Bethge M., Wichmann F., Brendel W. ImageNet-Trained CNNs are Biased Towards Texture; Increasing Shape Bias Improves Accuracy and Robustness, 2019. Available at: https://doi.org/10.48550/arXiv.1811.12231 (accessed 09.08.2022).</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Gong, Y. An Effective Data Augmentation for Person Re-identification / Y. Gong, Z. Zeng. – 2021. – Mode of access: https://doi.org/10.48550/arXiv.2101.08533. – Date of access: 09.08.2022.</mixed-citation><mixed-citation xml:lang="en">Gong Y., Zeng Z. An Effective Data Augmentation for Person Re-identification, 2021. Available at: https://doi.org/10.48550/arXiv.2101.08533 (accessed 09.08.2022).</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Adversarially occluded samples for person re-identification / H. Huang [et al.] // 2018 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018. – Salt Lake City, 2018. – P. 5098–5107. https://doi.org/10.1109/CVPR.2018.00535</mixed-citation><mixed-citation xml:lang="en">Huang H., Li D., Zhang Z., Chen X., Huang K. Adversarially occluded samples for person re-identification. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018. Salt Lake City, 2018, pр. 5098–5107. https://doi.org/10.1109/CVPR.2018.00535</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Deep learning for person re-identification: A survey and outlook / M. Ye [et al.] // IEEE Transactions on Pattern Analysis and Machine Intelligence. – 2021. – Vol. 44, iss. 6. – Р. 2872–2893. https://doi.org/10.1109/TPAMI.2021.3054775</mixed-citation><mixed-citation xml:lang="en">Ye M., Shen J., Lin G., Xiang T., Shao L., Hoi S. C. Deep learning for person re-identification: A survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, vol. 44, iss. 6, рр. 2872–2893. https://doi.org/10.1109/TPAMI.2021.3054775</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Deep residual learning for image recognition / K. He [et al.] // 2016 IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. – Las Vegas, 2016. – P. 770–778. https://doi.org/10.1109/cvpr.2016.90</mixed-citation><mixed-citation xml:lang="en">He K., Zhang X., Ren S., Sun J. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. Las Vegas, 2016, рр. 770–778. https://doi.org/10.1109/cvpr.2016.90</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Huang, G. Densely connected convolutional networks / G. Huang, Z. Liu, K. Q. Weinberger // 2017 IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017. – Honolulu, 2017. – P. 2261–2269. https://doi.org/10.1109/CVPR.2017.243</mixed-citation><mixed-citation xml:lang="en">Huang G., Liu Z., Weinberger K. Q. Densely connected convolutional networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017. Honolulu, 2017, рр. 2261–2269. https://doi.org/10.1109/CVPR.2017.243</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Scalable person re-identification: A benchmark / L. Zheng [et al.] // 2015 IEEE Intern. Conf. on Computer Vision (ICCV), Santiago, Chile, 7–13 Dec. 2015. – Santiago, 2015. – P. 1116–1124. https://doi.org/10.1109/ICCV.2015.133</mixed-citation><mixed-citation xml:lang="en">Zheng L., Shen L., Tian L., Wang S., Wang J., Tian Q. Scalable person re-identification: A benchmark. 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015. Santiago, 2015, рр. 1116–1124. https://doi.org/10.1109/ICCV.2015.133</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Performance Measures and a Data Set for Multi-target, Multi-camera Tracking / E. Ristani [et al.]. – 2016. – Mode of access: https://doi.org/10.1007/978-3-319-48881-3_2. – Date of access: 09.08.2022.</mixed-citation><mixed-citation xml:lang="en">Ristani E., Solera F., Zou R. S., Cucchiara R., Tomasi C. Performance Measures and a Data Set for Multi-target, Multi-camera Tracking, 2016. Available at: https://doi.org/10.1007/978-3-319-48881-3_2 (accessed 09.08.2022).</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Person transfer GAN to bridge domain gap for person re-identification / L. Wei [et al.] // 2018 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018. – Salt Lake City, 2018. – P. 79–88. https://doi.org/10.1109/CVPR.2018.00016</mixed-citation><mixed-citation xml:lang="en">Wei L., Zhang S., Gao W., Tian Q. Person transfer GAN to bridge domain gap for person re-identification. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018. Salt Lake City, 2018, рр. 79–88. https://doi.org/10.1109/CVPR.2018.00016</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
