<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">inform</journal-id><journal-title-group><journal-title xml:lang="ru">Информатика</journal-title><trans-title-group xml:lang="en"><trans-title>Informatics</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1816-0301</issn><issn pub-type="epub">2617-6963</issn><publisher><publisher-name>UIIP NASB</publisher-name></publisher></journal-meta><article-meta><article-id custom-type="elpub" pub-id-type="custom">inform-3</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ОБРАБОТКА СИГНАЛОВ, ИЗОБРАЖЕНИЙ, РЕЧИ, ТЕКСТА И РАСПОЗНАВАНИЕ ОБРАЗОВ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>SIGNAL, IMAGE, SPEECH, TEXT PROCESSING AND PATTERN RECOGNITION</subject></subj-group></article-categories><title-group><article-title>МЕТОД ОБЪЕДИНЕНИЯ РЕШЕНИЙ КЛАССИФИКАТОРОВ ДЛЯ ЗАДАЧИ РАСПОЗНАВАНИЯ ДИКТОРА</article-title><trans-title-group xml:lang="en"><trans-title>MERGING CLASSIFIER DECISIONS FOR SPEAKER RECOGNITION</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Имамвердиев</surname><given-names>Я. Н.</given-names></name><name name-style="western" xml:lang="en"><surname>Imamverdiyev</surname><given-names>Y. N.</given-names></name></name-alternatives><email xlink:type="simple">yadigar@lan.ab.az</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Сухостат</surname><given-names>Л. В.</given-names></name><name name-style="western" xml:lang="en"><surname>Sukhostat</surname><given-names>L. V.</given-names></name></name-alternatives><email xlink:type="simple">lsuhostat@hotmail.com</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff xml:lang="ru" id="aff-1"><institution>Институт информационных технологий Национальной академии наук Азербайджана</institution><country>Russian Federation</country></aff><pub-date pub-type="collection"><year>2015</year></pub-date><pub-date pub-type="epub"><day>25</day><month>09</month><year>2016</year></pub-date><volume>0</volume><issue>1</issue><fpage>17</fpage><lpage>25</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Имамвердиев Я.Н., Сухостат Л.В., 2016</copyright-statement><copyright-year>2016</copyright-year><copyright-holder xml:lang="ru">Имамвердиев Я.Н., Сухостат Л.В.</copyright-holder><copyright-holder xml:lang="en">Imamverdiyev Y.N., Sukhostat L.V.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://inf.grid.by/jour/article/view/3">https://inf.grid.by/jour/article/view/3</self-uri><abstract><p>Предлагается использование нечетких интегралов для объединения решений классификаторовсистем распознавания диктора. В качестве набора признаков рассматриваются мгновенная частота и мгновенная амплитуда. Предлагаемый метод показывает значительно лучшие результаты по сравнению с применением единственного классификатора. Проводится сравнение предлагаемого метода с другими методами объединения решений классификаторов.</p></abstract><trans-abstract xml:lang="en"><p>The paper proposes using fuzzy integrals for merging classifier decisions in speaker recognitionsystems. Instantaneous frequency and instantaneous amplitude are considered as the set of features. The approach shows significantly better results than a single classifier. A comparison of the proposed approach with the other methods for merging classifier decisions is provided.</p></trans-abstract></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Ross, A.A. Handbook of Multibiometrics / A.A. Ross, K. Nandakumar, A.K. Jain. – London : Springer, 2006. – 198 p.</mixed-citation><mixed-citation xml:lang="en">Ross, A.A. Handbook of Multibiometrics / A.A. Ross, K. Nandakumar, A.K. Jain. – London : Springer, 2006. – 198 p.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Solomonoff, A. Advances in channel compensation for SVM speaker recognition / A. Solomonoff, W. Campbell, I. Boardman // Proc. of ICASSP. – Philadelphia, PA, 2005. – P. 629– 632.</mixed-citation><mixed-citation xml:lang="en">Solomonoff, A. Advances in channel compensation for SVM speaker recognition / A. Solomonoff, W. Campbell, I. Boardman // Proc. of ICASSP. – Philadelphia, PA, 2005. – P. 629– 632.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Gader, P.D. Fusion of handwritten word classifiers / P.D. Gader, M.A. Mohamed, J.M. Keller //Pattern Recognition Letters. – 1996. – № 17. – P. 577–584.4. Michel, G. The representation of importance and interaction of features by fuzzy measure /G. Michel // Pattern Recognition Letters. – 1996. – № 17. – P. 567–575.</mixed-citation><mixed-citation xml:lang="en">Gader, P.D. Fusion of handwritten word classifiers / P.D. Gader, M.A. Mohamed, J.M. Keller //Pattern Recognition Letters. – 1996. – № 17. – P. 577–584.4. Michel, G. The representation of importance and interaction of features by fuzzy measure /G. Michel // Pattern Recognition Letters. – 1996. – № 17. – P. 567–575.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Kuncheva, L.I. Decision templates for multiple classifier fusion: an experimental comparison / L.I. Kuncheva, J.C. Bezdek, R.P.W. Duin // Pattern Recognition. – 2001. – № 34. – P. 299–314.</mixed-citation><mixed-citation xml:lang="en">Kuncheva, L.I. Decision templates for multiple classifier fusion: an experimental comparison / L.I. Kuncheva, J.C. Bezdek, R.P.W. Duin // Pattern Recognition. – 2001. – № 34. – P. 299–314.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Mirhosseini, A.R. Human face image recognition: an evidence aggregation approach / A.R. Mirhosseini, H. Yan // Computer Vision and Image Understanding. – 1998. – № 71. – P. 213– 230.</mixed-citation><mixed-citation xml:lang="en">Mirhosseini, A.R. Human face image recognition: an evidence aggregation approach / A.R. Mirhosseini, H. Yan // Computer Vision and Image Understanding. – 1998. – № 71. – P. 213– 230.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Pham, T.D. Color image segmentation using fuzzy integral and mountain clustering / T.D. Pham, H. Yan // Fuzzy sets and systems. – 1999. – № 107. – P. 121–130.</mixed-citation><mixed-citation xml:lang="en">Pham, T.D. Color image segmentation using fuzzy integral and mountain clustering / T.D. Pham, H. Yan // Fuzzy sets and systems. – 1999. – № 107. – P. 121–130.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Kwak, K.-C. Face recognition using fuzzy integral and wavelet decomposition method / K.- C. Kwak, W. Pedrycz // IEEE Transactions on Systems, Man, and Cybernetics. – 2004. – № 34. – P. 1666–1675.</mixed-citation><mixed-citation xml:lang="en">Kwak, K.-C. Face recognition using fuzzy integral and wavelet decomposition method / K.- C. Kwak, W. Pedrycz // IEEE Transactions on Systems, Man, and Cybernetics. – 2004. – № 34. – P. 1666–1675.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Auephanwiriyakul, S. Generalized Choquet fuzzy integral fusion / S. Auephanwiriyakul, M.K. James, P.D. Gader // Information Fusion. – 2002. – № 3. – P. 69–85.</mixed-citation><mixed-citation xml:lang="en">Auephanwiriyakul, S. Generalized Choquet fuzzy integral fusion / S. Auephanwiriyakul, M.K. James, P.D. Gader // Information Fusion. – 2002. – № 3. – P. 69–85.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Wolf, J.J. Efficient acoustic parameters for speaker recognition / J.J. Wolf // J. Acoustical Society of America. – 1982. – Vol. 51, № 6. – P. 2044–2056.</mixed-citation><mixed-citation xml:lang="en">Wolf, J.J. Efficient acoustic parameters for speaker recognition / J.J. Wolf // J. Acoustical Society of America. – 1982. – Vol. 51, № 6. – P. 2044–2056.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Kinnunen, T. An overview of text-independent speaker recognition: from features to supervectors / T. Kinnunen, H. Li // Speech Communication. – 2010. – Vol. 52, № 1. – P. 12– 40.</mixed-citation><mixed-citation xml:lang="en">Kinnunen, T. An overview of text-independent speaker recognition: from features to supervectors / T. Kinnunen, H. Li // Speech Communication. – 2010. – Vol. 52, № 1. – P. 12– 40.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Rose, P. Forensic speaker identification. Taylor &amp; Francis forensic science series / P. Rose. N.Y. : Taylor &amp; Francis, 2002. – 380 p.</mixed-citation><mixed-citation xml:lang="en">Rose, P. Forensic speaker identification. Taylor &amp; Francis forensic science series / P. Rose. N.Y. : Taylor &amp; Francis, 2002. – 380 p.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Kinnunen, T. Spectral features for automatic text-independent speaker recognition. Licentiate thesis / T. Kinnunen. – Finland : University of Joensuu, 2003.</mixed-citation><mixed-citation xml:lang="en">Kinnunen, T. Spectral features for automatic text-independent speaker recognition. Licentiate thesis / T. Kinnunen. – Finland : University of Joensuu, 2003.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Маркел, Дж. Линейное предсказание речи / Дж. Маркел, А.X. Грей. – М. : Связь, 1980. – 308 с.</mixed-citation><mixed-citation xml:lang="en">Маркел, Дж. Линейное предсказание речи / Дж. Маркел, А.X. Грей. – М. : Связь, 1980. – 308 с.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Furui, S. Cepstral analysis techniques for automatic speaker verification / S. Furui // IEEE tran. acoust., speech, signal processing. – 1981. – Vol. 27. – P. 254–272.</mixed-citation><mixed-citation xml:lang="en">Furui, S. Cepstral analysis techniques for automatic speaker verification / S. Furui // IEEE tran. acoust., speech, signal processing. – 1981. – Vol. 27. – P. 254–272.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Reynolds, D. Channel robust speaker verification via feature mapping / D. Reynolds // Proc. of ICASSP. – Hong Kong, 2003. – Vol. 2. – P. 53–56.</mixed-citation><mixed-citation xml:lang="en">Reynolds, D. Channel robust speaker verification via feature mapping / D. Reynolds // Proc. of ICASSP. – Hong Kong, 2003. – Vol. 2. – P. 53–56.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Doddington, G. Speaker recognition based on idiolectal differences between speakers / G. Doddington // Proc. of Eurospeech. – Aalborg, Denmark, 2001. – Vol. 4. – P. 2521–2524.</mixed-citation><mixed-citation xml:lang="en">Doddington, G. Speaker recognition based on idiolectal differences between speakers / G. Doddington // Proc. of Eurospeech. – Aalborg, Denmark, 2001. – Vol. 4. – P. 2521–2524.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Hemant, A.P. Forensic Speaker Recognition / A.P. Hemant, Amy Neustein. – Heidelberg : Springer, 2012. – 540 p.</mixed-citation><mixed-citation xml:lang="en">Hemant, A.P. Forensic Speaker Recognition / A.P. Hemant, Amy Neustein. – Heidelberg : Springer, 2012. – 540 p.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Benediktsson, J.A. Consensus theoretic classification methods / J.A. Benediktsson, P.H. Swain // IEEE Trans. Systems Man Cybernet. – 1992. – № 22. – P. 688–704.</mixed-citation><mixed-citation xml:lang="en">Benediktsson, J.A. Consensus theoretic classification methods / J.A. Benediktsson, P.H. Swain // IEEE Trans. Systems Man Cybernet. – 1992. – № 22. – P. 688–704.</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Ho, T.K. Decision combination in multiple classifier systems / T.K. Ho, J.J. Hull, S.N. Srihari // IEEE Trans. Pattern Anal. Machine Intelligence. – 1994. – № 16. – P. 66–75.</mixed-citation><mixed-citation xml:lang="en">Ho, T.K. Decision combination in multiple classifier systems / T.K. Ho, J.J. Hull, S.N. Srihari // IEEE Trans. Pattern Anal. Machine Intelligence. – 1994. – № 16. – P. 66–75.</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Xu, L. Methods of combining multiple classifiers and their applications to hand-written character recognition / L. Xu, A. Krzyzak, C.Y. Suen // IEEE Trans. Systems Man Cybernet. – 1992. – № 23. – P. 418–435.</mixed-citation><mixed-citation xml:lang="en">Xu, L. Methods of combining multiple classifiers and their applications to hand-written character recognition / L. Xu, A. Krzyzak, C.Y. Suen // IEEE Trans. Systems Man Cybernet. – 1992. – № 23. – P. 418–435.</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Soong, F.K. On the use of instantaneous and transitional spectral information in speaker recognition / F.K. Soong, A.E. Rosenberg // IEEE Trans. Acoust. Speech, Signal Process. – 1988. – ASSP-36. – P. 871–879.</mixed-citation><mixed-citation xml:lang="en">Soong, F.K. On the use of instantaneous and transitional spectral information in speaker recognition / F.K. Soong, A.E. Rosenberg // IEEE Trans. Acoust. Speech, Signal Process. – 1988. – ASSP-36. – P. 871–879.</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">Farrell, K.R. Text-dependent speaker verification using data fusion / K.R. Farrell // IEEE Intern. Conf. on Acoustic, Speech and Signal Processing. – Detroit, Michigan, USA, 1995. – P. 349–352.</mixed-citation><mixed-citation xml:lang="en">Farrell, K.R. Text-dependent speaker verification using data fusion / K.R. Farrell // IEEE Intern. Conf. on Acoustic, Speech and Signal Processing. – Detroit, Michigan, USA, 1995. – P. 349–352.</mixed-citation></citation-alternatives></ref><ref id="cit23"><label>23</label><citation-alternatives><mixed-citation xml:lang="ru">Sub-word speaker verification using data fusion methods / K.R. Farrell [et al.] // IEEE Workshop on Neural Networks for Signal Processing. – Amelia Island, Florida, 1997. – P. 531–540.</mixed-citation><mixed-citation xml:lang="en">Sub-word speaker verification using data fusion methods / K.R. Farrell [et al.] // IEEE Workshop on Neural Networks for Signal Processing. – Amelia Island, Florida, 1997. – P. 531–540.</mixed-citation></citation-alternatives></ref><ref id="cit24"><label>24</label><citation-alternatives><mixed-citation xml:lang="ru">Farrell, K.R. An analysis of data fusion methods for speaker verification / K.R. Farrell, R.P. Ramachandran, R.J. Mammone // IEEE Intern. Conf. on Acoustic, Speech and Signal Processing. – Washington, USA, 1998. – P. 1129–1132.</mixed-citation><mixed-citation xml:lang="en">Farrell, K.R. An analysis of data fusion methods for speaker verification / K.R. Farrell, R.P. Ramachandran, R.J. Mammone // IEEE Intern. Conf. on Acoustic, Speech and Signal Processing. – Washington, USA, 1998. – P. 1129–1132.</mixed-citation></citation-alternatives></ref><ref id="cit25"><label>25</label><citation-alternatives><mixed-citation xml:lang="ru">Schalkwyk, J. Speaker verification with low storage requirements / J. Schalkwyk, N. Jain, E. Barnard // IEEE Intern. Conf. on Acoustic, Speech and Signal Processing. – Georgia, USA, 996. – P. 693–696.</mixed-citation><mixed-citation xml:lang="en">Schalkwyk, J. Speaker verification with low storage requirements / J. Schalkwyk, N. Jain, E. Barnard // IEEE Intern. Conf. on Acoustic, Speech and Signal Processing. – Georgia, USA, 996. – P. 693–696.</mixed-citation></citation-alternatives></ref><ref id="cit26"><label>26</label><citation-alternatives><mixed-citation xml:lang="ru">Zadeh, L.A. Fuzzy sets / L.A. Zadeh // Information and Control. – 1965. – № 8. – P. 338– 353.</mixed-citation><mixed-citation xml:lang="en">Zadeh, L.A. Fuzzy sets / L.A. Zadeh // Information and Control. – 1965. – № 8. – P. 338– 353.</mixed-citation></citation-alternatives></ref><ref id="cit27"><label>27</label><citation-alternatives><mixed-citation xml:lang="ru">Gupta, M.M. Fuzzy measures and fuzzy integrals / M.M. Gupta, G.N. Saridis, B.R. Gaines. – N.Y. : Elsevier, 1977. – 510 p.29. Murofushi, T. A theory of fuzzy measures. Representation, the Choquet integral and null sets / T. Murofushi, M. Sugeno // J. Math. Anal. Appl. – 1991. – Vol. 159, № 2. – P. 532–549.</mixed-citation><mixed-citation xml:lang="en">Gupta, M.M. Fuzzy measures and fuzzy integrals / M.M. Gupta, G.N. Saridis, B.R. Gaines. – N.Y. : Elsevier, 1977. – 510 p.29. Murofushi, T. A theory of fuzzy measures. Representation, the Choquet integral and null sets / T. Murofushi, M. Sugeno // J. Math. Anal. Appl. – 1991. – Vol. 159, № 2. – P. 532–549.</mixed-citation></citation-alternatives></ref><ref id="cit28"><label>28</label><citation-alternatives><mixed-citation xml:lang="ru">Maragos, P. On amplitude and frequency demodulation using energy operators / P. Maragos, J.F. Kaiser, T.F. Quatieri // IEEE Trans. on Signal Processing. – 1993. – Vol. 41, № 4. – P. 1532–1550.</mixed-citation><mixed-citation xml:lang="en">Maragos, P. On amplitude and frequency demodulation using energy operators / P. Maragos, J.F. Kaiser, T.F. Quatieri // IEEE Trans. on Signal Processing. – 1993. – Vol. 41, № 4. – P. 1532–1550.</mixed-citation></citation-alternatives></ref><ref id="cit29"><label>29</label><citation-alternatives><mixed-citation xml:lang="ru">Zhang, W.D. A priori threshold determination for phrase-prompted speaker verification / W.D. Zhang [et al.] // Proc. Eurospeech’99. – Budapest, Hungary, 1999. – P. 1203–1206.</mixed-citation><mixed-citation xml:lang="en">Zhang, W.D. A priori threshold determination for phrase-prompted speaker verification / W.D. Zhang [et al.] // Proc. Eurospeech’99. – Budapest, Hungary, 1999. – P. 1203–1206.</mixed-citation></citation-alternatives></ref><ref id="cit30"><label>30</label><citation-alternatives><mixed-citation xml:lang="ru">Hamid, L.A. Quality based Speaker Verification Systems using Fuzzy Inference Fusion Scheme / L.A. Hamid, D.A. Ramli // Proc. of the Intern. Conf. on Communications, Signal Processing and Computers. – Interlaken, Switzerland, 2014. – P. 96–103.</mixed-citation><mixed-citation xml:lang="en">Hamid, L.A. Quality based Speaker Verification Systems using Fuzzy Inference Fusion Scheme / L.A. Hamid, D.A. Ramli // Proc. of the Intern. Conf. on Communications, Signal Processing and Computers. – Interlaken, Switzerland, 2014. – P. 96–103.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
