<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">inform</journal-id><journal-title-group><journal-title xml:lang="ru">Информатика</journal-title><trans-title-group xml:lang="en"><trans-title>Informatics</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1816-0301</issn><issn pub-type="epub">2617-6963</issn><publisher><publisher-name>UIIP NASB</publisher-name></publisher></journal-meta><article-meta><article-id custom-type="elpub" pub-id-type="custom">inform-152</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ОБРАБОТКА СИГНАЛОВ, ИЗОБРАЖЕНИЙ, РЕЧИ, ТЕКСТА И РАСПОЗНАВАНИЕ ОБРАЗОВ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>SIGNAL, IMAGE, SPEECH, TEXT PROCESSING AND PATTERN RECOGNITION</subject></subj-group></article-categories><title-group><article-title>АДАПТАЦИЯ СКРЫТЫХ МАРКОВСКИХ МОДЕЛЕЙ К РАСПОЗНАВАНИЮ ЭМОЦИОНАЛЬНО ОКРАШЕННОЙ РЕЧИ</article-title><trans-title-group xml:lang="en"><trans-title>ADAPTIVE LEARNING OF HIDDEN MARKOV MODELS FOR EMOTIONAL SPEECH</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Ткаченя</surname><given-names>А. В.</given-names></name><name name-style="western" xml:lang="en"><surname>Tkachenia</surname><given-names>A. V.</given-names></name></name-alternatives><email xlink:type="simple">tkachenia@gmail.com</email></contrib></contrib-group><pub-date pub-type="collection"><year>2014</year></pub-date><pub-date pub-type="epub"><day>06</day><month>10</month><year>2016</year></pub-date><volume>0</volume><issue>3</issue><fpage>21</fpage><lpage>27</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Ткаченя А.В., 2016</copyright-statement><copyright-year>2016</copyright-year><copyright-holder xml:lang="ru">Ткаченя А.В.</copyright-holder><copyright-holder xml:lang="en">Tkachenia A.V.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://inf.grid.by/jour/article/view/152">https://inf.grid.by/jour/article/view/152</self-uri><abstract><p>Рассматривается алгоритм интерактивной неконтролируемой оценки параметров скрытых марковских моделей (СММ). Решается задача адаптации СММ к эмоционально окрашенной речи. Для увеличения достоверности уточненных параметров СММ предлагается механизм забывания и обновления. Приводятся функциональная блок-схема рассматриваемого алгоритма адаптации СММ, а также полученные результаты улучшения эффективности распознавания эмоциональной речи.</p></abstract><trans-abstract xml:lang="en"><p>An on-line unsupervised algorithm for estimating the hidden Markov models (HMM) parame-ters is presented. The problem of hidden Markov models adaptation to emotional speech is solved. To increase the reliability of estimated HMM parameters, a mechanism of forgetting and updating is proposed. A functional block diagram of the hidden Markov models adaptation algorithm is also provided with obtained results, which improve the efficiency of emotional speech recognition.</p></trans-abstract></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Baum, L.E. An inequality and associated maximization techniques in statistical estimation for probabilistic functions of Markov processes / L.E. Baum // Inequalities. – 1972. – № 3. – P. 1–8.</mixed-citation><mixed-citation xml:lang="en">Baum, L.E. An inequality and associated maximization techniques in statistical estimation for probabilistic functions of Markov processes / L.E. Baum // Inequalities. – 1972. – № 3. – P. 1–8.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">A maximization technique occurring in the statistical analysis of probabilistic function of Markov chains / L.E. Baum [et al.]// Ann. Math. Stat. – 1970. – № 41. – P. 164–171.</mixed-citation><mixed-citation xml:lang="en">A maximization technique occurring in the statistical analysis of probabilistic function of Markov chains / L.E. Baum [et al.]// Ann. Math. Stat. – 1970. – № 41. – P. 164–171.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Juang, B.-H. Maximum likelihood estimation for multivariate mixture observations of Mar-kov chains / B.-H. Juang, S.E. Levinson, M.M. Sondhi // IEEE Trans. Inform. Theory. – 1993. – № 2. – P. 307–309.</mixed-citation><mixed-citation xml:lang="en">Juang, B.-H. Maximum likelihood estimation for multivariate mixture observations of Mar-kov chains / B.-H. Juang, S.E. Levinson, M.M. Sondhi // IEEE Trans. Inform. Theory. – 1993. – № 2. – P. 307–309.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Liporace, L.R. Maximum likelihood estimation for multivariate observations of Markov sources / L.R. Liporace // IEEE Trans. Inform. Theory. – 1995. - № 28. – P. 729–734.</mixed-citation><mixed-citation xml:lang="en">Liporace, L.R. Maximum likelihood estimation for multivariate observations of Markov sources / L.R. Liporace // IEEE Trans. Inform. Theory. – 1995. - № 28. – P. 729–734.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Gauvain, J.-L. Maximum a posteriori estimation for multivariate Gaussian mixture observa-tions of Markov chains / J.-L. Gauvain, C.-H. Lee // IEEE Trans. Speech Audio Processing. – 1994. – № 2. – P. 291–298.</mixed-citation><mixed-citation xml:lang="en">Gauvain, J.-L. Maximum a posteriori estimation for multivariate Gaussian mixture observa-tions of Markov chains / J.-L. Gauvain, C.-H. Lee // IEEE Trans. Speech Audio Processing. – 1994. – № 2. – P. 291–298.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Huo, Q. Bayesian adaptive learning of the parameters of hidden Markov model for speech recog-nition / Q. Huo, C. Chan, C.-H. Lee // IEEE Trans. Speech Audio Processing. – 1992. – № 5. – P. 334–345.</mixed-citation><mixed-citation xml:lang="en">Huo, Q. Bayesian adaptive learning of the parameters of hidden Markov model for speech recog-nition / Q. Huo, C. Chan, C.-H. Lee // IEEE Trans. Speech Audio Processing. – 1992. – № 5. – P. 334–345.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Lee, C.-H. A study on speaker adaptation of the parameters of continuous density hidden Markov models / C.-H. Lee, C.-H. Lin, B.-H. Juang // IEEE Trans. Signal Processing. – 1991. – № 39. – P. 806–814.</mixed-citation><mixed-citation xml:lang="en">Lee, C.-H. A study on speaker adaptation of the parameters of continuous density hidden Markov models / C.-H. Lee, C.-H. Lin, B.-H. Juang // IEEE Trans. Signal Processing. – 1991. – № 39. – P. 806–814.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Matsuoka, T. A study of on-line Bayesian adaptation for HMM-based speech recognition / T. Matsuoka, C.-H. Lee // Proc. EUROSPEECH-93. – Berlin, Germany, 1993. – P. 815–818.</mixed-citation><mixed-citation xml:lang="en">Matsuoka, T. A study of on-line Bayesian adaptation for HMM-based speech recognition / T. Matsuoka, C.-H. Lee // Proc. EUROSPEECH-93. – Berlin, Germany, 1993. – P. 815–818.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Huo, Q. On-Line Adaptive Learning of the Continuous Density Hidden Markov Model Based on Approximate Recursive Bayes Estimate / Q. Huo, C.-H. Lee // Speech and Audio Processing. – 1997. – № 5. – P. 161–172.</mixed-citation><mixed-citation xml:lang="en">Huo, Q. On-Line Adaptive Learning of the Continuous Density Hidden Markov Model Based on Approximate Recursive Bayes Estimate / Q. Huo, C.-H. Lee // Speech and Audio Processing. – 1997. – № 5. – P. 161–172.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Рылов, А.С. Анализ речи в распознающих системах / А.С. Рылов. – Минск : Бест-принт, 2003. – 264 с.</mixed-citation><mixed-citation xml:lang="en">Рылов, А.С. Анализ речи в распознающих системах / А.С. Рылов. – Минск : Бест-принт, 2003. – 264 с.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Bilmes, J. A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estima-tion for Gaussian Mixture and Hidden Markov Models / J. Bilmes // International Computer Science Institute. – 1998. – № 1. – P. 164–191.</mixed-citation><mixed-citation xml:lang="en">Bilmes, J. A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estima-tion for Gaussian Mixture and Hidden Markov Models / J. Bilmes // International Computer Science Institute. – 1998. – № 1. – P. 164–191.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">The HTK Book (for HTK v. 3.4) / S. Young [et al.]. – Cambridge University Engineering Department, 2006. – 359 p.</mixed-citation><mixed-citation xml:lang="en">The HTK Book (for HTK v. 3.4) / S. Young [et al.]. – Cambridge University Engineering Department, 2006. – 359 p.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Krishnamurthy, V. On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure / V. Krishnamurthy, J.B. Moore // IEEE Trans. Signal Processing. – 1993. – № 41 (8). – P. 2557–2573.</mixed-citation><mixed-citation xml:lang="en">Krishnamurthy, V. On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure / V. Krishnamurthy, J.B. Moore // IEEE Trans. Signal Processing. – 1993. – № 41 (8). – P. 2557–2573.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Weinstein, E. Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure / E. Weinstein, M. Feder, A.V. Oppenheim // IEEE Trans. Acoust, Speech, Signal Processing. – 1990. – № 38 (9). – P. 1652–1654.</mixed-citation><mixed-citation xml:lang="en">Weinstein, E. Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure / E. Weinstein, M. Feder, A.V. Oppenheim // IEEE Trans. Acoust, Speech, Signal Processing. – 1990. – № 38 (9). – P. 1652–1654.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">MULTEXT-J. Japanese MULTEXT Prosodic Corpus [Electronic resource]. – Mode of access : http://research.nii.ac.jp/src/en/MULTEXT-J.html. – Date of access : 30.09.2013.</mixed-citation><mixed-citation xml:lang="en">MULTEXT-J. Japanese MULTEXT Prosodic Corpus [Electronic resource]. – Mode of access : http://research.nii.ac.jp/src/en/MULTEXT-J.html. – Date of access : 30.09.2013.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Bou-Ghazale, S.E. A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress / S.E. Bou-Ghazale, J.H.L. Hansen // Speech and Audio Processing. – 2000. – № 8. – P. 429–442.</mixed-citation><mixed-citation xml:lang="en">Bou-Ghazale, S.E. A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress / S.E. Bou-Ghazale, J.H.L. Hansen // Speech and Audio Processing. – 2000. – № 8. – P. 429–442.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">K-fold cross-validation. Wikipedia [Electronic resource]. – Mode of access : http://en.wikipedia.org/wiki/Cross-validation_%28statistics%29. – Date of access : 18.08.2014.</mixed-citation><mixed-citation xml:lang="en">K-fold cross-validation. Wikipedia [Electronic resource]. – Mode of access : http://en.wikipedia.org/wiki/Cross-validation_%28statistics%29. – Date of access : 18.08.2014.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
