<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">inform</journal-id><journal-title-group><journal-title xml:lang="ru">Информатика</journal-title><trans-title-group xml:lang="en"><trans-title>Informatics</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1816-0301</issn><issn pub-type="epub">2617-6963</issn><publisher><publisher-name>UIIP NASB</publisher-name></publisher></journal-meta><article-meta><article-id custom-type="elpub" pub-id-type="custom">inform-673</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ОБРАБОТКА СИГНАЛОВ, ИЗОБРАЖЕНИЙ, РЕЧИ, ТЕКСТА И РАСПОЗНАВАНИЕ ОБРАЗОВ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>SIGNAL, IMAGE, SPEECH, TEXT PROCESSING AND PATTERN RECOGNITION</subject></subj-group></article-categories><title-group><article-title>АЛГОРИТМ СЕГМЕНТАЦИИ РЕЧИ НА ОСНОВЕ МЕТОДА ДИНАМИЧЕСКОГО ПРОГРАММИРОВАНИЯ</article-title><trans-title-group xml:lang="en"><trans-title></trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Давыдов</surname><given-names>А. Г.</given-names></name></name-alternatives><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff xml:lang="ru" id="aff-1"><institution>Объединенный институт проблем информатики НАН Беларуси</institution><country>Belarus</country></aff><pub-date pub-type="collection"><year>2006</year></pub-date><pub-date pub-type="epub"><day>06</day><month>12</month><year>2018</year></pub-date><volume>0</volume><issue>1(9)</issue><fpage>47</fpage><lpage>57</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Давыдов А.Г., 2018</copyright-statement><copyright-year>2018</copyright-year><copyright-holder xml:lang="ru">Давыдов А.Г.</copyright-holder><copyright-holder xml:lang="en">Давыдов А.Г.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://inf.grid.by/jour/article/view/673">https://inf.grid.by/jour/article/view/673</self-uri><abstract><p>Рассматривается система автоматической сегментации речи на основе динамического программирования. В качестве вектора признаков предлагается использовать спектр и усредненные конечные разности спектра по времени. Определяются оптимальные параметры работы системы на тестовом множестве из 1128 элементов.</p></abstract></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Lobanov B.M, Karnevskaya E.B. Phonetics and its Applications. – Stuttgart: Franz Steiner Verlag, 2002. – P. 445–452.</mixed-citation><mixed-citation xml:lang="en">Lobanov B.M, Karnevskaya E.B. Phonetics and its Applications. – Stuttgart: Franz Steiner Verlag, 2002. – P. 445–452.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Система экспресс-идентификации голоса личности методом клонирования акустических характеристик речи /А.Г. Давыдов, В.В. Киселев, Б.М. Лобанов, Л.И. Цирульник // Тез. докл. Междунар. конф. «Теория и практика речевой коммуникации». – М., 2004. – С. 23–28.</mixed-citation><mixed-citation xml:lang="en">Система экспресс-идентификации голоса личности методом клонирования акустических характеристик речи /А.Г. Давыдов, В.В. Киселев, Б.М. Лобанов, Л.И. Цирульник // Тез. докл. Междунар. конф. «Теория и практика речевой коммуникации». – М., 2004. – С. 23–28.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Malfrere F., Dutoit T. High quality speech synthesis for phonetic speech segmentation // Proc. of Eurospeech’97. – Rhodes, Greece, 1997. – Р. 2631–2634.</mixed-citation><mixed-citation xml:lang="en">Malfrere F., Dutoit T. High quality speech synthesis for phonetic speech segmentation // Proc. of Eurospeech’97. – Rhodes, Greece, 1997. – Р. 2631–2634.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Система сегментации речевого сигнала методом анализа через синтез / А.Г. Давыдов, В.В. Киселев, Б.М. Лобанов, Л.И. Цирульник // Известия Белорусской инженерной академии. – № 1 (17)/1’. – 2004. – С. 112–115.</mixed-citation><mixed-citation xml:lang="en">Система сегментации речевого сигнала методом анализа через синтез / А.Г. Давыдов, В.В. Киселев, Б.М. Лобанов, Л.И. Цирульник // Известия Белорусской инженерной академии. – № 1 (17)/1’. – 2004. – С. 112–115.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Sethy A., Narayanan S. Refined speech segmentation for concatenative speech synthesis // Proc. of ICSLP 2002 – INTERSPEECH 2002. – Denver, USA, 2002. – Р. 149–152.</mixed-citation><mixed-citation xml:lang="en">Sethy A., Narayanan S. Refined speech segmentation for concatenative speech synthesis // Proc. of ICSLP 2002 – INTERSPEECH 2002. – Denver, USA, 2002. – Р. 149–152.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Лобанов Б.М. Синтез речи по тексту // Четвертая Междунар. летняя школа-семинар по искусственному интеллекту: сб. науч. тр. – Мн.: Изд-во БГУ, 2000. – С. 57–76.</mixed-citation><mixed-citation xml:lang="en">Лобанов Б.М. Синтез речи по тексту // Четвертая Междунар. летняя школа-семинар по искусственному интеллекту: сб. науч. тр. – Мн.: Изд-во БГУ, 2000. – С. 57–76.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Development of an emotional speech synthesizer in Spamish / J.M. Montero, J. Guiterrez-Arriola, J. Colas et al. // Proc. of Eurospeech’99. – Budapest, Hungary, 1999. – P. 2099–2102.</mixed-citation><mixed-citation xml:lang="en">Development of an emotional speech synthesizer in Spamish / J.M. Montero, J. Guiterrez-Arriola, J. Colas et al. // Proc. of Eurospeech’99. – Budapest, Hungary, 1999. – P. 2099–2102.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Aravoice: An Arabic Text-to-Speech system / Z. Zemirli, R.A. Obrecht, A. Henni, M. Sellami // Proc. of SPECOM’2003. – Moskow, Russia, 2003. – P. 170–177.</mixed-citation><mixed-citation xml:lang="en">Aravoice: An Arabic Text-to-Speech system / Z. Zemirli, R.A. Obrecht, A. Henni, M. Sellami // Proc. of SPECOM’2003. – Moskow, Russia, 2003. – P. 170–177.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Сорокин В.Н., Цыплухин А.И. Сегментация и распознавание гласных // Информационные процессы. – 2004. – Т. 4. – № 2. – С. 202–220.</mixed-citation><mixed-citation xml:lang="en">Сорокин В.Н., Цыплухин А.И. Сегментация и распознавание гласных // Информационные процессы. – 2004. – Т. 4. – № 2. – С. 202–220.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Zwicker E., Flottorp G., Stevens S.S. Critical bandwidth in loudness summation // J. Acoust. Soc. Am. – № 29. – 1957. – Р. 548–557.</mixed-citation><mixed-citation xml:lang="en">Zwicker E., Flottorp G., Stevens S.S. Critical bandwidth in loudness summation // J. Acoust. Soc. Am. – № 29. – 1957. – Р. 548–557.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Hermansky H., Morgan N. RASTA processing of speech // IEEE Trans. on Speech and Audio Proc. – 1994. – Vol. 2. – № 4. – Р. 578–589.</mixed-citation><mixed-citation xml:lang="en">Hermansky H., Morgan N. RASTA processing of speech // IEEE Trans. on Speech and Audio Proc. – 1994. – Vol. 2. – № 4. – Р. 578–589.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">A Low-Power, Fixed-Point, Front-End Feature Extraction for a Distributed Speech Recognition System / B. Delaney, N. Jayant, M. Hans et al. // IEEE International Conference on Acoustic Speech and Signal Processing, May 2002. – Orlando, Florida, 2002.</mixed-citation><mixed-citation xml:lang="en">A Low-Power, Fixed-Point, Front-End Feature Extraction for a Distributed Speech Recognition System / B. Delaney, N. Jayant, M. Hans et al. // IEEE International Conference on Acoustic Speech and Signal Processing, May 2002. – Orlando, Florida, 2002.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Bellman R.E. Dynamic Programming // Princeton University Press. – Princeton, NJ, USA, 1957.</mixed-citation><mixed-citation xml:lang="en">Bellman R.E. Dynamic Programming // Princeton University Press. – Princeton, NJ, USA, 1957.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Лобанов Б.М., Слуцкер Г.С., Тизик А.П. Автоматическое распознавание звукосочетаний в текущем речевом сигнале // Тр. НИИР. – Вып. 4. – М., 1969. – С. 67–75.</mixed-citation><mixed-citation xml:lang="en">Лобанов Б.М., Слуцкер Г.С., Тизик А.П. Автоматическое распознавание звукосочетаний в текущем речевом сигнале // Тр. НИИР. – Вып. 4. – М., 1969. – С. 67–75.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Itakura F. Minimum Prediction Residual Principle Applied to Speech Recognition // IEEE Transactions on Acoustics, Speech and Signal Processing. – Vol. ASSP-23. – 1975. – P. 52–72.</mixed-citation><mixed-citation xml:lang="en">Itakura F. Minimum Prediction Residual Principle Applied to Speech Recognition // IEEE Transactions on Acoustics, Speech and Signal Processing. – Vol. ASSP-23. – 1975. – P. 52–72.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Sakoe H., Chiba S. Dynamic programming algorithm optimization for spoken word recognition // IEEE Transactions on Acoustics, Speech and Signal Processing. – Vol. 26. – 1978. – P. 43–49.</mixed-citation><mixed-citation xml:lang="en">Sakoe H., Chiba S. Dynamic programming algorithm optimization for spoken word recognition // IEEE Transactions on Acoustics, Speech and Signal Processing. – Vol. 26. – 1978. – P. 43–49.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Вентцель Е.С. Исследование операций: задачи, принципы, методология. – М.: Наука, 1988. – 208 с.</mixed-citation><mixed-citation xml:lang="en">Вентцель Е.С. Исследование операций: задачи, принципы, методология. – М.: Наука, 1988. – 208 с.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Salvador S., Chan P. FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space // KDD Workshop on Mining Temporal and Sequential Data, August 22, 2004. – Seattle, Washington, 2004.</mixed-citation><mixed-citation xml:lang="en">Salvador S., Chan P. FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space // KDD Workshop on Mining Temporal and Sequential Data, August 22, 2004. – Seattle, Washington, 2004.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
