<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">inform</journal-id><journal-title-group><journal-title xml:lang="ru">Информатика</journal-title><trans-title-group xml:lang="en"><trans-title>Informatics</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1816-0301</issn><issn pub-type="epub">2617-6963</issn><publisher><publisher-name>UIIP NASB</publisher-name></publisher></journal-meta><article-meta><article-id custom-type="elpub" pub-id-type="custom">inform-502</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ОБРАБОТКА СИГНАЛОВ, ИЗОБРАЖЕНИЙ, РЕЧИ, ТЕКСТА И РАСПОЗНАВАНИЕ ОБРАЗОВ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>SIGNAL, IMAGE, SPEECH, TEXT PROCESSING AND PATTERN RECOGNITION</subject></subj-group></article-categories><title-group><article-title>ВЕКТОРНО-ПАРАМЕТРИЧЕСКОЕ  НИЗКОСКОРОСТНОЕ СЖАТИЕ РЕЧЕВЫХ СИГНАЛОВ  НА ОСНОВЕ СУПЕРКАДРОВ С ПЕРЕМЕННОЙ СТРУКТУРОЙ</article-title><trans-title-group xml:lang="en"><trans-title></trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Борискевич</surname><given-names>А. А.</given-names></name></name-alternatives><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Рак</surname><given-names>А. О.</given-names></name></name-alternatives><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff xml:lang="ru" id="aff-1"><institution>Белорусский государственный университет информатики и радиоэлектроники</institution><country>Belarus</country></aff><pub-date pub-type="collection"><year>2009</year></pub-date><pub-date pub-type="epub"><day>16</day><month>10</month><year>2018</year></pub-date><volume>1</volume><issue>2(22)</issue><fpage>57</fpage><lpage>70</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Борискевич А.А., Рак А.О., 2018</copyright-statement><copyright-year>2018</copyright-year><copyright-holder xml:lang="ru">Борискевич А.А., Рак А.О.</copyright-holder><copyright-holder xml:lang="en">Борискевич А.А., Рак А.О.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://inf.grid.by/jour/article/view/502">https://inf.grid.by/jour/article/view/502</self-uri><abstract><p>Разрабатывается алгоритм векторно-параметрического низкоскоростного сжатия речи, основанный на использовании параметрической модели синтеза речевого сигнала с линейным предсказанием, суперкадров с переменной структурой, векторного квантования параметров суперкадра (коэффицента усиления, периода основного тона и LSF(line spectrum frequency)-коэффициентов) и интерполяции LSF-кадров. Даются рекомендации по выбору структуры суперкадра в зависимости от типа передаваемых параметров модели речевого сигнала. Осуществляется программная реализация алгоритма низкоскоростного параметрического сжатия речи в среде моделирования Matlab. Показывается, что разборчивость речи сохраняется при битовых скоростях 300–800 бит/с. Устанавливается, что увеличение битовой скорости обычно не приводит к значительному улучшению качества звучания из-за ограничений, накладываемых выбранной моделью речеобразования.</p></abstract></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Максимов, М.И. Проектирование низкоскоростных речепреобразующих устройств для каналов с высоким процентом ошибок / М.И. Максимов, Н.А. Сидорова, О.В. Чернояров // Электросвязь. – 2008. – № 7. – С. 48–49.</mixed-citation><mixed-citation xml:lang="en">Максимов, М.И. Проектирование низкоскоростных речепреобразующих устройств для каналов с высоким процентом ошибок / М.И. Максимов, Н.А. Сидорова, О.В. Чернояров // Электросвязь. – 2008. – № 7. – С. 48–49.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">MELP: The new federal standard at 2400 bits/s / L.M. Supplee [et al.] // IEEE International Conference on Acoustics, Speech, and Signal Processing. – Munich, 1997. – P. 1591–1594.</mixed-citation><mixed-citation xml:lang="en">MELP: The new federal standard at 2400 bits/s / L.M. Supplee [et al.] // IEEE International Conference on Acoustics, Speech, and Signal Processing. – Munich, 1997. – P. 1591–1594.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Compandent's MELPe-Enhanced Mixed-Excitation Linear Predictive Vocoder [Electronic resource]. – Mode of access : http://www.compandent.com/products_melpe.htm. – Date of access : 03.03.2009.</mixed-citation><mixed-citation xml:lang="en">Compandent's MELPe-Enhanced Mixed-Excitation Linear Predictive Vocoder [Electronic resource]. – Mode of access : http://www.compandent.com/products_melpe.htm. – Date of access : 03.03.2009.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Chamberlain, M. A 600 bps MELP vocoder for use on HF channels / M. Chamberlain // IEEE Military Communications Conference, MILCOM-2001, Communications for Network-Centric Operations: Creating the Information Force. – USA, 2001. – Vol. 1. – P. 447–453.</mixed-citation><mixed-citation xml:lang="en">Chamberlain, M. A 600 bps MELP vocoder for use on HF channels / M. Chamberlain // IEEE Military Communications Conference, MILCOM-2001, Communications for Network-Centric Operations: Creating the Information Force. – USA, 2001. – Vol. 1. – P. 447–453.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">New NATO STANAG narrow band voice coder at 600 bit/s / G. Guilmin [et al.] // IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-2006. – France Toulouse, 2006. – Vol. 1. – P. 689–692.</mixed-citation><mixed-citation xml:lang="en">New NATO STANAG narrow band voice coder at 600 bit/s / G. Guilmin [et al.] // IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-2006. – France Toulouse, 2006. – Vol. 1. – P. 689–692.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Wang, T. A 1200/2400 bps coding suite based on MELP / T. Wang, K. Koishida, V. Cuper¬man // Proc. of IEEE Workshop on Speech Coding. – Tsukuba, Japan, 2002. – Vol. 1. – P. 122–126.</mixed-citation><mixed-citation xml:lang="en">Wang, T. A 1200/2400 bps coding suite based on MELP / T. Wang, K. Koishida, V. Cuper¬man // Proc. of IEEE Workshop on Speech Coding. – Tsukuba, Japan, 2002. – Vol. 1. – P. 122–126.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Padellini, M. Very low bit rate (VLBR) speech coding around 500 bit/sec / M. Padellini, F. Capman, G. Baudoin // 12th European Signal Processing Conference (EUSIPCO 2004). – Vienna, Austria, 2004. – P. 1669–1672.</mixed-citation><mixed-citation xml:lang="en">Padellini, M. Very low bit rate (VLBR) speech coding around 500 bit/sec / M. Padellini, F. Capman, G. Baudoin // 12th European Signal Processing Conference (EUSIPCO 2004). – Vienna, Austria, 2004. – P. 1669–1672.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">DARPA ASE. Program [Electronic resource]. – Mode of access : http://www.darpa.mil/ ato/solicit/ASE/index.htm. – Date of access : 03.03.2009.</mixed-citation><mixed-citation xml:lang="en">DARPA ASE. Program [Electronic resource]. – Mode of access : http://www.darpa.mil/ ato/solicit/ASE/index.htm. – Date of access : 03.03.2009.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Kritzinger, C. Low Bit Rate Speech Coding [Electronic resource]. – Mode of access : etd.sun.ac.za/jspui/bitstream/10019/89/1/KritzC.pdf. – Date of access : 03.03.2009.</mixed-citation><mixed-citation xml:lang="en">Kritzinger, C. Low Bit Rate Speech Coding [Electronic resource]. – Mode of access : etd.sun.ac.za/jspui/bitstream/10019/89/1/KritzC.pdf. – Date of access : 03.03.2009.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Попов, О.Б. Цифровая обработка сигналов в трактах звукового вещания / О.Б. Попов, С.Г. Рихтер. – М. : Горячая линия – Телеком, 2007. – 341 с.</mixed-citation><mixed-citation xml:lang="en">Попов, О.Б. Цифровая обработка сигналов в трактах звукового вещания / О.Б. Попов, С.Г. Рихтер. – М. : Горячая линия – Телеком, 2007. – 341 с.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Фант, Г. Акустистическая теория речеобразования / Г. Фант; пер. с англ. Л.А. Варшавского, В.И. Медведева ; под ред. В.С. Григорьева. – М. : Наука, 1964. – 284 с.</mixed-citation><mixed-citation xml:lang="en">Фант, Г. Акустистическая теория речеобразования / Г. Фант; пер. с англ. Л.А. Варшавского, В.И. Медведева ; под ред. В.С. Григорьева. – М. : Наука, 1964. – 284 с.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Маркел, Дж.Д. Линейное предсказание речи / Дж.Д. Маркел, А.Х. Грэй ; пер с англ. ; под ред. Ю.Н. Прохорова, В.С. Звездина. – М. : Связь, 1980. – 308 с.</mixed-citation><mixed-citation xml:lang="en">Маркел, Дж.Д. Линейное предсказание речи / Дж.Д. Маркел, А.Х. Грэй ; пер с англ. ; под ред. Ю.Н. Прохорова, В.С. Звездина. – М. : Связь, 1980. – 308 с.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Kabal, P. The computation of Line Spectral Frequencies Using Chebyshev Polynomials / P. Kabal, R.P. Ramachanandran // IEEE Trans. Acoustics, Speech, Signal Processing. – 1986. – Vol. 34, № 6. – P. 1419–1426.</mixed-citation><mixed-citation xml:lang="en">Kabal, P. The computation of Line Spectral Frequencies Using Chebyshev Polynomials / P. Kabal, R.P. Ramachanandran // IEEE Trans. Acoustics, Speech, Signal Processing. – 1986. – Vol. 34, № 6. – P. 1419–1426.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Рабинер, Л.Р. Цифровая обработка речевых сигналов / Л.Р. Рабинер, Р.В. Шафер. – М. : Радио и связь, 1981. – 496 с.</mixed-citation><mixed-citation xml:lang="en">Рабинер, Л.Р. Цифровая обработка речевых сигналов / Л.Р. Рабинер, Р.В. Шафер. – М. : Радио и связь, 1981. – 496 с.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Марпл-мл., С.Л. Цифровой спектральный анализ и его приложения / С.Л. Марпл-мл. ; пер. c англ. – М. : Мир, 1990. – 584 с.</mixed-citation><mixed-citation xml:lang="en">Марпл-мл., С.Л. Цифровой спектральный анализ и его приложения / С.Л. Марпл-мл. ; пер. c англ. – М. : Мир, 1990. – 584 с.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Linde, Y. An Algorithm for Vector Quantizer Design / Y. Linde, A. Buzo, R. Gray // IEEE Transactions on Communications. – 1980. – Vol. 28, № 1. – P. 84–94.</mixed-citation><mixed-citation xml:lang="en">Linde, Y. An Algorithm for Vector Quantizer Design / Y. Linde, A. Buzo, R. Gray // IEEE Transactions on Communications. – 1980. – Vol. 28, № 1. – P. 84–94.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Real time vector quantization of LSP parameters / B. Kovesi [et al.] // Speech communication. – 1999. – Vol. 29, № 1. – P. 39–47.</mixed-citation><mixed-citation xml:lang="en">Real time vector quantization of LSP parameters / B. Kovesi [et al.] // Speech communication. – 1999. – Vol. 29, № 1. – P. 39–47.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Paliwal, K.K. Quantization of LPC Parameters / K.K. Paliwal, B.S. Atal [Electronic re-source]. – Mode of access : maxwell.me.gu.edu.au/spl/publications/papers/book_sc_kkp.pdf. – Date of access : 03.03.2009.</mixed-citation><mixed-citation xml:lang="en">Paliwal, K.K. Quantization of LPC Parameters / K.K. Paliwal, B.S. Atal [Electronic re-source]. – Mode of access : maxwell.me.gu.edu.au/spl/publications/papers/book_sc_kkp.pdf. – Date of access : 03.03.2009.</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Paliwal, K.K. Efficient vector quantization of LPC parameters at 24 bits/frame [Electronic resource]. – Mode of access : max-well.me.gu.edu.au/spl/publications/papers/icassp91_kkp_lpc.pdf. – Date of access : 03.03.2009.</mixed-citation><mixed-citation xml:lang="en">Paliwal, K.K. Efficient vector quantization of LPC parameters at 24 bits/frame [Electronic resource]. – Mode of access : max-well.me.gu.edu.au/spl/publications/papers/icassp91_kkp_lpc.pdf. – Date of access : 03.03.2009.</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Hansen, J.H.L. An effective quality evaluation protocol for speech enhancement algorithms / J.H.L. Hansen, B.L. Pellom [Electronic resource]. – Mode of access : http://citeseerx.ist.psu.edu/viewdoc/ summary?doi=10.1.1.44.9149. – Date of access : 03.03.2009.</mixed-citation><mixed-citation xml:lang="en">Hansen, J.H.L. An effective quality evaluation protocol for speech enhancement algorithms / J.H.L. Hansen, B.L. Pellom [Electronic resource]. – Mode of access : http://citeseerx.ist.psu.edu/viewdoc/ summary?doi=10.1.1.44.9149. – Date of access : 03.03.2009.</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Zwicker, E. Psychoacoustics, Facts and Models / E. Zwicker, H. Fast. – N.Y. : Springer-Verlag, 1990. – 354 p.</mixed-citation><mixed-citation xml:lang="en">Zwicker, E. Psychoacoustics, Facts and Models / E. Zwicker, H. Fast. – N.Y. : Springer-Verlag, 1990. – 354 p.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
