<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">inform</journal-id><journal-title-group><journal-title xml:lang="ru">Информатика</journal-title><trans-title-group xml:lang="en"><trans-title>Informatics</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1816-0301</issn><issn pub-type="epub">2617-6963</issn><publisher><publisher-name>UIIP NASB</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.37661/1816-0301-2026-23-1-58-68</article-id><article-id custom-type="elpub" pub-id-type="custom">inform-1386</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ИНФОРМАЦИОННЫЕ ТЕХНОЛОГИИ</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>INFORMATION TECHNOLOGY</subject></subj-group></article-categories><title-group><article-title>Метод повышения пространственного разрешения амбисонического звука на основе разреженного МДКП-представления</article-title><trans-title-group xml:lang="en"><trans-title>Method for spatial resolution enhancement of ambisonic audio based on sparse MDCT representation</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Лихачёв</surname><given-names>Д. С.</given-names></name><name name-style="western" xml:lang="en"><surname>Likhachov</surname><given-names>Denis S.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Лихачёв Денис Сергеевич, кандидат технических наук, доцент кафедры электронных вычислительных средств</p><p>ул. П. Бровки, 6, Минск, 220013</p></bio><bio xml:lang="en"><p>Denis S. Likhachov, Cand. Sci. (Eng.), Assoc. Prof. of Computer Engineering Department</p><p>st. P. Brovki, 6, Minsk, 220013</p></bio><email xlink:type="simple">likhachov@bsuir.by</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Петровский</surname><given-names>Н. А.</given-names></name><name name-style="western" xml:lang="en"><surname>Petrovsky</surname><given-names>Nick A.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Петровский Николай Александрович, кандидат технических наук, доцент кафедры электронных вычислительных средств</p><p>ул. П. Бровки, 6, Минск, 220013</p></bio><bio xml:lang="en"><p>Nick A. Petrovsky, Cand. Sci. (Eng.), Assoc. Prof. of Computer Engineering Department</p><p>st. P. Brovki, 6, Minsk, 220013</p></bio><email xlink:type="simple">nick.petrovsky@bsuir.by</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Азаров</surname><given-names>И. С.</given-names></name><name name-style="western" xml:lang="en"><surname>Azarov</surname><given-names>Elias S.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Азаров Илья Сергеевич, доктор технических наук, профессор, заведующий кафедрой электронных вычислительных средств</p><p>ул. П. Бровки, 6, Минск, 220013</p></bio><bio xml:lang="en"><p>Elias S. Azarov, Dr. Sci. (Eng.), Prof., Head of Computer Engineering Department</p><p>st. P. Brovki, 6, Minsk, 220013</p></bio><email xlink:type="simple">azarov@bsuir.by</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Белорусский государственный университет информатики и радиоэлектроники</institution></aff><aff xml:lang="en"><institution>Belarusian State University of Informatics and Radioelectronics</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2026</year></pub-date><pub-date pub-type="epub"><day>27</day><month>03</month><year>2026</year></pub-date><volume>23</volume><issue>1</issue><fpage>58</fpage><lpage>68</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Лихачёв Д.С., Петровский Н.А., Азаров И.С., 2026</copyright-statement><copyright-year>2026</copyright-year><copyright-holder xml:lang="ru">Лихачёв Д.С., Петровский Н.А., Азаров И.С.</copyright-holder><copyright-holder xml:lang="en">Likhachov D.S., Petrovsky N.A., Azarov E.S.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://inf.grid.by/jour/article/view/1386">https://inf.grid.by/jour/article/view/1386</self-uri><abstract><sec><title>Цели</title><p>Цели. Решается задача разработки метода повышения пространственного разрешения амбисонического звука на основе разреженного МДКП-представления (МДКП – модифицированное дискретное косинусное преобразование), позволяющего улучшить эффективность существующих подходов представления пространственного звука в цифровом виде.</p></sec><sec><title>Методы</title><p>Методы. Предлагаемый подход основан на временно-частотной декомпозиции аудиосигнала с последующим извлечением направленной плоской волны из каждой частотной составляющей. Данный подход развивает основные идеи методов разложения на плоские волны с высоким разрешением (HARPEX) и аудиокодирования с учетом направления прихода звука, используя преимущества вещественной разреженной декомпозиции.</p></sec><sec><title>Результаты</title><p>Результаты. Метод повышения пространственного разрешения амбисонического звука предполагает использование вещественных частотных компонентов, что, в отличие от комплексных, обеспечивает более простую и устойчивую оценку направления прихода звука. Разреженная декомпозиция позволяет реализовать точный и унифицированный подход к описанию звуков различной природы – от транзиентных до тоновых.</p></sec><sec><title>Заключение</title><p>Заключение. Практические результаты подтверждают применимость предложенного метода для обработки аудио с повышением разрешения вплоть до амбисоники седьмого порядка. Основным недостатком метода является его сравнительно с другими методами высокая вычислительная сложность, что, однако, некритично для приложений, которые не требуют обработки в реальном времени.</p></sec></abstract><trans-abstract xml:lang="en"><sec><title>Objectives</title><p>Objectives. The problem of developing a method for spatial resolution enhancement of ambisonic audio which improves the efficiency of existing approaches to digital spatial sound representation is being solved.</p></sec><sec><title>Methods</title><p>Methods. The proposed approach is based on time-frequency decomposition of the audio signal with subsequent extraction of a directional plane wave from each frequency component. The approach develops the core ideas of high-resolution plane wave expansion (HARPEX) and directional audio coding (DirAC) methods, utilizing the advantages of real-valued sparse decomposition.</p></sec><sec><title>Results</title><p>Results. Application of the proposed ambisonic spatial resolution enhancement method involves the use of real-valued frequency components, which, unlike complex ones, provide a simpler and more robust estimation of sound arrival direction. Sparse decomposition enables an accurate and unified approach to describing sounds of various nature – from transient to tonal.</p></sec><sec><title>Conclusion</title><p>Conclusion. Practical results confirm the method's applicability for audio processing with resolution enhancement up to seventh-order ambisonics. The disadvantage of the method is its high computational complexity; however, it is suitable for applications that do not require real-time processing.</p></sec></trans-abstract><kwd-group xml:lang="ru"><kwd>пространственный звук</kwd><kwd>амбисоника</kwd><kwd>апмиксинг</kwd><kwd>пространственное разрешение</kwd><kwd>разреженное представление</kwd><kwd>быстрое преобразование Фурье</kwd><kwd>модифицированное дискретное косинусное преобразование</kwd></kwd-group><kwd-group xml:lang="en"><kwd>spatial audio</kwd><kwd>ambisonics</kwd><kwd>upmixing</kwd><kwd>spatial resolution</kwd><kwd>sparse representation</kwd><kwd>fast Fourier transform</kwd><kwd>modified discrete cosine transform</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Pulkki V. Spatial sound reproduction with directional audio coding. Journal of the Audio Engineering Society, 2007, vol. 55, no. 6, pp. 503–516.</mixed-citation><mixed-citation xml:lang="en">Pulkki V. Spatial sound reproduction with directional audio coding. Journal of the Audio Engineering Society, 2007, vol. 55, no. 6, pp. 503–516.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Berge S., Barrett N. High angular resolution planewave expansion. Proceedings of the 2nd International Symposium on Ambisonics and Spherical Acoustics, Paris, France, 6–7 May 2010. Available at: https://www.harpex.net/harpex.pdf (accessed 20.11.2025).</mixed-citation><mixed-citation xml:lang="en">Berge S., Barrett N. High angular resolution planewave expansion. Proceedings of the 2nd International Symposium on Ambisonics and Spherical Acoustics, Paris, France, 6–7 May 2010. Available at: https://www.harpex.net/harpex.pdf (accessed 20.11.2025).</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Herzog A., Habets E. A. P. Direction and reverberation preserving noise reduction of ambisonics signals. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, vol. 28, pp. 2461–2475. https://doi.org/10.1109/TASLP.2020.3013979.</mixed-citation><mixed-citation xml:lang="en">Herzog A., Habets E. A. P. Direction and reverberation preserving noise reduction of ambisonics signals. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, vol. 28, pp. 2461–2475. https://doi.org/10.1109/TASLP.2020.3013979.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Rafaely B. Fundamentals of Spherical Array Process. Springer, 2015, 204 р. (Springer Topics in Signal Processing).</mixed-citation><mixed-citation xml:lang="en">Rafaely B. Fundamentals of Spherical Array Process. Springer, 2015, 204 р. (Springer Topics in Signal Processing).</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Schwartz O., Gannot S., Habets E. Multi-microphone speech dereverberation and noise reduction using relative early transfer functions. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2015, vol. 23, no. 2, pp. 240–251.</mixed-citation><mixed-citation xml:lang="en">Schwartz O., Gannot S., Habets E. Multi-microphone speech dereverberation and noise reduction using relative early transfer functions. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2015, vol. 23, no. 2, pp. 240–251.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Meyer J., Elko G. W. Spherical microphone arrays for 3D sound recordings. Audio Signal Processing for Next-Generation Multimedia Communication Systems. In Y. Huang, J. Benesty (eds). Norwell, MA, USA, Kluwer Academic, 2004, pp. 67–89.</mixed-citation><mixed-citation xml:lang="en">Meyer J., Elko G. W. Spherical microphone arrays for 3D sound recordings. Audio Signal Processing for Next-Generation Multimedia Communication Systems. In Y. Huang, J. Benesty (eds). Norwell, MA, USA, Kluwer Academic, 2004, pp. 67–89.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">McCormack L., Politis A. SPARTA &amp; COMPASS: Real-time implementations of linear and parametric spatial audio reproduction and processing methods. 2019 AES International Conference on Immersive and Interactive Audio, York, UK, 27–29 March 2019. Available at: https://acris.aalto.fi/ws/portalfiles/portal/34325781/ELEC_McCormick_Sparta_AES.pdf (accessed 20.11.2025).</mixed-citation><mixed-citation xml:lang="en">McCormack L., Politis A. SPARTA &amp; COMPASS: Real-time implementations of linear and parametric spatial audio reproduction and processing methods. 2019 AES International Conference on Immersive and Interactive Audio, York, UK, 27–29 March 2019. Available at: https://acris.aalto.fi/ws/portalfiles/portal/34325781/ELEC_McCormick_Sparta_AES.pdf (accessed 20.11.2025).</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
