Mitrofanov A., Prisyach T., Timofeeva T., Novoselov S., Korenevsky M., Khokhlov Y., Akulov A., Anikin A., Khalili R., Lezhenin I., Melnikov A., Miroshnichenko D., Mamaev N., Odegov I., Rudnitskaya O., Romanenko A. Accurate speaker counting, diarization and separation for advanced recognition of multichannel multispeaker conversations. Computer Speech and Language. 2025. Vol. 92. pp. 101780.. doi: 10.1016/j.csl.2025.101780
Mitrofanov A., Prisyach T., Timofeeva T., Novoselov S., Korenevsky M., Khokhlov Y., Akulov A., Anikin A., Khalili R., Lezhenin I., Melnikov A., Miroshnichenko D., Mamaev N., Odegov I., Rudnitskaya O., Romanenko A. STCON System for the CHiME-8 Challenge. 8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024). 2024. pp. 13-17.. doi: 10.21437/CHiME.2024-3
Khokhlov Y., Prisyach T., Mitrofanov A., Dutov D., Agafonov I., Timofeeva T., Romanenko A., Korenevsky M. Classification of Room Impulse Responses and its application for channel verification and diarization. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2024. pp. 274--278.. doi: 10.21437/interspeech.2024-2081
UCONV-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition
Andrusenko A., Nasretdinov R., Romanenko A. UCONV-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023. pp. 1-5.. doi: 10.1109/ICASSP49357.2023.10095430
Prisyach T., Khokhlov Y., Korenevsky M., Mitrofanov A., Timofeeva T., Odegov I., Nasretdinov R., Lezhenin I., Miroshnichenko D., Karelin A., Mitrofanova M., Svechnikov R., Novoselov S., Romanenko A. STCON System for the CHiME-7 Challenge. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). 2023. pp. 87-92.. doi: 10.21437/CHiME.2023-17
Andrusenko A., Romanenko A. Improving out of vocabulary words recognition accuracy for an end-to-end Russian speech recognition system. Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics]. 2022. Vol. 22. No. 6(142). pp. 1143-1149.. doi: 10.17586/2226-1494-2022-22-6-1143-1149
Mitrofanov A., korenevskaya M., Podluzhny I., Khokhlov Y., Laptev A., Andrusenko A., Ilin A., Korenevsky M., Medennikov I., Romanenko A. LT-LM: A Novel Non-Autoregressive Language Model for Single-Shot Lattice Rescoring. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2021. Vol. 3. pp. 2053-2057.. doi: 10.21437/Interspeech.2021-1716
Medennikov I., Korenevsky M., Prisyach T., Khokhlov Y., Korenevskaya M., Sorokin I., Timofeeva T., Mitrofanov A., Andrusenko A., Laptev A., Romanenko A. The STC System for the CHiME-6 Challenge. 6th International Workshop on Speech Processing in Everyday Environments (CHiME 2020). 2020. pp. 36-41.. doi: 10.21437/CHiME.2020-9
Medennikov I., Korenevsky M., Prisyach T., Khokhlov Y., Korenevskaya M., Sorokin I., Timofeeva T.N., Mitrofanov A., Andrusenko A., Podluzhny I., Laptev A., Romanenko A. Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2020. pp. 274-278.. doi: 10.21437/Interspeech.2020-1602
Khokhlov Y.-., Zatvornitskiy A., Medennikov I., Sorokin I., Prisyach T., Romanenko A., Mitrofanov A., Bataev V., Andrusenko A.I., Korenevskaya M., Petrov O. R-vectors: New Technique for Adaptation to Room Acoustics. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 1243-1247.. doi: 10.21437/Interspeech.2019-2645
Medennikov I., Khokhlov Y., Romanenko A., Sorokin I., Mitrofanov A., Bataev V., Andrusenko A.I., Korenevskaya M., Petrov O., Zatvornitskiy A. The STC ASR System for the VOiCES from a Distance Challenge 2019. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 2453-2457.. doi: 10.21437/Interspeech.2019-1574
Романенко А.Н., Матвеев Ю.Н., Минкер В. Перенос знаний в задаче автоматического распознавания русской речи в телефонных переговорах. Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics]. 2018. Т. 18. № 2(114). С. 236-242.. doi: 10.17586/2226-1494-2018-18-2-236-242
Medennikov I., Romanenko A., Сорокин И., Popov D., Хохлов Ю., Присяч Т.Н., Мальковский Н., Батаев В., Astapov S., Korenevsky M., Zatvornitskiy A. The STC System for the CHiME 2018 Challenge. CHiME 2018 Workshop on Speech Processing in Everyday Environments. 2018. pp. 1-5.. doi: 10.21437/CHiME.2018-1
Романенко А.Н. Объединение признаков в задаче обучения нейросетевых акустических моделей. Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics]. 2018. Т. 18. № 2(114). С. 350–352.. doi: 10.17586/2226-1494-2018-18-2-350-352
Medennikov I., Khokhlov Y., Romanenko A., Popov D., Tomashenko N., Sorokin I., Zatvornitskiy A. An investigation of mixup training strategies for acoustic models in ASR. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2018. pp. 2903-2907.. doi: 10.21437/Interspeech.2018-2191
Khokhlov Y.-., Medennikov I., Romanenko A., Mendelev V., Korenevsky M., Prudnikov A., Tomashenko N., Zatvornitsky A. The STC Keyword Search System For OpenKWS 2016 Evaluation. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2017. pp. 3602-3606.. doi: 10.21437/Interspeech.2017-1212
Medennikov I., Romanenko A., Prudnikov A., Mendelev V., Khokhlov Y.Y., Korenevsky M., Tomashenko N., Zatvornitskiy A. Acoustic Modeling In The STC Keyword Search System For OpenKWS 2016 Evaluation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 76-86.. doi: 10.1007/978-3-319-66429-3_7
Романенко А.Н. Использование фрагментов слов для повышения качества поиска токенов, не содержащихся в словаре. Альманах научных работ молодых ученых Университета ИТМО. 2017. Т. 3. С. 161-163.
Khokhlov Y.-., Tomashenko N., Medennikov I., Romanenko A. Fast and Accurate OOV Decoder on High-Level Features. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2017. pp. 2884-2888.. doi: 10.21437/Interspeech.2017-1367
Повышение качества поиска токенов, не содержащихся в словаре распознавания
Romanenko A., Mendelev V. Speaker-dependent bottleneck features for Egyptian Arabic speech recognition. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 620-626.. doi: 10.1007/978-3-319-43958-7_75
Романенко А.Н. Разработка системы автоматического распознавания речи для египетского диалекта арабского языка в телефонном канале. Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics]. 2016. Т. 16. № 4(104). С. 703-709.. doi: 10.17586/2226-1494-2016-16-4-703-709
Korenevsky M., Romanenko A. Feature space VTS with phase term modeling. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 312-320.. doi: 10.1007/978-3-319-43958-7_37
РАСПОЗНАВАНИЕ СПОНТАННОЙ АРАБСКОЙ РЕЧИ В ТЕЛЕФОННОМ КАНАЛЕ
Романенко А.Н. Исследование смеси обучающих речевых корпусов в задаче распознавания спонтанной речи. Альманах научных работ молодых ученых Университета ИТМО. 2015. Т. 3. С. 54-56.
Использование упрощенного алгоритма рандомизированной стохастической аппроксимации для оптимизации параметров декодера в задаче распознавания речи
Merkin N., Medennikov I.P., Romanenko A.N., Zatvornitskiy A. Controlling the uncertainty area in the real time LVCSR application. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2014. Vol. 8773. No. LNAI. pp. 153–160.. doi: 10.1007/978-3-319-11581-8_19
Romanenko A.N., Zatvornitsky A., Medennikov I.P. Simplified Simultaneous Perturbation Stochastic Approximation for the optimization of free decoding parameters. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2014. Vol. 8773. No. LNAI. pp. 402-409.. doi: 10.1007/978-3-319-11581-8_50
Zatvornitskiy A., Romanenko A.N., Korenevsky M. Proportional-Integral-Derivative Control of Automatic Speech Recognition Speed. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2014. Vol. 8773. No. LNAI. pp. 360–367.. doi: 10.1007/978-3-319-11581-8_45