Mitrofanov A., Novoselov S., Prisyach T., Marchevskiy V., Karelin A., Khmelev N., Dutov D., Malykh S., Agafonov I., Nikitin A.V., Petrov O. Cryfish: On deep audio analysis with Large Language Models. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2025. pp. in press.
Khmelev N., Anikin A., Zorkina A., Korenevskaya A., Novoselov S., Malykh S., Volokhov V., Marchevskiy V., Volkova M., Lavrentyeva G. Joint Voice Activity Detection and Quality Estimation for Efficient Speech Preprocessing. 2025 27th International Conference on Digital Signal Processing and its Applications (DSPA). 2025. pp. 1-6. doi: 10.1109/DSPA64310.2025.10977856
Khmelev N., Avdeeva A., Novoselov S., Chirkovskiy A., Volkova M. Robust Speaker Recognition for Whispered Speech. 2025 27th International Conference on Digital Signal Processing and its Applications (DSPA). 2025. pp. 1-5. doi: 10.1109/DSPA64310.2025.10977907
Khmelev N., Malykh S., Anikin A., Korenevskaya A., Novoselov S., Volokhov V., Zorkina A., Marchevskiy V., Lavrentyeva G. In Search of Optimal Pretraining Strategy for Robust Speaker Recognition. 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2025. pp. 1-5. doi: 10.1109/ICASSP49660.2025.10889905
Ausev E., Volokhov V., Novoselov S., Marchevskiy V., Shangina E., Logunov A. ITMO language diarization and identification systems for the DISPLACE 2024 challenge. 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2025. pp. 1-5. doi: 10.1109/ICASSP49660.2025.10889582
Novoselov S.A., Korenevskaia A.M., Khmelev N.A., Malykh S.I., Anikin A.A., Zorkina A.A., Volokhov V.A., Marchevskii V.D. STCON NIST SRE24 System: Composite Speaker Recognition Solution for Challenging Scenarios. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2025. pp. in press.
Mitrofanov A., Prisyach T., Timofeeva T., Novoselov S., Korenevsky M., Khokhlov Y., Akulov A., Anikin A., Khalili R., Lezhenin I., Melnikov A., Miroshnichenko D., Mamaev N., Odegov I., Rudnitskaya O., Romanenko A. Accurate speaker counting, diarization and separation for advanced recognition of multichannel multispeaker conversations. Computer Speech and Language. 2025. Vol. 92. pp. 101780. doi: 10.1016/j.csl.2025.101780
Mitrofanov A., Prisyach T., Timofeeva T., Novoselov S., Korenevsky M., Khokhlov Y., Akulov A., Anikin A., Khalili R., Lezhenin I., Melnikov A., Miroshnichenko D., Mamaev N., Odegov I., Rudnitskaya O., Romanenko A. STCON System for the CHiME-8 Challenge. 8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024). 2024. pp. 13-17. doi: 10.21437/CHiME.2024-3
Novoselov S., Volokhov V., Lavrentyeva G. Universal Speaker Recognition Encoders for Different Speech Segments Duration. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023. pp. 1-5. doi: 10.1109/ICASSP49357.2023.10096081
Prisyach T., Khokhlov Y., Korenevsky M., Mitrofanov A., Timofeeva T., Odegov I., Nasretdinov R., Lezhenin I., Miroshnichenko D., Karelin A., Mitrofanova M., Svechnikov R., Novoselov S., Romanenko A. STCON System for the CHiME-7 Challenge. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). 2023. pp. 87-92. doi: 10.21437/CHiME.2023-17
Novoselov S., Lavrentyeva G., Volokhov V., Volkova M., Khmelev N., Akulov A. Investigation of Different Calibration Methods for Deep Speaker Embedding Based Verification Systems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2023. Vol. 14338. pp. 159-168. doi: 10.1007/978-3-031-48309-7_13
Novoselov S., Lavrentyeva G., Avdeeva A., Volokhov V., Khmelev N., Akulov A., Leonteva P. On the robustness of wav2vec 2.0 based speaker recognition systems. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2023. pp. 3177-3181. doi: 10.21437/Interspeech.2023-881
Guidelines for laboratory work in the course "Speaker Recognition"
Lavrentyeva G., Novoselov S., Volokhov V., Avdeeva A.S., Gusev A., Vinogradova A., Korsunov I., Kozlov A., Pekhovsky T., Shulipa A., Smirnov E., Galyuk V. STC speaker recognition systems for the NIST SRE 2021. Odyssey 2022: The Speaker and Language Recognition Workshop. 2022. pp. 1-11.
Gusev A., Vinogradova A., Novoselov S., Astapov S. SdSVC Challenge 2021: Tips and Tricks to Boost the Short-duration Speaker Verification System Performance. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2021. Vol. 3. pp. 2003-2007. doi: 10.21437/Interspeech.2021-1737
Gusev A., Volokhov V., Vinogradova A., Andzhukaev T., Shulipa A., Novoselov S., Pekhovsky T., Kozlov A. STC-innovation Speaker Recognition Systems for Far-Field Speaker Verification Challenge 2020. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2020. pp. 3466-3470. doi: 10.21437/Interspeech.2020-2580
Gusev A., Volokhov V., Andzhukaev T., Novoselov S., Lavrentyeva G., Volkova M., Gazizullina A., Shulipa A., Gorlanov A., Avdeeva A.S., Ivanov A., Kozlov A., Pekhovsky T., Matveev Y. Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances. Odyssey 2020: The Speaker and Language Recognition Workshop. 2020. pp. 179-186. doi: 10.21437/Odyssey.2020-26
Lavrentyeva G., Volkova M., Avdeeva A., Novoselov S., Gorlanov A., Andzukaev T., Ivanov A., Kozlov A. Blind speech signal quality estimation for speaker verification systems. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2020. pp. 1535-1539. doi: 10.21437/Interspeech.2020-1826
Volkova M.V., Andzhukaev T., Lavrentyeva G., Novoselov S., Kozlov A. Light CNN Architecture Enhancement for Different Types Spoofing Attack Detection. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2019. Vol. 11658. pp. 520-529. doi: 10.1007/978-3-030-26061-3_53
Lavrentyeva G., Novoselov S., Andzhukaev T., Volkova M., Gorlanov A., Kozlov A. STC Antispoofing Systems for the ASVspoof2019 Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 1033-1037. doi: 10.21437/Interspeech.2019-1768
Lavrentyeva G., Novoselov S., Volkova M.V., Matveev Y.N., De Marsiko M. Phonespoof: A New Dataset for Spoofing Attack Detection in Telephone Channel. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2019. pp. 2572-2576. doi: 10.1109/ICASSP.2019.8682942
Novoselov S., Gusev A., Ivanov A., Pekhovsky T., Shulipa A., Avdeeva A.S., Gorlanov A., Kozlov A. Speaker Diarization with Deep Speaker Embeddings for DIHARD Challenge II. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 1003-1007. doi: 10.21437/Interspeech.2019-2757
Novoselov S., Gusev A., Ivanov A., Pekhovsky T., Shulipa A., Lavrentyeva G., Volokhov V., Kozlov A. STC Speaker Recognition Systems for The VOiCES From a Distance Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 2443-2447. doi: 10.21437/Interspeech.2019-2783
Novoselov S., Kudashev O., Shchemelinin V., Kremnev I., Lavrentyeva G. Deep CNN based feature extractor for text-prompted speaker recognition. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2018. pp. 5334-5338. doi: 10.1109/ICASSP.2018.8462358
Lavrentyeva G.M., Novoselov S.A., Kozlov A.V., Kudashev O.Yu., Shchemelinin V.L., Matveev Yu.N., De Marsiko M. Methods for detecting replay spoofing attacks on voice biometric systems. Scientific and Technical Journal of Information Technologies, Mechanics and Optics. 2018. Vol. 18. No. 3(115). pp. 428-436. doi: 10.17586/2226-1494-2018-18-3-428-436
Novoselov S.A., Shulipa A., Kremnev I.A., Kozlov S., Shchemelinin V. On deep speaker embeddings for text-independent speaker recognition. ODYSSEY 2018, Speaker and Language Recognition Workshop. 2018. pp. 378-385. doi: 10.21437/Odyssey.2018-53
Novoselov S., Shchemelinin V., Shulipa A., Kozlov A., Kremnev I. Triplet loss based cosine similarity metric learning for text-independent speaker recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2018. pp. 2242-2246. doi: 10.21437/Interspeech.2018-1209
Lavrentyeva G., Novoselov S., Simonchik K. Anti-spoofing methods for automatic speaker verification system. Communications in Computer and Information Science. 2017. Vol. 661. pp. 172-184. doi: 10.1007/978-3-319-52920-2_17
Shchemelinin V.L., Lavrentyeva G.M., Alsufyev A.A., Novoselov S.A. A method for improving speaker identification performance using multi-session voice models. Almanac of Scientific Works of Young Scientists of ITMO University. 2017. Vol. 3. pp. 223-226.
Luckyanets E., Melnikov A., Kudashev O., Novoselov S., Lavrentyeva G. Bimodal Anti-Spoofing System for Mobile Security. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 211-220. doi: 10.1007/978-3-319-66429-3_20
Lavrentyeva G., Novoselov S., Malykh E., Kozlov A., Kudashev O., Shchemelinin V. Audio-replay attack detection countermeasures. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 171-181. doi: 10.1007/978-3-319-66429-3_16
Lavrentyeva G., Novoselov S., Malykh E., Kozlov A., Kudashev O., Shchemelinin V. Audio replay attack detection with deep learning frameworks. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2017. pp. 82-86. doi: 10.21437/Interspeech.2017-360
Malykh E., Novoselov S., Kudashev O. On residual cnn in text-dependent speaker verification task. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 593-601. doi: 10.1007/978-3-319-66429-3_59
Smirnov E., Melnikov A., Novoselov S., Luckyanets E., Lavrentyeva G. Doppelganger Mining for Face Representation Learning. IEEE International Conference on Computer Vision Workshops (ICCVW 2017). 2017. pp. 1916-1923. doi: 10.1109/ICCVW.2017.226
Kudashev O., Novoselov S., Simonchik K., Kozlov A. A speaker recognition system for the SITW challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2016. pp. 833-837. doi: 10.21437/Interspeech.2016-1197
Shulipa A., Novoselov S., Melnikov A. Approaches for out-of-domain adaptation to improve speaker recognition performance. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 124-130. doi: 10.1007/978-3-319-43958-7_14
Shulipa A., Novoselov S., Matveev Y. Scores Calibration in Speaker Recognition Systems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 596-603. doi: 10.1007/978-3-319-43958-7_72
Pekhovsky T., Novoselov S., Sholohov A., Kudashev O. On autoencoders in the i-vector space for speaker recognition. Odyssey 2016: Speaker and Language Recognition Workshop. 2016. pp. 217-224. doi: 10.21437/Odyssey.2016-31
Simonchik K.K., Novoselov S., Lavrentyeva G. Comparative analysis of classifiers for automatic language recognition in spontaneous speech. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 174-181. doi: 10.1007/978-3-319-43958-7_20
Kudashev O., Novoselov S., Pekhovsky T., Simonchik K., Lavrentyeva G. Usage of DNN in speaker recognition: advantages and problems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9719. pp. 82-91. doi: 10.1007/978-3-319-40663-3_10
Novoselov S., Kozlov A., Lavrentyeva G., Simonchik K., Shchemelinin V. STC anti-spoofing systems for the ASVspoof 2015 challenge. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2016. pp. 5475-5479. doi: 10.1109/ICASSP.2016.7472724
Novoselov S.A., Kozlov A.V., Lavrentyeva G.M., Simonchik K.K., Shchemelinin V.L. Countering spoofing attacks on voice biometric systems. Rechevye Tekhnologii (Speech Technology). 2016. No. 1-2. pp. 22-33.
Novoselov S., Pekhovsky T., Shulipa A., Kudashev O. PLDA-based System for Text-prompted Password Speaker Verification. AVSS 2015 - 12th IEEE International Conference on Advanced Video and Signal Based Surveillance. 2015. pp. 7301798. doi: 10.1109/AVSS.2015.7301798
Lavrentyeva G., Kozlov A., Novoselov S., Simonchik K., Shchemelinin V. Automatically Trained TTS for Effective Attacks to Anti-spoofing System. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2015. Vol. 9319. pp. 137-143. doi: 10.1007/978-3-319-23132-7_17
Shchemelinin V., Kozlov A., Lavrentyeva G., Novoselov S., Simonchik K. Vulnerability of Voice Verification System with STC Anti-spoofing Detector to Different Methods of Spoofing Attacks. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2015. Vol. 9319. pp. 480-486. doi: 10.1007/978-3-319-23132-7_59
Novoselov S., Pekhovsky T., Kudashev O., Mendelev V., Prudnikov A. Non-linear PLDA for i-Vector Speaker Verification. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2015. pp. 214–218.
Novoselov S.A., Sukhmel V.A., Sholokhov A.V., Pekhovsky T.S. Application of the DTW method to multi-session training of hidden Markov models for text-dependent speaker verification. Izvestiya Vysshikh Uchebnykh Zavedenii. Priborostroenie (Journal of Instrument Engineering). 2014. Vol. 57. No. 2. pp. 77-84.
Novoselov S.A., Pekhovsky T.S., Simonchik K.K., Shulipa A.K. RBM-PLDA subsystem for the NIST i-Vector Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2014. pp. 378-382.
Novoselov S., Pekhovsky T.S., Shulipa A.K., Sholokhov A.V. Text-dependent GMM-JFA system for password based speaker verification. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2014. pp. 729-733. doi: 10.1109/ICASSP.2014.6853692
Germany, Dresden