Ausev E., Volokhov V., Novoselov S., Marchevskiy V., Shangina E., Logunov A. ITMO language diarization and identification systems for the DISPLACE 2024 challenge. 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2025. pp. 1-5.. doi: 10.1109/ICASSP49660.2025.10889582
Khmelev N., Avdeeva A., Novoselov S., Chirkovskiy A., Volkova M. Robust Speaker Recognition for Whispered Speech. 2025 27th International Conference on Digital Signal Processing and its Applications (DSPA). 2025. pp. 1-5.. doi: 10.1109/DSPA64310.2025.10977907
Malykh S., Anikin A., Khmelev N., Korenevskaya A., Zorkina A., Novoselov S., Marchevskiy V., Volokhov V., Shulipa A., Kozlov A., Melnikov A., Galyuk V., Pekhovsky T. STCON NIST SRE24 System: Composite Speaker Recognition Solution for Challenging Scenarios. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2025. pp. 3983-3987.. doi: 10.21437/Interspeech.2025-2170
Mitrofanov A., Prisyach T., Timofeeva T., Novoselov S., Korenevsky M., Khokhlov Y., Akulov A., Anikin A., Khalili R., Lezhenin I., Melnikov A., Miroshnichenko D., Mamaev N., Odegov I., Rudnitskaya O., Romanenko A. Accurate speaker counting, diarization and separation for advanced recognition of multichannel multispeaker conversations. Computer Speech and Language. 2025. Vol. 92. pp. 101780.. doi: 10.1016/j.csl.2025.101780
Khmelev N., Anikin A., Zorkina A., Korenevskaya A., Novoselov S., Malykh S., Volokhov V., Marchevskiy V., Volkova M., Lavrentyeva G. Joint Voice Activity Detection and Quality Estimation for Efficient Speech Preprocessing. 2025 27th International Conference on Digital Signal Processing and its Applications (DSPA). 2025. pp. 1-6.. doi: 10.1109/DSPA64310.2025.10977856
Mitrofanov A., Novoselov S., Prisyach T., Marchevskiy V., Karelin A., Khmelev N., Dutov D., Malykh S., Agafonov I., Nikitin A., Petrov O. Cryfish: On deep audio analysis with Large Language Models. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2025. pp. 3249-3253.. doi: 10.21437/Interspeech.2025-2109
Khmelev N., Malykh S., Anikin A., Korenevskaya A., Novoselov S., Volokhov V., Zorkina A., Marchevskiy V., Lavrentyeva G. In Search of Optimal Pretraining Strategy for Robust Speaker Recognition. 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2025. pp. 1-5.. doi: 10.1109/ICASSP49660.2025.10889905
Mitrofanov A., Prisyach T., Timofeeva T., Novoselov S., Korenevsky M., Khokhlov Y., Akulov A., Anikin A., Khalili R., Lezhenin I., Melnikov A., Miroshnichenko D., Mamaev N., Odegov I., Rudnitskaya O., Romanenko A. STCON System for the CHiME-8 Challenge. 8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024). 2024. pp. 13-17.. doi: 10.21437/CHiME.2024-3
Novoselov S., Lavrentyeva G., Avdeeva A., Volokhov V., Khmelev N., Akulov A., Leonteva P. On the robustness of wav2vec 2.0 based speaker recognition systems. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2023. pp. 3177-3181.. doi: 10.21437/Interspeech.2023-881
Novoselov S., Lavrentyeva G., Volokhov V., Volkova M., Khmelev N., Akulov A. Investigation of Different Calibration Methods for Deep Speaker Embedding Based Verification Systems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2023. Vol. 14338. pp. 159-168.. doi: 10.1007/978-3-031-48309-7_13
Novoselov S., Volokhov V., Lavrentyeva G. Universal Speaker Recognition Encoders for Different Speech Segments Duration. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023. pp. 1-5.. doi: 10.1109/ICASSP49357.2023.10096081
Prisyach T., Khokhlov Y., Korenevsky M., Mitrofanov A., Timofeeva T., Odegov I., Nasretdinov R., Lezhenin I., Miroshnichenko D., Karelin A., Mitrofanova M., Svechnikov R., Novoselov S., Romanenko A. STCON System for the CHiME-7 Challenge. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). 2023. pp. 87-92.. doi: 10.21437/CHiME.2023-17
Методические указания к выполнению лабораторных работ по курсу "Распознавание диктора"
Lavrentyeva G., Novoselov S., Volokhov V., Avdeeva A.S., Gusev A., Vinogradova A., Korsunov I., Kozlov A., Pekhovsky T., Shulipa A., Smirnov E., Galyuk V. STC speaker recognition systems for the NIST SRE 2021. Odyssey 2022: The Speaker and Language Recognition Workshop. 2022. pp. 1-11.
Gusev A., Vinogradova A., Novoselov S., Astapov S. SdSVC Challenge 2021: Tips and Tricks to Boost the Short-duration Speaker Verification System Performance. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2021. Vol. 3. pp. 2003-2007.. doi: 10.21437/Interspeech.2021-1737
Lavrentyeva G., Volkova M., Avdeeva A., Novoselov S., Gorlanov A., Andzukaev T., Ivanov A., Kozlov A. Blind speech signal quality estimation for speaker verification systems. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2020. pp. 1535-1539.. doi: 10.21437/Interspeech.2020-1826
Gusev A., Volokhov V., Vinogradova A., Andzhukaev T., Shulipa A., Novoselov S., Pekhovsky T., Kozlov A. STC-innovation Speaker Recognition Systems for Far-Field Speaker Verification Challenge 2020. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2020. pp. 3466-3470.. doi: 10.21437/Interspeech.2020-2580
Gusev A., Volokhov V., Andzhukaev T., Novoselov S., Lavrentyeva G., Volkova M., Gazizullina A., Shulipa A., Gorlanov A., Avdeeva A.S., Ivanov A., Kozlov A., Pekhovsky T., Matveev Y. Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances. Odyssey 2020: The Speaker and Language Recognition Workshop. 2020. pp. 179-186.. doi: 10.21437/Odyssey.2020-26
Novoselov S., Gusev A., Ivanov A., Pekhovsky T., Shulipa A., Lavrentyeva G., Volokhov V., Kozlov A. STC Speaker Recognition Systems for The VOiCES From a Distance Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 2443-2447.. doi: 10.21437/Interspeech.2019-2783
Lavrentyeva G., Novoselov S., Andzhukaev T., Volkova M., Gorlanov A., Kozlov A. STC Antispoofing Systems for the ASVspoof2019 Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 1033-1037.. doi: 10.21437/Interspeech.2019-1768
Novoselov S., Gusev A., Ivanov A., Pekhovsky T., Shulipa A., Avdeeva A.S., Gorlanov A., Kozlov A. Speaker Diarization with Deep Speaker Embeddings for DIHARD Challenge II. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 1003-1007.. doi: 10.21437/Interspeech.2019-2757
Lavrentyeva G., Novoselov S., Volkova M.V., Matveev Y.N., De Marsiko M. Phonespoof: A New Dataset for Spoofing Attack Detection in Telephone Channel. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2019. pp. 2572-2576.. doi: 10.1109/ICASSP.2019.8682942
Volkova M.V., Andzhukaev T., Lavrentyeva G., Novoselov S., Kozlov A. Light CNN Architecture Enhancement for Different Types Spoofing Attack Detection. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2019. Vol. 11658. pp. 520–529.. doi: 10.1007/978-3-030-26061-3_53
Лаврентьева Г.М., Новоселов С.А., Козлов А.В., Кудашев О.Ю., Щемелинин В.Л., Матвеев Ю.Н., Де Марсико М. Методы детектирования спуфинг-атак повторного воспроизведения на голосовые биометрические системы. Научно-технический вестник информационных технологий, механики и оптики [Scientific and Technical Journal of Information Technologies, Mechanics and Optics]. 2018. Т. 18. № 3(115). С. 428–436.. doi: 10.17586/2226-1494-2018-18-3-428-436
Novoselov S., Kudashev O., Shchemelinin V., Kremnev I., Lavrentyeva G. Deep CNN based feature extractor for text-prompted speaker recognition. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2018. pp. 5334-5338.. doi: 10.1109/ICASSP.2018.8462358
Novoselov S., Shchemelinin V., Shulipa A., Kozlov A., Kremnev I. Triplet loss based cosine similarity metric learning for text-independent speaker recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2018. pp. 2242-2246.. doi: 10.21437/Interspeech.2018-1209
Novoselov S.A., Shulipa A., Kremnev I.A., Kozlov S., Shchemelinin V. On deep speaker embeddings for text-independent speaker recognition. ODYSSEY 2018, Speaker and Language Recognition Workshop. 2018. pp. 378-385.. doi: 10.21437/Odyssey.2018-53
Luckyanets E., Melnikov A., Kudashev O., Novoselov S., Lavrentyeva G. Bimodal Anti-Spoofing System for Mobile Security. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 211-220.. doi: 10.1007/978-3-319-66429-3_20
Lavrentyeva G., Novoselov S., Simonchik K. Anti-spoofing methods for automatic speaker verification system. Communications in Computer and Information Science. 2017. Vol. 661. pp. 172-184.. doi: 10.1007/978-3-319-52920-2_17
Lavrentyeva G., Novoselov S., Malykh E., Kozlov A., Kudashev O., Shchemelinin V. Audio-replay attack detection countermeasures. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 171-181.. doi: 10.1007/978-3-319-66429-3_16
Lavrentyeva G., Novoselov S., Malykh E., Kozlov A., Kudashev O., Shchemelinin V. Audio replay attack detection with deep learning frameworks. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2017. pp. 82-86.. doi: 10.21437/Interspeech.2017-360
Smirnov E., Melnikov A., Novoselov S., Luckyanets E., Lavrentyeva G. Doppelganger Mining for Face Representation Learning. IEEE International Conference on Computer Vision Workshops (ICCVW 2017). 2017. pp. 1916-1923.. doi: 10.1109/ICCVW.2017.226
Malykh E., Novoselov S., Kudashev O. On residual cnn in text-dependent speaker verification task. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 593-601.. doi: 10.1007/978-3-319-66429-3_59
Щемелинин В.Л., Лаврентьева Г.М., Алсуфьев А.А., Новоселов С.А. Метод повышения эффективности идентификации диктора за счет использования мультисессионных голосовых моделей. Альманах научных работ молодых ученых Университета ИТМО. 2017. Т. 3. С. 223-226.
Shulipa A., Novoselov S., Melnikov A. Approaches for out-of-domain adaptation to improve speaker recognition performance. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 124-130.. doi: 10.1007/978-3-319-43958-7_14
Kudashev O., Novoselov S., Simonchik K., Kozlov A. A speaker recognition system for the SITW challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2016. pp. 833-837.. doi: 10.21437/Interspeech.2016-1197
Novoselov S., Kozlov A., Lavrentyeva G., Simonchik K., Shchemelinin V. STC anti-spoofing systems for the ASVspoof 2015 challenge. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2016. pp. 5475-5479.. doi: 10.1109/ICASSP.2016.7472724
Новоселов С.А., Козлов А.В., Лаврентьева Г.М., Симончик К.К., Щемелинин В.Л. Противодействие спуфинг атакам на голосовые биометрические системы. Речевые технологии. 2016. № 1-2. С. 22-33.
Pekhovsky T., Novoselov S., Sholohov A., Kudashev O. On autoencoders in the i-vector space for speaker recognition. Odyssey 2016: Speaker and Language Recognition Workshop. 2016. pp. 217-224.. doi: 10.21437/Odyssey.2016-31
Shulipa A., Novoselov S., Matveev Y. Scores Calibration in Speaker Recognition Systems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 596-603.. doi: 10.1007/978-3-319-43958-7_72
Kudashev O., Novoselov S., Pekhovsky T., Simonchik K., Lavrentyeva G. Usage of DNN in speaker recognition: advantages and problems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9719. pp. 82-91.. doi: 10.1007/978-3-319-40663-3_10
Simonchik K.K., Novoselov S., Lavrentyeva G. Comparative analysis of classifiers for automatic language recognition in spontaneous speech. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 174-181.. doi: 10.1007/978-3-319-43958-7_20
Novoselov S., Pekhovsky T., Kudashev O., Mendelev V., Prudnikov A. Non-linear PLDA for i-Vector Speaker Verification. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2015. pp. 214–218.
Shchemelinin V., Kozlov A., Lavrentyeva G., Novoselov S., Simonchik K. Vulnerability of Voice Verification System with STC Anti-spoofing Detector to Different Methods of Spoofing Attacks. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2015. Vol. 9319. pp. 480-486.. doi: 10.1007/978-3-319-23132-7_59
Lavrentyeva G., Kozlov A., Novoselov S., Simonchik K., Shchemelinin V. Automatically Trained TTS for Effective Attacks to Anti-spoofing System. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2015. Vol. 9319. pp. 137-143.. doi: 10.1007/978-3-319-23132-7_17
Novoselov S., Pekhovsky T., Shulipa A., Kudashev O. PLDA-based System for Text-prompted Password Speaker Verification. AVSS 2015 - 12th IEEE International Conference on Advanced Video and Signal Based Surveillance. 2015. pp. 7301798.. doi: 10.1109/AVSS.2015.7301798
Новоселов С.А., Сухмель В.А., Шолохов А.В., Пеховский Т.С. Применение DTW-метода для мультисессионного обучения скрытых марковских моделей в задаче текстозависимой верификации диктора. Известия высших учебных заведений. Приборостроение. 2014. Т. 57. № 2. С. 77-84.
Novoselov S., Pekhovsky T.S., Shulipa A.K., Sholokhov A.V. Text-dependent GMM-JFA system for password based speaker verification. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2014. pp. 729-733.. doi: 10.1109/ICASSP.2014.6853692
Novoselov S.A., Pekhovsky T.S., Simonchik K.K., Shulipa A.K. RBM-PLDA subsystem for the NIST i-Vector Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2014. pp. 378-382.
Германия, Дрезден