Khmelev N., Malykh S., Anikin A., Korenevskaya A., Novoselov S., Volokhov V., Zorkina A., Marchevskiy V., Lavrentyeva G. In Search of Optimal Pretraining Strategy for Robust Speaker Recognition. 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2025. pp. 1-5. doi: 10.1109/ICASSP49660.2025.10889905
Ausev E., Volokhov V., Novoselov S., Marchevskiy V., Shangina E., Logunov A. ITMO language diarization and identification systems for the DISPLACE 2024 challenge. 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2025. pp. 1-5. doi: 10.1109/ICASSP49660.2025.10889582
Mitrofanov A., Prisyach T., Timofeeva T., Novoselov S., Korenevsky M., Khokhlov Y., Akulov A., Anikin A., Khalili R., Lezhenin I., Melnikov A., Miroshnichenko D., Mamaev N., Odegov I., Rudnitskaya O., Romanenko A. Accurate speaker counting, diarization and separation for advanced recognition of multichannel multispeaker conversations. Computer Speech and Language. 2025. Vol. 92. pp. 101780. doi: 10.1016/j.csl.2025.101780
Novoselov S.A., Korenevskaia A.M., Khmelev N.A., Malykh S.I., Anikin A.A., Zorkina A.A., Volokhov V.A., Marchevskii V.D. STCON NIST SRE24 System: Composite Speaker Recognition Solution for Challenging Scenarios. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2025. In press.
Khmelev N., Avdeeva A., Novoselov S., Chirkovskiy A., Volkova M. Robust Speaker Recognition for Whispered Speech. 2025 27th International Conference on Digital Signal Processing and its Applications (DSPA). 2025. pp. 1-5. doi: 10.1109/DSPA64310.2025.10977907
Khmelev N., Anikin A., Zorkina A., Korenevskaya A., Novoselov S., Malykh S., Volokhov V., Marchevskiy V., Volkova M., Lavrentyeva G. Joint Voice Activity Detection and Quality Estimation for Efficient Speech Preprocessing. 2025 27th International Conference on Digital Signal Processing and its Applications (DSPA). 2025. pp. 1-6. doi: 10.1109/DSPA64310.2025.10977856
Mitrofanov A., Novoselov S., Prisyach T., Marchevskiy V., Karelin A., Khmelev N., Dutov D., Malykh S., Agafonov I., Nikitin A.V., Petrov O. Cryfish: On deep audio analysis with Large Language Models. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2025. In press.
Mitrofanov A., Prisyach T., Timofeeva T., Novoselov S., Korenevsky M., Khokhlov Y., Akulov A., Anikin A., Khalili R., Lezhenin I., Melnikov A., Miroshnichenko D., Mamaev N., Odegov I., Rudnitskaya O., Romanenko A. STCON System for the CHiME-8 Challenge. 8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024). 2024. pp. 13-17. doi: 10.21437/CHiME.2024-3
Prisyach T., Khokhlov Y., Korenevsky M., Mitrofanov A., Timofeeva T., Odegov I., Nasretdinov R., Lezhenin I., Miroshnichenko D., Karelin A., Mitrofanova M., Svechnikov R., Novoselov S., Romanenko A. STCON System for the CHiME-7 Challenge. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). 2023. pp. 87-92. doi: 10.21437/CHiME.2023-17
Novoselov S., Lavrentyeva G., Volokhov V., Volkova M., Khmelev N., Akulov A. Investigation of Different Calibration Methods for Deep Speaker Embedding Based Verification Systems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2023. Vol. 14338. pp. 159-168. doi: 10.1007/978-3-031-48309-7_13
Novoselov S., Lavrentyeva G., Avdeeva A., Volokhov V., Khmelev N., Akulov A., Leonteva P. On the robustness of wav2vec 2.0 based speaker recognition systems. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2023. pp. 3177-3181. doi: 10.21437/Interspeech.2023-881
Novoselov S., Volokhov V., Lavrentyeva G. Universal Speaker Recognition Encoders for Different Speech Segments Duration. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023. pp. 1-5. doi: 10.1109/ICASSP49357.2023.10096081
Lavrentyeva G., Novoselov S., Volokhov V., Avdeeva A.S., Gusev A., Vinogradova A., Korsunov I., Kozlov A., Pekhovsky T., Shulipa A., Smirnov E., Galyuk V. STC speaker recognition systems for the NIST SRE 2021. Odyssey 2022: The Speaker and Language Recognition Workshop. 2022. pp. 1-11.
Guidelines for laboratory work in the course "Speaker Recognition" (in Russian)
Gusev A., Vinogradova A., Novoselov S., Astapov S. SdSVC Challenge 2021: Tips and Tricks to Boost the Short-duration Speaker Verification System Performance. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2021. Vol. 3. pp. 2003-2007. doi: 10.21437/Interspeech.2021-1737
Gusev A., Volokhov V., Vinogradova A., Andzhukaev T., Shulipa A., Novoselov S., Pekhovsky T., Kozlov A. STC-innovation Speaker Recognition Systems for Far-Field Speaker Verification Challenge 2020. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2020. pp. 3466-3470. doi: 10.21437/Interspeech.2020-2580
Gusev A., Volokhov V., Andzhukaev T., Novoselov S., Lavrentyeva G., Volkova M., Gazizullina A., Shulipa A., Gorlanov A., Avdeeva A.S., Ivanov A., Kozlov A., Pekhovsky T., Matveev Y. Deep Speaker Embeddings for Far-Field Speaker Recognition on Short Utterances. Odyssey 2020: The Speaker and Language Recognition Workshop. 2020. pp. 179-186. doi: 10.21437/Odyssey.2020-26
Lavrentyeva G., Volkova M., Avdeeva A., Novoselov S., Gorlanov A., Andzhukaev T., Ivanov A., Kozlov A. Blind speech signal quality estimation for speaker verification systems. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2020. pp. 1535-1539. doi: 10.21437/Interspeech.2020-1826
Lavrentyeva G., Novoselov S., Andzhukaev T., Volkova M., Gorlanov A., Kozlov A. STC Antispoofing Systems for the ASVspoof2019 Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 1033-1037. doi: 10.21437/Interspeech.2019-1768
Novoselov S., Gusev A., Ivanov A., Pekhovsky T., Shulipa A., Lavrentyeva G., Volokhov V., Kozlov A. STC Speaker Recognition Systems for The VOiCES From a Distance Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 2443-2447. doi: 10.21437/Interspeech.2019-2783
Novoselov S., Gusev A., Ivanov A., Pekhovsky T., Shulipa A., Avdeeva A.S., Gorlanov A., Kozlov A. Speaker Diarization with Deep Speaker Embeddings for DIHARD Challenge II. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019. pp. 1003-1007. doi: 10.21437/Interspeech.2019-2757
Lavrentyeva G., Novoselov S., Volkova M.V., Matveev Y.N., De Marsico M. Phonespoof: A New Dataset for Spoofing Attack Detection in Telephone Channel. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2019. pp. 2572-2576. doi: 10.1109/ICASSP.2019.8682942
Volkova M.V., Andzhukaev T., Lavrentyeva G., Novoselov S., Kozlov A. Light CNN Architecture Enhancement for Different Types Spoofing Attack Detection. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2019. Vol. 11658. pp. 520–529. doi: 10.1007/978-3-030-26061-3_53
Novoselov S., Kudashev O., Shchemelinin V., Kremnev I., Lavrentyeva G. Deep CNN based feature extractor for text-prompted speaker recognition. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2018. pp. 5334-5338. doi: 10.1109/ICASSP.2018.8462358
Lavrentyeva G.M., Novoselov S.A., Kozlov A.V., Kudashev O.Yu., Shchemelinin V.L., Matveev Yu.N., De Marsico M. Methods for detecting replay spoofing attacks on voice biometric systems (in Russian). Scientific and Technical Journal of Information Technologies, Mechanics and Optics. 2018. Vol. 18. No. 3(115). pp. 428–436. doi: 10.17586/2226-1494-2018-18-3-428-436
Novoselov S., Shchemelinin V., Shulipa A., Kozlov A., Kremnev I. Triplet loss based cosine similarity metric learning for text-independent speaker recognition. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2018. pp. 2242-2246. doi: 10.21437/Interspeech.2018-1209
Novoselov S.A., Shulipa A., Kremnev I.A., Kozlov S., Shchemelinin V. On deep speaker embeddings for text-independent speaker recognition. ODYSSEY 2018, Speaker and Language Recognition Workshop. 2018. pp. 378-385. doi: 10.21437/Odyssey.2018-53
Malykh E., Novoselov S., Kudashev O. On residual cnn in text-dependent speaker verification task. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 593-601. doi: 10.1007/978-3-319-66429-3_59
Lavrentyeva G., Novoselov S., Simonchik K. Anti-spoofing methods for automatic speaker verification system. Communications in Computer and Information Science. 2017. Vol. 661. pp. 172-184. doi: 10.1007/978-3-319-52920-2_17
Smirnov E., Melnikov A., Novoselov S., Luckyanets E., Lavrentyeva G. Doppelganger Mining for Face Representation Learning. IEEE International Conference on Computer Vision Workshops (ICCVW 2017). 2017. pp. 1916-1923. doi: 10.1109/ICCVW.2017.226
Shchemelinin V.L., Lavrentyeva G.M., Alsufyev A.A., Novoselov S.A. A method for improving the effectiveness of speaker identification using multi-session voice models (in Russian). Almanac of Scientific Works of Young Scientists of ITMO University. 2017. Vol. 3. pp. 223-226.
Luckyanets E., Melnikov A., Kudashev O., Novoselov S., Lavrentyeva G. Bimodal Anti-Spoofing System for Mobile Security. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 211-220. doi: 10.1007/978-3-319-66429-3_20
Lavrentyeva G., Novoselov S., Malykh E., Kozlov A., Kudashev O., Shchemelinin V. Audio-replay attack detection countermeasures. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2017. Vol. 10458. pp. 171-181. doi: 10.1007/978-3-319-66429-3_16
Lavrentyeva G., Novoselov S., Malykh E., Kozlov A., Kudashev O., Shchemelinin V. Audio replay attack detection with deep learning frameworks. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2017. pp. 82-86. doi: 10.21437/Interspeech.2017-360
Novoselov S.A., Kozlov A.V., Lavrentyeva G.M., Simonchik K.K., Shchemelinin V.L. Countering spoofing attacks on voice biometric systems (in Russian). Rechevye tekhnologii [Speech Technologies]. 2016. No. 1-2. pp. 22-33.
Shulipa A., Novoselov S., Melnikov A. Approaches for out-of-domain adaptation to improve speaker recognition performance. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 124-130. doi: 10.1007/978-3-319-43958-7_14
Pekhovsky T., Novoselov S., Sholohov A., Kudashev O. On autoencoders in the i-vector space for speaker recognition. Odyssey 2016: Speaker and Language Recognition Workshop. 2016. pp. 217-224. doi: 10.21437/Odyssey.2016-31
Simonchik K.K., Novoselov S., Lavrentyeva G. Comparative analysis of classifiers for automatic language recognition in spontaneous speech. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 174-181. doi: 10.1007/978-3-319-43958-7_20
Novoselov S., Kozlov A., Lavrentyeva G., Simonchik K., Shchemelinin V. STC anti-spoofing systems for the ASVspoof 2015 challenge. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2016. pp. 5475-5479. doi: 10.1109/ICASSP.2016.7472724
Kudashev O., Novoselov S., Pekhovsky T., Simonchik K., Lavrentyeva G. Usage of DNN in speaker recognition: advantages and problems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9719. pp. 82-91. doi: 10.1007/978-3-319-40663-3_10
Shulipa A., Novoselov S., Matveev Y. Scores Calibration in Speaker Recognition Systems. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2016. Vol. 9811. pp. 596-603. doi: 10.1007/978-3-319-43958-7_72
Kudashev O., Novoselov S., Simonchik K., Kozlov A. A speaker recognition system for the SITW challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2016. pp. 833-837. doi: 10.21437/Interspeech.2016-1197
Novoselov S., Pekhovsky T., Shulipa A., Kudashev O. PLDA-based System for Text-prompted Password Speaker Verification. AVSS 2015 - 12th IEEE International Conference on Advanced Video and Signal Based Surveillance. 2015. pp. 7301798. doi: 10.1109/AVSS.2015.7301798
Lavrentyeva G., Kozlov A., Novoselov S., Simonchik K., Shchemelinin V. Automatically Trained TTS for Effective Attacks to Anti-spoofing System. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2015. Vol. 9319. pp. 137-143. doi: 10.1007/978-3-319-23132-7_17
Shchemelinin V., Kozlov A., Lavrentyeva G., Novoselov S., Simonchik K. Vulnerability of Voice Verification System with STC Anti-spoofing Detector to Different Methods of Spoofing Attacks. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2015. Vol. 9319. pp. 480-486. doi: 10.1007/978-3-319-23132-7_59
Novoselov S., Pekhovsky T., Kudashev O., Mendelev V., Prudnikov A. Non-linear PLDA for i-Vector Speaker Verification. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2015. pp. 214–218.
Novoselov S.A., Pekhovsky T.S., Simonchik K.K., Shulipa A.K. RBM-PLDA subsystem for the NIST i-Vector Challenge. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2014. pp. 378-382.
Novoselov S., Pekhovsky T.S., Shulipa A.K., Sholokhov A.V. Text-dependent GMM-JFA system for password based speaker verification. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2014. pp. 729-733. doi: 10.1109/ICASSP.2014.6853692
Novoselov S.A., Sukhmel V.A., Sholokhov A.V., Pekhovsky T.S. Application of the DTW method for multi-session training of hidden Markov models in text-dependent speaker verification (in Russian). Izvestiya vysshikh uchebnykh zavedeniy. Priborostroenie [Journal of Instrument Engineering]. 2014. Vol. 57. No. 2. pp. 77-84.
Dresden, Germany