Self Multi-Head Attention for Speaker Recognition M India, P Safari, J Hernando INTERSPEECH, 2019 | 150 | 2019 |
Self-attention encoding and pooling for speaker recognition P Safari, M India, J Hernando arXiv preprint arXiv:2008.01077, 2020 | 102 | 2020 |
LSTM neural network-based speaker segmentation using acoustic and language modelling MÀ India Massana, JA Rodríguez Fonollosa, FJ Hernando Pericás INTERSPEECH 2017: 20-24 August 2017: Stockholm, 2834-2838, 2017 | 37 | 2017 |
Double multi-head attention for speaker verification M India, P Safari, J Hernando ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 29 | 2021 |
Towards large scale multimedia indexing: A case study on person discovery in broadcast news N Le, H Bredin, G Sargent, M India, P Lopez-Otero, C Barras, ... Proceedings of the 15th International Workshop on Content-Based Multimedia …, 2017 | 16 | 2017 |
I-vector transformation using k-nearest neighbors for speaker verification U Khan, M India, J Hernando ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 8 | 2020 |
UPC System for the 2016 MediaEval Multimodal Person Discovery in Broadcast TV task M India, G Martí, C Cortillas, JR Morros, FJ Hernando MediaEval 2016 Multimedia Benchmark Workshop, 2016 | 7* | 2016 |
UPC system for the 2015 MediaEval multimodal person discovery in broadcast TV task D Varas González, V Vilaplana Besler, JR Morros Rubió, ... MediaEval 2015 Multimedia Benchmark Workshop, 2015 | 6* | 2015 |
Language modelling for speaker diarization in telephonic interviews M India, J Hernando, JAR Fonollosa Computer Speech & Language 78, 101441, 2023 | 5 | 2023 |
Auto-Encoding Nearest Neighbor i-vectors for Speaker Verification U Khan, M India, J Hernando Proc. Interspeech 2019, 4060-4064, 2019 | 5 | 2019 |
Double Multi-Head Attention Multimodal System for Odyssey 2024 Speech Emotion Recognition Challenge F Costa, M India, J Hernando arXiv preprint arXiv:2406.10598, 2024 | 4 | 2024 |
Self attention networks in speaker recognition P Safari, M India, J Hernando Applied Sciences 13 (11), 6410, 2023 | 4 | 2023 |
UPC system for the 2015 MediaEval multimodal person discovery in broadcast TV task M India Massana, D Varas González, V Vilaplana Besler, ... MediaEval 2015 Multimedia Benchmark Workshop, 2015 | 4 | 2015 |
BSC-UPC at EmoSPeech-IberLEF2024: Attention Pooling for Emotion Recognition M Casals-Salvador, F Costa, M India, J Hernando arXiv preprint arXiv:2407.12467, 2024 | 3 | 2024 |
UPC multimodal speaker diarization system for the 2018 Albayzin Challenge MÀ India Massana, I Sagastiberri, P Palau Puigdevall, E Sayrol Clols, ... IberSPEECH 2018: program and proceedings: 21-23 November 2018: Barcelona …, 2018 | 3 | 2018 |
Speaker characterization by means of attention pooling F Costa, M India, J Hernando arXiv preprint arXiv:2405.04096, 2024 | 1 | 2024 |