Suivre
Pablo Gimeno
Pablo Gimeno
Autres nomsPablo Gimeno Jordán
ELSA Corp.
Adresse e-mail validée de elsanow.io
Titre
Citée par
Citée par
Année
Multiclass audio segmentation based on recurrent neural networks for broadcast domain data
P Gimeno, I Viñals, A Ortega, A Miguel, E Lleida
EURASIP Journal on Audio, Speech, and Music Processing 2020, 1-19, 2020
442020
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge.
I Vinals, P Gimeno, A Ortega, A Miguel, E Lleida
Interspeech, 2803-2807, 2018
322018
Phonetically-Aware Embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention Models for the 2018 NIST Speaker Recognition Evaluation.
I Viñals, D Ribas, V Mingote, J Llombart, P Gimeno, A Miguel, ...
Interspeech, 4310-4314, 2019
112019
Generalizing AUC optimization to multiclass classification for audio segmentation with limited training data
P Gimeno, V Mingote, A Ortega, A Miguel, E Lleida
IEEE Signal Processing Letters 28, 1135-1139, 2021
102021
ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge
I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida
Proc. Interspeech 2019, 988-992, 2019
102019
In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge
I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida
Proc. IberSPEECH 2018, 220-223, 2018
102018
Partial AUC Optimisation using Recurrent Neural Networks for Music Detection with Limited Training Data
P Gimeno, V Mingote, A Ortega, A Miguel, E Lleida
Interspeech, 3067-3071, 2020
92020
Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions
P Gimeno, D Ribas, A Ortega, A Miguel, E Lleida
Iberspeech 2021, 2021
82021
A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data
P Gimeno, I Viñals, A Ortega, A Miguel, E Lleida
Proc. IberSPEECH 2018, 87-91, 2018
72018
Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021
P Gimeno, A Ortega, A Miguel, E Lleida
Proc. Interspeech 2021, 4359-4363, 2021
52021
Improved cross-lingual transfer learning for automatic speech translation
S Khurana, N Dawalatabad, A Laurent, L Vicente, P Gimeno, V Mingote, ...
arXiv preprint arXiv:2306.00789, 2023
32023
Multimodal diarization systems by training enrollment models as identity representations
V Mingote, I Viñals, P Gimeno, A Miguel, A Ortega, E Lleida
Applied Sciences 12 (3), 1141, 2022
32022
ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge
V Mingote, I Vinals, P Gimeno, A Miguel, A Ortega, E Lleida
Iberspeech 2021, 2021
22021
Unsupervised adaptation of deep speech activity detection models to unseen domains
P Gimeno, D Ribas, A Ortega, A Miguel, E Lleida
Applied Sciences 12 (4), 1832, 2022
12022
Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge
I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida
Proc. IberSPEECH 2021, 94-98, 2021
12021
ViVoVAD: a Voice Activity Detection Tool based on Recurrent Neural Networks
PG Jordán, IV Bailo, AO Giménez, AM Artiaga, EL Solano
Jornada de Jóvenes Investigadores del I3A 7, 2019
12019
Cross-Lingual Transfer Learning for Low-Resource Speech Translation
S Khurana, N Dawalatabad, A Laurent, L Vicente, P Gimeno, V Mingote, ...
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2024
2024
Direct Text to Speech Translation System Using Acoustic Units
V Mingote, P Gimeno, L Vicente, S Khurana, A Laurent, J Duret
IEEE Signal Processing Letters, 2023
2023
Advances in Binary and Multiclass Audio Segmentation with Deep Learning Techniques
P Gimeno, A Ortega
2023
Multi-lingual Speech to Speech Translation for Under-Resourced Languages
A Larcher, Y Estève, M Rouvier, N Tomashenko, J Duret, G Laperriere, ...
Le Mans Université, 2022
2022
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20