CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech S Karlapati, A Moinet, A Joly, V Klimkov, D Sáez-Trigueros, T Drugman arXiv preprint arXiv:2004.14617, 2020 | 82 | 2020 |
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data M Łajszczak, G Cámbara, Y Li, F Beyhan, A van Korlaar, F Yang, A Joly, ... arXiv preprint arXiv:2402.08093, 2024 | 35 | 2024 |
CAMP: a Two-Stage Approach to Modelling Prosody in Context Z Hodari, A Moinet, S Karlapati, J Lorenzo-Trueba, T Merritt, A Joly, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 34 | 2021 |
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech S Karlapati, A Abbas, Z Hodari, A Moinet, A Joly, P Karanasou, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 21 | 2021 |
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody P Makarov, A Abbas, M Łajszczak, A Joly, S Karlapati, A Moinet, ... arXiv preprint arXiv:2206.14643, 2022 | 16 | 2022 |
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer S Karlapati, P Karanasou, M Lajszczak, A Abbas, A Moinet, P Makarov, ... arXiv preprint arXiv:2206.13443, 2022 | 14 | 2022 |
Expressive, Variable, and Controllable Duration Modelling in TTS A Abbas, T Merritt, A Moinet, S Karlapati, E Muszynska, S Slangen, E Gatti, ... arXiv preprint arXiv:2206.14165, 2022 | 11 | 2022 |
A learned conditional prior for the VAE acoustic space of a TTS system P Karanasou, S Karlapati, A Moinet, A Joly, A Abbas, S Slangen, ... | 9 | 2021 |
Predicting deformation mechanisms in architected metamaterials using GNN PP Indurkar, S Karlapati, AJD Shaikeea, VS Deshpande arXiv preprint arXiv:2202.09427, 2022 | 8 | 2022 |
Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments A Mottini, J Lorenzo-Trueba, SVK Karlapati, T Drugman arXiv preprint arXiv:2106.08873, 2021 | 7 | 2021 |
Hash based frequent pattern mining approach to text compression C Oswald, S Srinidhi, KS Vishnu, TV Vishal, B Sivaselvan First EAI International Conference on Computer Science and Engineering, 228-238, 2017 | 4 | 2017 |
Multi-scale spectrogram text-to-speech SA Abbas, B Bollepalli, AP Moinet, TR Drugman, AVPY Joly, P Karanasou, ... US Patent 11,694,674, 2023 | 3 | 2023 |
eCat: An end-to-end model for multi-speaker TTS & many-to-many fine-grained prosody transfer A Abbas, S Karlapati, B Schnell, P Karanasou, MG Moya, A Nagaraj, ... arXiv preprint arXiv:2306.11327, 2023 | 3 | 2023 |
Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials I Grega, I Batatia, G Csányi, S Karlapati, VS Deshpande arXiv preprint arXiv:2401.16914, 2024 | 2 | 2024 |
A Comparative Analysis of Pretrained Language Models for Text-to-Speech MG Moya, P Karanasou, S Karlapati, B Schnell, N Peinelt, A Moinet, ... 12th Speech Synthesis Workshop (SSW) 2023, 2023 | 2* | 2023 |
Learned condition text-to-speech synthesis P Karanasou, SVK Karlapati, AP Moinet, AVPY Joly, SA Abbas, ... US Patent 11,830,476, 2023 | 1 | 2023 |
Synthetic speech processing JL Trueba, ARM D'Oliveira, TR Drugman, SVK KARLAPATI US Patent 11,735,156, 2023 | 1 | 2023 |
Synthetic speech processing JL Trueba, ARM D'Oliveira, TR Drugman, SVK KARLAPATI US Patent App. 18/305,456, 2023 | 1 | 2023 |
Synthetic speech processing by representing text by phonemes exhibiting predicted volume and pitch using neural networks A Joly, S Slangen, AP Moinet, TR Drugman, P Karanasou, SA Abbas, ... US Patent 11,978,431, 2024 | | 2024 |
Synthetic speech processing AVPY Joly, P Karanasou, APJ Moinet, TR Drugman, SVK Karlapati, ... US Patent 11,574,624, 2023 | | 2023 |