Follow
Sri Karlapati
Sri Karlapati
Amazon Research, Cambridge, United Kingdom.
Verified email at amazon.com
Title
Cited by
Cited by
Year
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech
S Karlapati, A Moinet, A Joly, V Klimkov, D Sáez-Trigueros, T Drugman
arXiv preprint arXiv:2004.14617, 2020
822020
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
M Łajszczak, G Cámbara, Y Li, F Beyhan, A van Korlaar, F Yang, A Joly, ...
arXiv preprint arXiv:2402.08093, 2024
352024
CAMP: a Two-Stage Approach to Modelling Prosody in Context
Z Hodari, A Moinet, S Karlapati, J Lorenzo-Trueba, T Merritt, A Joly, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
342021
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech
S Karlapati, A Abbas, Z Hodari, A Moinet, A Joly, P Karanasou, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
212021
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody
P Makarov, A Abbas, M Łajszczak, A Joly, S Karlapati, A Moinet, ...
arXiv preprint arXiv:2206.14643, 2022
162022
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer
S Karlapati, P Karanasou, M Lajszczak, A Abbas, A Moinet, P Makarov, ...
arXiv preprint arXiv:2206.13443, 2022
142022
Expressive, Variable, and Controllable Duration Modelling in TTS
A Abbas, T Merritt, A Moinet, S Karlapati, E Muszynska, S Slangen, E Gatti, ...
arXiv preprint arXiv:2206.14165, 2022
112022
A learned conditional prior for the VAE acoustic space of a TTS system
P Karanasou, S Karlapati, A Moinet, A Joly, A Abbas, S Slangen, ...
92021
Predicting deformation mechanisms in architected metamaterials using GNN
PP Indurkar, S Karlapati, AJD Shaikeea, VS Deshpande
arXiv preprint arXiv:2202.09427, 2022
82022
Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments
A Mottini, J Lorenzo-Trueba, SVK Karlapati, T Drugman
arXiv preprint arXiv:2106.08873, 2021
72021
Hash based frequent pattern mining approach to text compression
C Oswald, S Srinidhi, KS Vishnu, TV Vishal, B Sivaselvan
First EAI International Conference on Computer Science and Engineering, 228-238, 2017
42017
Multi-scale spectrogram text-to-speech
SA Abbas, B Bollepalli, AP Moinet, TR Drugman, AVPY Joly, P Karanasou, ...
US Patent 11,694,674, 2023
32023
eCat: An end-to-end model for multi-speaker TTS & many-to-many fine-grained prosody transfer
A Abbas, S Karlapati, B Schnell, P Karanasou, MG Moya, A Nagaraj, ...
arXiv preprint arXiv:2306.11327, 2023
32023
Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials
I Grega, I Batatia, G Csányi, S Karlapati, VS Deshpande
arXiv preprint arXiv:2401.16914, 2024
22024
A Comparative Analysis of Pretrained Language Models for Text-to-Speech
MG Moya, P Karanasou, S Karlapati, B Schnell, N Peinelt, A Moinet, ...
12th Speech Synthesis Workshop (SSW) 2023, 2023
2*2023
Learned condition text-to-speech synthesis
P Karanasou, SVK Karlapati, AP Moinet, AVPY Joly, SA Abbas, ...
US Patent 11,830,476, 2023
12023
Synthetic speech processing
JL Trueba, ARM D'Oliveira, TR Drugman, SVK KARLAPATI
US Patent 11,735,156, 2023
12023
Synthetic speech processing
JL Trueba, ARM D'Oliveira, TR Drugman, SVK KARLAPATI
US Patent App. 18/305,456, 2023
12023
Synthetic speech processing by representing text by phonemes exhibiting predicted volume and pitch using neural networks
A Joly, S Slangen, AP Moinet, TR Drugman, P Karanasou, SA Abbas, ...
US Patent 11,978,431, 2024
2024
Synthetic speech processing
AVPY Joly, P Karanasou, APJ Moinet, TR Drugman, SVK Karlapati, ...
US Patent 11,574,624, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–20