Rif A. Saurous
Verified email at google.com
Title
Cited by
Year
CNN architectures for large-scale audio classification
S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ...
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 604, Year: 2017
Tacotron: Towards end-to-end speech synthesis
Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ...
arXiv preprint arXiv:1703.10135, 2017
Cited by: 520*, Year: 2017
Natural TTS synthesis by conditioning WaveNet on mel spectrogram predictions
J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 431, Year: 2018
Fixing a broken ELBO
AA Alemi, B Poole, I Fischer, JV Dillon, RA Saurous, K Murphy
arXiv preprint arXiv:1711.00464, 2017
Cited by: 139, Year: 2017
Deep probabilistic programming
D Tran, MD Hoffman, RA Saurous, E Brevdo, K Murphy, DM Blei
arXiv preprint arXiv:1701.03757, 2017
Cited by: 131, Year: 2017
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis
Y Wang, D Stanton, Y Zhang, RJ Skerry-Ryan, E Battenberg, J Shor, ...
arXiv preprint arXiv:1803.09017, 2018
Cited by: 122, Year: 2018
Towards end-to-end prosody transfer for expressive speech synthesis with Tacotron
RJ Skerry-Ryan, E Battenberg, Y Xiao, Y Wang, D Stanton, J Shor, ...
arXiv preprint arXiv:1803.09047, 2018
Cited by: 121, Year: 2018
TensorFlow Distributions
JV Dillon, I Langmore, D Tran, E Brevdo, S Vasudevan, D Moore, B Patton, ...
arXiv preprint arXiv:1711.10604, 2017
Cited by: 78, Year: 2017
Unsupervised learning of semantic audio representations
A Jansen, M Plakal, R Pandya, DPW Ellis, S Hershey, J Liu, RC Moore, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 49, Year: 2018
VoiceFilter: Targeted voice separation by speaker-conditioned spectrogram masking
Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...
arXiv preprint arXiv:1810.04826, 2018
Cited by: 47, Year: 2018
Trainable frontend for robust and far-field keyword spotting
Y Wang, P Getreuer, T Hughes, RF Lyon, RA Saurous
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 38, Year: 2017
Scalable learning of non-decomposable objectives
EET Eban, M Schain, A Mackey, A Gordon, RA Saurous, G Elidan
arXiv preprint arXiv:1608.04802, 2016
Cited by: 38, Year: 2016
An information-theoretic analysis of deep latent-variable models
A Alemi, B Poole, I Fischer, J Dillon, RA Saurous, K Murphy
Cited by: 33, Year: 2018
Uncovering latent style factors for expressive speech synthesis
Y Wang, RJ Skerry-Ryan, Y Xiao, D Stanton, J Shor, E Battenberg, ...
arXiv preprint arXiv:1711.00520, 2017
Cited by: 23, Year: 2017
Simple, distributed, and accelerated probabilistic programming
D Tran, MW Hoffman, D Moore, C Suter, S Vasudevan, A Radul
Advances in Neural Information Processing Systems, 7598-7609, 2018
Cited by: 20, Year: 2018
Exploring tradeoffs in models for low-latency speech enhancement
K Wilson, M Chinen, J Thorpe, B Patton, J Hershey, RA Saurous, ...
2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC …, 2018
Cited by: 11, Year: 2018
Differentiable consistency constraints for improved deep speech enhancement
S Wisdom, JR Hershey, K Wilson, J Thorpe, M Chinen, B Patton, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 10, Year: 2019
On using backpropagation for speech texture generation and voice conversion
J Chorowski, RJ Weiss, RA Saurous, S Bengio
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 10, Year: 2018
Towards learning semantic audio representations from unlabeled data
A Jansen, M Plakal, R Pandya, DPW Ellis, S Hershey, J Liu, RC Moore, ...
signal 2 (3), 7-11, 2017
Cited by: 9, Year: 2017
Neumann optimizer: A practical optimization algorithm for deep neural networks
S Krishnan, Y Xiao, RA Saurous
arXiv preprint arXiv:1712.03298, 2017
Cited by: 8, Year: 2017