Kevin Wilson
Kevin Wilson
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
CNN architectures for large-scale audio classification
S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ...
2017 ieee international conference on acoustics, speech and signal …, 2017
7562017
Learning the speech front-end with raw waveform CLDNNs
TN Sainath, RJ Weiss, A Senior, KW Wilson, O Vinyals
Sixteenth Annual Conference of the International Speech Communication …, 2015
3772015
Speech denoising using nonnegative matrix factorization with priors
KW Wilson, B Raj, P Smaragdis, A Divakaran
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
2892008
Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation
A Ephrat, I Mosseri, O Lang, T Dekel, K Wilson, A Hassidim, WT Freeman, ...
arXiv preprint arXiv:1804.03619, 2018
2052018
Speech acoustic modeling from raw multichannel waveforms
Y Hoshen, RJ Weiss, KW Wilson
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
1782015
Multichannel signal processing with deep neural networks for automatic speech recognition
TN Sainath, RJ Weiss, KW Wilson, B Li, A Narayanan, E Variani, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (5), 965-979, 2017
1152017
Regularized non-negative matrix factorization with temporal dependencies for speech denoising
KW Wilson, B Raj, P Smaragdis
Ninth Annual Conference of the International Speech Communication Association, 2008
1062008
Acoustic Modeling for Google Home.
B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ...
Interspeech, 399-403, 2017
1052017
Low latency video storyboard delivery with selectable resolution levels
NO Krahnstoever, KW Wilson
US Patent App. 13/785,913, 2014
972014
Multiple person and speaker activity tracking with a particle filter
N Checka, KW Wilson, MR Siracusa, T Darrell
2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004
972004
Visual speech recognition with loosely synchronized feature streams
K Saenko, K Livescu, M Siracusa, K Wilson, J Glass, T Darrell
Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 2 …, 2005
872005
Neural network adaptive beamforming for robust multichannel speech recognition
B Li, TN Sainath, RJ Weiss, KW Wilson, M Bacchiani
752016
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms
TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
692015
Indexing a recording of audiovisual content to enable rich navigation
SW Fu, B Keating, S Vedula, K Wilson, S Ahmad
US Patent App. 11/282,318, 2007
662007
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking
Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...
arXiv preprint arXiv:1810.04826, 2018
652018
Factored spatial and spectral multichannel raw waveform CLDNNs
TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
642016
Processing multi-channel audio waveforms
TN Sainath, RJ Weiss, KW Wilson, AW Senior, A Narayanan, Y Hoshen, ...
US Patent 9,697,826, 2017
622017
Learning a precedence effect-like weighting function for the generalized cross-correlation framework
KW Wilson, T Darrell
IEEE Transactions on audio, speech, and language processing 14 (6), 2156-2164, 2006
472006
A probabilistic framework for multi-modal multi-person tracking
N Checka, K Wilson, V Rangarajan, T Darrell
2003 Conference on Computer Vision and Pattern Recognition Workshop 9, 100-100, 2003
462003
A multi-modal approach for determining speaker location and focus
M Siracusa, LP Morency, K Wilson, J Fisher, T Darrell
Proceedings of the 5th international conference on Multimodal interfaces, 77-80, 2003
402003
The system can't perform the operation now. Try again later.
Articles 1–20