Follow
Neil Zeghidour
Neil Zeghidour
Kyutai
Verified email at kyutai.org
Title
Cited by
Cited by
Year
Fader networks: Manipulating images by sliding attributes
G Lample, N Zeghidour, N Usunier, A Bordes, L Denoyer, MA Ranzato
Advances in Neural Information Processing Systems, 2017
5762017
Wavesplit: End-to-end speech separation by speaker clustering
N Zeghidour, D Grangier
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2840-2849, 2021
2342021
Soundstream: An end-to-end neural audio codec
N Zeghidour, A Luebs, A Omran, J Skoglund, M Tagliasacchi
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 495-507, 2021
2152021
Contrastive learning of general-purpose audio representations
A Saeed, D Grangier, N Zeghidour
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2052021
Musiclm: Generating music from text
A Agostinelli, TI Denk, Z Borsos, J Engel, M Verzetti, A Caillon, Q Huang, ...
arXiv preprint arXiv:2301.11325, 2023
1762023
Audiolm: a language modeling approach to audio generation
Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
1642023
Learning Filterbanks from Raw Speech for Phone Recognition
N Zeghidour, N Usunier, I Kokkinos, T Schatz, G Synnaeve, E Dupoux
ICASSP 2018, 2017
1202017
LEAF: A Learnable Frontend for Audio Classification
N Zeghidour, O Teboul, FC Quitry, M Tagliasacchi
ICLR 2021, 2021
1182021
End-to-end speech recognition from the raw waveform
N Zeghidour, N Usunier, G Synnaeve, R Collobert, E Dupoux
Interspeech 2018, 2018
1072018
Fully convolutional speech recognition
N Zeghidour, Q Xu, V Liptchinsky, N Usunier, G Synnaeve, R Collobert
arXiv preprint arXiv:1812.06864, 2018
1042018
Sing: Symbol-to-instrument neural generator
A Défossez, N Zeghidour, N Usunier, L Bottou, F Bach
Advances in Neural Information Processing Systems, 2018
712018
Joint learning of speaker and phonetic similarities with siamese networks.
N Zeghidour, G Synnaeve, N Usunier, E Dupoux
INTERSPEECH, 1295-1299, 2016
632016
A Deep Scattering Spectrum - Deep Siamese network Pipeline For Unsupervised Acoustic Modeling
N Zeghidour, G Synnaeve, M Versteegh, E Dupoux
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
482016
General-purpose, long-context autoregressive modeling with perceiver ar
C Hawthorne, A Jaegle, C Cangea, S Borgeaud, C Nash, M Malinowski, ...
International Conference on Machine Learning, 8535-8558, 2022
452022
To reverse the gradient or not: An empirical comparison of adversarial and multi-task learning in speech recognition
Y Adi, N Zeghidour, R Collobert, N Usunier, V Liptchinsky, G Synnaeve
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
412019
Speak, read and prompt: High-fidelity text-to-speech with minimal supervision
E Kharitonov, D Vincent, Z Borsos, R Marinier, S Girgin, O Pietquin, ...
arXiv preprint arXiv:2302.03540, 2023
372023
AudioPaLM: A Large Language Model That Can Speak and Listen
PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ...
arXiv preprint arXiv:2306.12925, 2023
322023
Learning strides in convolutional neural networks
R Riad, O Teboul, D Grangier, N Zeghidour
arXiv preprint arXiv:2202.01653, 2022
322022
Deep multi-class learning from label proportions
G Dulac-Arnold, N Zeghidour, M Cuturi, L Beyer, JP Vert
arXiv preprint arXiv:1905.12909, 2019
312019
Learning to detect dysarthria from raw speech
J Millet, N Zeghidour
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
292019
The system can't perform the operation now. Try again later.
Articles 1–20