Olivier Pietquin
Olivier Pietquin
Google Brain (On leave of Professor at University of Lille - CRIStAL - SequeL team)
Verified email at univ-lille.fr - Homepage
Title
Cited by
Cited by
Year
Deep q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
517*2018
Noisy networks for exploration
M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ...
arXiv preprint arXiv:1706.10295, 2017
3892017
Guesswhat?! visual object discovery through multi-modal dialogue
H De Vries, F Strub, S Chandar, O Pietquin, H Larochelle, A Courville
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
2352017
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards
M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...
arXiv preprint arXiv:1707.08817, 2017
2292017
Modulating early visual processing by language
H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, A Courville
arXiv preprint arXiv:1707.00683, 2017
2182017
A probabilistic framework for dialog simulation and optimal strategy learning
O Pietquin, T Dutoit
IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 589-599, 2006
1892006
A framework for unsupervised learning of dialogue strategies
O Pietquin
Presses univ. de Louvain, 2005
1382005
Machine learning for spoken dialogue systems
O Lemon, O Pietquin
Eighth Annual Conference of the International Speech Communication Association, 2007
1252007
Listen and translate: A proof of concept for end-to-end speech-to-text translation
A Bérard, O Pietquin, C Servan, L Besacier
arXiv preprint arXiv:1612.01744, 2016
1182016
A survey on metrics for the evaluation of user simulations
O Pietquin, H Hastie
Knowledge Engineering Review 28 (1), 59-73, 2013
1072013
Sample-efficient batch reinforcement learning for dialogue management optimization
O Pietquin, M Geist, S Chandramohan, H Frezza-Buet
ACM Transactions on Speech and Language Processing (TSLP) 7 (3), 1-21, 2011
942011
End-to-end automatic speech translation of audiobooks
A Bérard, L Besacier, AC Kocabiyikoglu, O Pietquin
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
932018
Algorithmic Survey of Parametric Value Function Approximation
M Geist, O Pietquin
Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013
92*2013
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
902010
User simulation in dialogue systems using inverse reinforcement learning
S Chandramohan, M Geist, F Lefevre, O Pietquin
Twelfth Annual Conference of the International Speech Communication Association, 2011
862011
End-to-end optimization of goal-driven and visually grounded dialogue systems
F Strub, H De Vries, J Mary, B Piot, A Courville, O Pietquin
arXiv preprint arXiv:1703.05423, 2017
832017
Inverse reinforcement learning through structured classification
E Klein, M Geist, B Piot, O Pietquin
NIPS 2012, 1-9, 2012
812012
Data-driven methods for adaptive spoken dialogue systems: Computational learning for conversational interfaces
O Lemon, O Pietquin
Springer Science & Business Media, 2012
782012
Laugh-aware virtual agent and its impact on user amusement
R Niewiadomski, J Hofmann, J Urbain, T Platt, J Wagner, P Bilal, T Ito, ...
University of Zurich, 2013
632013
A theory of regularized markov decision processes
M Geist, B Scherrer, O Pietquin
International Conference on Machine Learning, 2160-2169, 2019
622019
The system can't perform the operation now. Try again later.
Articles 1–20