Suivre
Antoine Miech
Antoine Miech
DeepMind
Adresse e-mail validée de google.com - Page d'accueil
Titre
Citée par
Citée par
Année
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
A Miech, D Zhukov, JB Alayrac, M Tapaswi, I Laptev, J Sivic
Proceedings of the IEEE International Conference on Computer Vision, 2630-2640, 2019
3492019
Learnable pooling with context gating for video classification
A Miech, I Laptev, J Sivic
arXiv preprint arXiv:1706.06905, 2017
2862017
End-to-end learning of visual representations from uncurated instructional videos
A Miech, JB Alayrac, L Smaira, I Laptev, J Sivic, A Zisserman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
2712020
Learning a text-video embedding from incomplete and heterogeneous data
A Miech, I Laptev, J Sivic
arXiv preprint arXiv:1804.02516, 2018
1322018
Leveraging the present to anticipate the future in videos
A Miech, I Laptev, J Sivic, H Wang, L Torresani, D Tran
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
462019
Learning from video and text via large-scale discriminative clustering
A Miech, JB Alayrac, P Bojanowski, I Laptev, J Sivic
Proceedings of the IEEE international conference on computer vision, 5257-5266, 2017
432017
Thinking fast and slow: Efficient text-to-visual retrieval with transformers
A Miech, JB Alayrac, I Laptev, J Sivic, A Zisserman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
362021
Just ask: Learning to answer questions from millions of narrated videos
A Yang, A Miech, J Sivic, I Laptev, C Schmid
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
212021
RareAct: A video dataset of unusual interactions
A Miech, JB Alayrac, I Laptev, J Sivic, A Zisserman
arXiv preprint arXiv:2008.01018, 2020
62020
The end-of-end-to-end: A video understanding pentathlon challenge (2020)
S Albanie, Y Liu, A Nagrani, A Miech, E Coto, I Laptev, R Sukthankar, ...
arXiv preprint arXiv:2008.00744, 2020
52020
Flamingo: a visual language model for few-shot learning
JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ...
arXiv preprint arXiv:2204.14198, 2022
22022
TubeDETR: Spatio-Temporal Video Grounding with Transformers
A Yang, A Miech, J Sivic, I Laptev, C Schmid
arXiv preprint arXiv:2203.16434, 2022
12022
Learning to Answer Visual Questions from Web Videos
A Yang, A Miech, J Sivic, I Laptev, C Schmid
arXiv preprint arXiv:2205.05019, 2022
2022
Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
T Souček, JB Alayrac, A Miech, I Laptev, J Sivic
arXiv preprint arXiv:2203.11637, 2022
2022
Large-scale Learning from Video and Natural Language
A Miech
PSL Research University, 2020
2020
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–15