Antoine Miech

Citée par

	Toutes	Depuis 2019
Citations	5641	5548
indice h	16	16
indice i10	19	19

2300

1150

575

1725

201820192020202120222023202471 103 226 521 948 2261 1482

Accès public

Tout afficher

9 articles

2 articles

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Ivan LaptevVisiting professor at MBZUAI, on leave from INRIAAdresse e-mail validée de inria.fr
Josef SivicCzech Technical University, CIIRC, ELLIS Unit PragueAdresse e-mail validée de cvut.cz
Jean-Baptiste AlayracDeepMind, LondonAdresse e-mail validée de google.com
Cordelia SchmidResearch director INRIA Adresse e-mail validée de inria.fr
Andrew ZissermanUniversity of OxfordAdresse e-mail validée de robots.ox.ac.uk
Antoine YangGoogle DeepMindAdresse e-mail validée de google.com
Makarand TapaswiIIIT Hyderabad, Wadhwani AIAdresse e-mail validée de iiit.ac.in
Dimitri ZhukovTractableAdresse e-mail validée de tractable.ai
Lorenzo TorresaniMeta, Fundamental AI Research (FAIR)Adresse e-mail validée de meta.com
Heng WangTikTokAdresse e-mail validée de fb.com
Du TranGoogleAdresse e-mail validée de google.com
Piotr BojanowskiMeta AIAdresse e-mail validée de fb.com
Jeff DonahueResearch Scientist, DeepMindAdresse e-mail validée de google.com
Karen SimonyanChief Scientist, Microsoft AIAdresse e-mail validée de microsoft.com

Suivre

Antoine Miech

Google DeepMind

Adresse e-mail validée de google.com - Page d'accueil

Computer Vision


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Flamingo: a visual language model for few-shot learning JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ... Advances in neural information processing systems 35, 23716-23736, 2022	1920	2022
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips A Miech, D Zhukov, JB Alayrac, M Tapaswi, I Laptev, J Sivic Proceedings of the IEEE International Conference on Computer Vision, 2630-2640, 2019	1033	2019
End-to-end learning of visual representations from uncurated instructional videos A Miech, JB Alayrac, L Smaira, I Laptev, J Sivic, A Zisserman Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	693	2020
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	457	2023
Learnable pooling with context gating for video classification A Miech, I Laptev, J Sivic arXiv preprint arXiv:1706.06905, 2017	380	2017
Learning a text-video embedding from incomplete and heterogeneous data A Miech, I Laptev, J Sivic arXiv preprint arXiv:1804.02516, 2018	249	2018
Just ask: Learning to answer questions from millions of narrated videos A Yang, A Miech, J Sivic, I Laptev, C Schmid Proceedings of the IEEE/CVF international conference on computer vision …, 2021	237	2021
Zero-shot video question answering via frozen bidirectional language models A Yang, A Miech, J Sivic, I Laptev, C Schmid Advances in Neural Information Processing Systems 35, 124-141, 2022	137	2022
Thinking fast and slow: Efficient text-to-visual retrieval with transformers A Miech, JB Alayrac, I Laptev, J Sivic, A Zisserman Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	129	2021
Vid2seq: Large-scale pretraining of a visual language model for dense video captioning A Yang, A Nagrani, PH Seo, A Miech, J Pont-Tuset, I Laptev, J Sivic, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	95	2023
Tubedetr: Spatio-temporal video grounding with transformers A Yang, A Miech, J Sivic, I Laptev, C Schmid Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	76	2022
Leveraging the present to anticipate the future in videos A Miech, I Laptev, J Sivic, H Wang, L Torresani, D Tran Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	74	2019
Learning from video and text via large-scale discriminative clustering A Miech, JB Alayrac, P Bojanowski, I Laptev, J Sivic Proceedings of the IEEE international conference on computer vision, 5257-5266, 2017	49	2017
Learning to answer visual questions from web videos A Yang, A Miech, J Sivic, I Laptev, C Schmid arXiv preprint arXiv:2205.05019, 2022	22	2022
Look for the change: Learning object states and state-modifying actions from untrimmed web videos T Souček, JB Alayrac, A Miech, I Laptev, J Sivic Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	18	2022
Rareact: A video dataset of unusual interactions A Miech, JB Alayrac, I Laptev, J Sivic, A Zisserman arXiv preprint arXiv:2008.01018, 2020	18	2020
The end-of-end-to-end: A video understanding pentathlon challenge (2020) S Albanie, Y Liu, A Nagrani, A Miech, E Coto, I Laptev, R Sukthankar, ... arXiv preprint arXiv:2008.00744, 2020	14	2020
Zorro: the masked multimodal transformer A Recasens, J Lin, J Carreira, D Jaegle, L Wang, J Alayrac, P Luc, ... arXiv preprint arXiv:2301.09595, 2023	13	2023
Perception test: A diagnostic benchmark for multimodal video models V Patraucean, L Smaira, A Gupta, A Recasens, L Markeeva, D Banarse, ... Advances in Neural Information Processing Systems 36, 2024	11	2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	6	2024

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs