Zheng Wen

Cited by

	All	Since 2019
Citations	5222	4404
h-index	31	30
i10-index	55	51

Citations per year
Year	Citations
2014	28
2015	68
2016	146
2017	184
2018	330
2019	502
2020	738
2021	847
2022	986
2023	907
2024	421

Public access

8 articles available, 0 articles not available (based on funding mandates)

Co-authors

Branislav Kveton (Amazon). Verified email at amazon.com
Benjamin Van Roy (Stanford University). Verified email at stanford.edu
Ian Osband (OpenAI). Verified email at openai.com
Csaba Szepesvari (DeepMind & University of Alberta). Verified email at cs.ualberta.ca
Azin Ashkan (Google). Verified email at uwaterloo.ca
Xiuyuan Lu (Google DeepMind). Verified email at google.com
Yasin Abbasi Yadkori (Google DeepMind). Verified email at google.com
Vikranth Dwaracherla (DeepMind). Verified email at google.com
Morteza Ibrahimi (Stanford University). Verified email at stanford.edu
Mohammad Ghavamzadeh (Amazon). Verified email at amazon.com
Sharan Vaswani (Simon Fraser University). Verified email at sfu.ca
Daniel Russo (Columbia University). Verified email at gsb.columbia.edu
Botao Hao (Deepmind). Verified email at google.com
Michal Valko (Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind). Verified email at meta.com
Seyed Mohammad Asghari (Research Engineer, DeepMind). Verified email at google.com
Brian Eriksson (Adobe). Verified email at adobe.com
S Muthukrishnan (Rutgers Univ). Verified email at cs.rutgers.edu
Sumeet Katariya (Amazon). Verified email at wisc.edu
Shlomo Berkovsky (Macquarie University). Verified email at mq.edu.au
Abbas Kazerouni (Stanford University). Verified email at stanford.edu

Zheng Wen

Google DeepMind

Verified email at google.com

Artificial Intelligence, Reinforcement Learning, Operations Research, Large Language Models


Title	Authors	Venue	Cited by	Year
A Tutorial on Thompson Sampling	D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen	arXiv, https://arxiv.org/pdf/1707.02038.pdf	1067*	
Generalization and exploration via randomized value functions	I Osband, B Van Roy, Z Wen	International Conference on Machine Learning, 2377-2386, 2016	329	2016
Deep exploration via randomized value functions	I Osband, B Van Roy, DJ Russo, Z Wen	Journal of Machine Learning Research 20 (124), 1-62, 2019	323	2019
Cascading bandits: Learning to rank in the cascade model	B Kveton, C Szepesvári, Z Wen, A Ashkan	ICML, 2015	311	2015
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits	B Kveton, Z Wen, A Ashkan, C Szepesvari	International Conference on Artificial Intelligence and Statistics (AISTATS …, 2014	308	2014
Optimal demand response using device based reinforcement learning	Z Wen, D O'Neill, HR Maei	IEEE Transactions on Smart Grid, 2014	304	2014
Online influence maximization under independent cascade model with semi-bandit feedback	Z Wen, B Kveton, M Valko, S Vaswani	Advances in neural information processing systems 30, 2017	144*	2017
Nearly optimal adaptive procedure with change detection for piecewise-stationary bandit	Y Cao, Z Wen, B Kveton, Y Xie	The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	132*	2019
Cascading bandits for large-scale recommendation problems	S Zong, H Ni, K Sung, NR Ke, Z Wen, B Kveton	arXiv preprint arXiv:1603.05359, 2016	126	2016
Matroid bandits: Fast combinatorial optimization with learning	B Kveton, Z Wen, A Ashkan, H Eydgahi, B Eriksson	UAI 2014, 2014	126	2014
Combinatorial cascading bandits	B Kveton, Z Wen, A Ashkan, C Szepesvari	Advances in Neural Information Processing Systems 28, 2015	125	2015
Efficient learning in large-scale combinatorial semi-bandits	Z Wen, B Kveton, A Ashkan	http://jmlr.org/proceedings/papers/v37/wen15.html, 2014	108	2014
Optimal Greedy Diversity for Recommendation	A Ashkan, B Kveton, S Berkovsky, Z Wen		107	2015
Online learning to rank in stochastic click models	M Zoghi, T Tunys, M Ghavamzadeh, B Kveton, C Szepesvari, Z Wen	International conference on machine learning, 4199-4208, 2017	106	2017
DCM Bandits: Learning to Rank with Multiple Clicks	S Katariya, B Kveton, C Szepesvári, Z Wen	arXiv, 2016	88	2016
Efficient Exploration and Value Function Generalization in Deterministic Systems	Z Wen, B Van Roy	Advances in Neural Information Processing Systems, 3021-3029, 2013	86	2013
Epistemic neural networks	I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...	Advances in Neural Information Processing Systems 36, 2024	82	2024
Model-independent online learning for influence maximization	S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, LVS Lakshmanan, ...	International conference on machine learning, 3530-3539, 2017	82*	2017
Stochastic rank-1 bandits	S Katariya, B Kveton, C Szepesvari, C Vernade, Z Wen	Artificial Intelligence and Statistics, 392-401, 2017	74	2017
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits	B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh	International Conference on Machine Learning, 3601-3610, 2019	72	2019
