Yi Zhang

Cited by

	All	Since 2019
Citations	6536	6016
h-index	23	23
i10-index	30	30

Citations per year: 2016: 37; 2017: 120; 2018: 341; 2019: 444; 2020: 555; 2021: 562; 2022: 534; 2023: 1898; 2024: 1993

Public access

7 articles / 0 articles (available / not available)

Based on funding mandates

Co-authors

Sanjeev Arora - Professor of Computer Science, Princeton University - Verified email at cs.princeton.edu
Sebastien Bubeck - VP GenAI Research, Microsoft AI - Verified email at microsoft.com
Ronen Eldan - Weizmann Institute - Verified email at weizmann.ac.il
Yin Tat Lee - Paul G. Allen School of Computer Science & Engineering, University of Washington - Verified email at uw.edu
Yuanzhi Li - Assistant Professor at CMU - Verified email at andrew.cmu.edu
Rong Ge - Duke University - Verified email at cs.duke.edu
Cyril Zhang - Microsoft Research NYC - Verified email at microsoft.com
Eric Horvitz - Microsoft - Verified email at microsoft.com
Hamid Palangi - Google and University of Washington - Verified email at google.com
Marco Tulio Ribeiro - Google DeepMind - Verified email at cs.washington.edu
Harsha Nori - Microsoft Research - Verified email at microsoft.com
Scott Lundberg - Google DeepMind - Verified email at google.com
Ece Kamar - Microsoft Research - Verified email at microsoft.com
Varun Chandrasekaran - University of Illinois Urbana-Champaign - Verified email at illinois.edu
Holden Lee - Assistant Professor of Applied Mathematics and Statistics, Johns Hopkins University - Verified email at jhu.edu
Elad Hazan - Professor at Princeton University and Director Google AI Princeton - Verified email at princeton.edu
Honglak Lee - LG AI Research / U. Michigan - Verified email at umich.edu
Yuting Zhang - Amazon Web Services - Verified email at amazon.com
Allison Del Giorno - Microsoft Research - Verified email at microsoft.com
Jyoti Aneja - Senior Researcher, Microsoft Research, Redmond - Verified email at microsoft.com

Yi Zhang

Senior Researcher at Microsoft Research Redmond

Verified email at microsoft.com

Machine Learning, Theory of Deep Learning


Title	Cited by	Year
Sparks of artificial general intelligence: Early experiments with gpt-4 - S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... - arXiv preprint arXiv:2303.12712, 2023	2474	2023
Generalization and Equilibrium in Generative Adversarial Nets (GANs) - S Arora, R Ge, Y Liang, T Ma, Y Zhang - arXiv preprint arXiv:1703.00573, 2017	794	2017
Stronger generalization bounds for deep nets via a compression approach - S Arora, R Ge, B Neyshabur, Y Zhang - International conference on machine learning, 254-263, 2018	678	2018
Convolutional neural networks with low-rank regularization - C Tai, T Xiao, Y Zhang, X Wang - arXiv preprint arXiv:1511.06067, 2015	530	2015
Deep visual analogy-making - SE Reed, Y Zhang, Y Zhang, H Lee - Advances in neural information processing systems 28, 2015	349	2015
Textbooks are all you need - S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ... - arXiv preprint arXiv:2306.11644, 2023	301	2023
Do GANs actually learn the distribution? An empirical study - S Arora, Y Zhang - arXiv:1706.08224, 2017	195	2017
Do GANs learn the distribution? some theory and empirics - S Arora, A Risteski, Y Zhang - International Conference on Learning Representations, 2018	175	2018
Phi-3 technical report: A highly capable language model locally on your phone - M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ... - arXiv preprint arXiv:2404.14219, 2024	121	2024
Spectral filtering for general linear dynamical systems - E Hazan, H Lee, K Singh, C Zhang, Y Zhang - Advances in Neural Information Processing Systems 31, 2018	100	2018
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets - R Kuditipudi, X Wang, H Lee, Y Zhang, Z Li, W Hu, S Arora, R Ge - arXiv:1906.06247, 2019	87	2019
Phi-2: The surprising power of small language models - M Javaheripi, S Bubeck, M Abdin, J Aneja, S Bubeck, CCT Mendes, ... - Microsoft Research Blog, 2023	82	2023
What makes convolutional models great on long sequence modeling? - Y Li, T Cai, Y Zhang, D Chen, D Dey - arXiv preprint arXiv:2210.09298, 2022	77	2022
Towards Understanding the Invertibility of Convolutional Neural Networks - CA Gilbert, Y Zhang, K Lee, Y Zhang, H Lee - arXiv preprint arXiv:1705.08664, 2017	75	2017
Efficient full-matrix adaptive regularization - N Agarwal, B Bullins, X Chen, E Hazan, K Singh, C Zhang, Y Zhang - International Conference on Machine Learning, 102-110, 2019	62	2019
Unveiling transformers with lego: a synthetic reasoning task - Y Zhang, A Backurs, S Bubeck, R Eldan, S Gunasekar, T Wagner - arXiv preprint arXiv:2206.04301, 2022	57	2022
Why are convolutional nets more sample-efficient than fully-connected nets? - Z Li, Y Zhang, S Arora - arXiv preprint arXiv:2010.08515, 2020	55	2020
Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality - Y Zhang, O Plevrakis, SS Du, X Li, Z Song, S Arora - arXiv:2002.06668, 2020	52	2020
Calibration, Entropy Rates, and Memory in Language Models - M Braverman, X Chen, SM Kakade, K Narasimhan, C Zhang, Y Zhang - arXiv preprint arXiv:1906.05664, 2019	36	2019
Learning threshold neurons via the "edge of stability" - K Ahn, S Bubeck, S Chewi, YT Lee, F Suarez, Y Zhang - arxiv.org/2212.07469, 2022	32	2022
