Sparks of artificial general intelligence: Early experiments with GPT-4 S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... arXiv preprint arXiv:2303.12712, 2023 | 2043 | 2023 |

The power of depth for feedforward neural networks R Eldan, O Shamir Conference on Learning Theory, 907-940, 2016 | 958 | 2016 |

Sparks of artificial general intelligence: Early experiments with GPT-4 S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ... arXiv preprint arXiv:2303.12712, 2023 | 222 | 2023 |

Textbooks are all you need S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ... arXiv preprint arXiv:2306.11644, 2023 | 207 | 2023 |

Kernel-based methods for bandit convex optimization S Bubeck, YT Lee, R Eldan Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing …, 2017 | 176* | 2017 |

Testing for high‐dimensional geometry in random graphs S Bubeck, J Ding, R Eldan, MZ Rácz Random Structures & Algorithms 49 (3), 503-532, 2016 | 150 | 2016 |

Sampling from a log-concave distribution with projected Langevin Monte Carlo S Bubeck, R Eldan, J Lehec Discrete & Computational Geometry 59, 757-783, 2018 | 146 | 2018 |

Thin shell implies spectral gap up to polylog via a stochastic localization scheme R Eldan Geometric and Functional Analysis 23 (2), 532-569, 2013 | 141 | 2013 |

Textbooks are all you need II: phi-1.5 technical report Y Li, S Bubeck, R Eldan, A Del Giorno, S Gunasekar, YT Lee arXiv preprint arXiv:2309.05463, 2023 | 136 | 2023 |

Sparks of artificial general intelligence: Early experiments with GPT-4 S Bubeck, V Chandrasekaran, R Eldan, J Gehrke, E Horvitz, E Kamar, ..., Y Zhang arXiv preprint arXiv:2303.12712, 2023 | 98 | 2023 |

TinyStories: How small can language models be and still speak coherent English? R Eldan, Y Li arXiv preprint arXiv:2305.07759, 2023 | 89 | 2023 |

Gaussian-width gradient complexity, reverse log-Sobolev inequalities and nonlinear large deviations R Eldan Geometric and Functional Analysis 28 (6), 1548-1596, 2018 | 84 | 2018 |

A two-sided estimate for the Gaussian noise stability deficit R Eldan Inventiones mathematicae 201, 561-624, 2015 | 83 | 2015 |

Multi-scale exploration of convex functions and bandit convex optimization S Bubeck, R Eldan Conference on Learning Theory, 583-589, 2016 | 82 | 2016 |

Approximately Gaussian marginals and the hyperplane conjecture R Eldan, B Klartag Concentration, functional inequalities and isoperimetry 545, 55-68, 2011 | 67 | 2011 |

The entropic barrier: a simple and optimal universal self-concordant barrier S Bubeck, R Eldan arXiv preprint arXiv:1412.1587, 2014 | 63 | 2014 |

Localization schemes: A framework for proving mixing bounds for Markov chains Y Chen, R Eldan 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS …, 2022 | 58 | 2022 |

A spectral condition for spectral gap: fast mixing in high-temperature Ising models R Eldan, F Koehler, O Zeitouni Probability Theory and Related Fields 182 (3), 1035-1051, 2022 | 52 | 2022 |

Network size and size of the weights in memorization with two-layers neural networks S Bubeck, R Eldan, YT Lee, D Mikulincer Advances in Neural Information Processing Systems 33, 4977-4986, 2020 | 51 | 2020 |

Unveiling transformers with LEGO: a synthetic reasoning task Y Zhang, A Backurs, S Bubeck, R Eldan, S Gunasekar, T Wagner arXiv preprint arXiv:2206.04301, 2022 | 50 | 2022 |