Yuhuai(Tony) Wu

Cited by

	All	Since 2019
Citations	15266	14560
h-index	34	33
i10-index	45	45

6000

3000

1500

4500

20162017201820192020202120222023202440 148 475 816 1585 1988 2748 5139 2239

Public access

View all

17 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Roger GrosseAssociate Professor, University of TorontoVerified email at cs.toronto.edu
Jimmy BaUniversity of TorontoVerified email at cs.toronto.edu
Christian SzegedyResearcherVerified email at szegedy.org
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Ruslan SalakhutdinovUPMC Professor, Machine Learning Department, CMUVerified email at cs.cmu.edu
Behnam NeyshaburSenior Staff Research Scientist, DeepMindVerified email at google.com
David DuvenaudAssociate Professor, University of TorontoVerified email at cs.toronto.edu
Pieter AbbeelUC Berkeley | CovariantVerified email at cs.berkeley.edu
Albert Q. JiangUniversity of Cambridge | Mistral AIVerified email at mistral.ai
Percy LiangAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Saizheng Zhang
Oriol VinyalsResearch Scientist at Google DeepMindVerified email at google.com

Yuhuai(Tony) Wu

Co-Founder of xAI

Verified email at x.ai - Homepage

Machine Learning Machine Reasoning Theorem Proving


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Grandmaster level in StarCraft II using multi-agent reinforcement learning O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ... Nature 575 (7782), 350-354, 2019	4617*	2019
On the opportunities and risks of foundation models R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ... arXiv preprint arXiv:2108.07258, 2021	2751	2021
Openai baselines P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...	1834*	2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation Y Wu, E Mansimov, RB Grosse, S Liao, J Ba Advances in Neural Information Processing Systems, 5283-5292, 2017	789	2017
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	781	2023
Holistic evaluation of language models P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ... arXiv preprint arXiv:2211.09110, 2022	605	2022
Solving quantitative reasoning problems with language models A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ... Advances in Neural Information Processing Systems 35, 3843-3857, 2022	424	2022
Backpropagation through the void: Optimizing control variates for black-box gradient estimation W Grathwohl, D Choi, Y Wu, G Roeder, D Duvenaud ICLR2018, 2017	312	2017
STaR: Bootstrapping reasoning with reasoning E Zelikman, Y Wu, ND Goodman arXiv preprint arXiv:2203.14465, 2022	273*	2022
On the quantitative analysis of decoder-based generative models Y Wu, Y Burda, R Salakhutdinov, R Grosse 5th International Conference on Learning Representations (ICLR 2017), 2016	266	2016
Sticking the landing: Simple, lower-variance gradient estimators for variational inference G Roeder, Y Wu, DK Duvenaud Advances in Neural Information Processing Systems 30, 2017	257*	2017
Architectural complexity measures of recurrent neural networks S Zhang, Y Wu, T Che, Z Lin, R Memisevic, RR Salakhutdinov, Y Bengio Advances in neural information processing systems 29, 2016	190	2016
STDP-compatible approximation of backpropagation in an energy-based model Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu Neural computation 29 (3), 555-577, 2017	182*	2017
On multiplicative integration with recurrent neural networks Y Wu, S Zhang, Y Zhang, Y Bengio, RR Salakhutdinov Advances in neural information processing systems 29, 2016	179	2016
Memorizing Transformers Y Wu, MN Rabe, DL Hutchins, C Szegedy International Conference on Learning Representations 2022, 2022	164	2022
The Importance of Sampling in Meta-Reinforcement Learning B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ... Advances in Neural Information Processing Systems, 9299-9309, 2018	164*	2018
Understanding Short-Horizon Bias in Stochastic Meta-Optimization Y Wu, M Ren, R Liao, RB Grosse Sixth International Conference on Learning Representations (ICLR 2018), 2018	132	2018
Invariant Causal Representation Learning for Out-of-Distribution Generalization C Lu, Y Wu, JM Hernández-Lobato, B Schölkopf International Conference on Learning Representations, 2022	121*	2022
Exploring length generalization in large language models C Anil, Y Wu, A Andreassen, A Lewkowycz, V Misra, V Ramasesh, ... Advances in Neural Information Processing Systems 35, 38546-38556, 2022	104	2022
Autoformalization with large language models Y Wu, AQ Jiang, W Li, M Rabe, C Staats, M Jamnik, C Szegedy Advances in Neural Information Processing Systems 35, 32353-32368, 2022	95	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors