Yuhuai (Tony) Wu
Google / Stanford
Verified email at stanford.edu
Title
Cited by
Year
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
Cited by: 1819*; Year: 2019
OpenAI Baselines
P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...
Cited by: 1226*; Year: 2017
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Y Wu, E Mansimov, RB Grosse, S Liao, J Ba
Advances in Neural Information Processing Systems, 5283-5292, 2017
Cited by: 517; Year: 2017
On the quantitative analysis of decoder-based generative models
Y Wu, Y Burda, R Salakhutdinov, R Grosse
5th International Conference on Learning Representations (ICLR 2017), 2016
Cited by: 216; Year: 2016
Backpropagation through the void: Optimizing control variates for black-box gradient estimation
W Grathwohl, D Choi, Y Wu, G Roeder, D Duvenaud
ICLR 2018, 2017
Cited by: 210; Year: 2017
Sticking the landing: Simple, lower-variance gradient estimators for variational inference
G Roeder, Y Wu, D Duvenaud
arXiv preprint arXiv:1703.09194, 2017
Cited by: 163*; Year: 2017
Architectural complexity measures of recurrent neural networks
S Zhang, Y Wu, T Che, Z Lin, R Memisevic, RR Salakhutdinov, Y Bengio
Advances in neural information processing systems 29, 1822-1830, 2016
Cited by: 154; Year: 2016
On multiplicative integration with recurrent neural networks
Y Wu, S Zhang, Y Zhang, Y Bengio, R Salakhutdinov
arXiv preprint arXiv:1606.06630, 2016
Cited by: 151; Year: 2016
STDP-compatible approximation of backpropagation in an energy-based model
Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu
Neural computation 29 (3), 555-577, 2017
Cited by: 131*; Year: 2017
The Importance of Sampling in Meta-Reinforcement Learning
B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ...
Advances in Neural Information Processing Systems, 9299-9309, 2018
Cited by: 99*; Year: 2018
Understanding Short-Horizon Bias in Stochastic Meta-Optimization
Y Wu, M Ren, R Liao, RB Grosse
Sixth International Conference on Learning Representations (ICLR 2018), 2018
Cited by: 74; Year: 2018
On the opportunities and risks of foundation models
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2021
Cited by: 40; Year: 2021
Path-normalized optimization of recurrent neural networks with relu activations
Y Wu, B Neyshabur, RR Salakhutdinov, N Srebro
Advances in Neural Information Processing Systems, 3477-3485, 2016
Cited by: 29; Year: 2016
IsarStep: a Benchmark for High-level Mathematical Reasoning
W Li, L Yu, Y Wu, LC Paulson
ICLR 2021, 2021
Cited by: 16*; Year: 2021
OPtions as REsponses: Grounding Behavioural Hierarchies in Multi-Agent Reinforcement Learning
Y Wu, A Vezhnevets, M Eckstein, R Leblond, JZ Leibo
ICML 2020, 2020
Cited by: 16*; Year: 2020
Concurrent Meta Reinforcement Learning
E Parisotto, S Ghosh, SB Yalamanchi, V Chinnaobireddy, Y Wu, ...
arXiv preprint arXiv:1903.02710, 2019
Cited by: 15*; Year: 2019
ACTRCE: Augmenting Experience via Teacher’s Advice
Y Wu, H Chan, J Kiros, S Fidler, J Ba
Cited by: 13*; Year: 2018
INT: An inequality benchmark for evaluating generalization in theorem proving
Y Wu, AQ Jiang, J Ba, R Grosse
ICLR 2021, 2021
Cited by: 12; Year: 2021
The scattering compositional learner: Discovering objects, attributes, relationships in analogical reasoning
Y Wu, H Dong, R Grosse, J Ba
arXiv preprint arXiv:2007.04212, 2020
Cited by: 11; Year: 2020
Nonlinear invariant risk minimization: A causal approach
C Lu, Y Wu, JM Hernández-Lobato, B Schölkopf
arXiv preprint arXiv:2102.12353, 2021
Cited by: 9; Year: 2021