Shixiang (Shane) Gu
Shixiang (Shane) Gu
Research Scientist, Google Brain
Verified email at google.com - Homepage
TitleCited byYear
Categorical reparameterization with gumbel-softmax
E Jang, S Gu, B Poole
arXiv preprint arXiv:1611.01144, 2016
6352016
Continuous deep q-learning with model-based acceleration
S Gu, T Lillicrap, I Sutskever, S Levine
International Conference on Machine Learning, 2829-2838, 2016
3452016
Continuous deep q-learning with model-based acceleration
S Gu, T Lillicrap, I Sutskever, S Levine
International Conference on Machine Learning, 2829-2838, 2016
3452016
Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates
S Gu, E Holly, T Lillicrap, S Levine
2017 IEEE international conference on robotics and automation (ICRA), 3389-3396, 2017
3362017
Towards deep neural network architectures robust to adversarial examples
S Gu, L Rigazio
arXiv preprint arXiv:1412.5068, 2014
2852014
Q-prop: Sample-efficient policy gradient with an off-policy critic
S Gu, T Lillicrap, Z Ghahramani, RE Turner, S Levine
arXiv preprint arXiv:1611.02247, 2016
1432016
Neural adaptive sequential monte carlo
SS Gu, Z Ghahramani, RE Turner
Advances in Neural Information Processing Systems, 2629-2637, 2015
832015
Muprop: Unbiased backpropagation for stochastic neural networks
S Gu, S Levine, I Sutskever, A Mnih
arXiv preprint arXiv:1511.05176, 2015
722015
Data-efficient hierarchical reinforcement learning
O Nachum, SS Gu, H Lee, S Levine
Advances in Neural Information Processing Systems, 3303-3313, 2018
572018
Interpolated policy gradient: Merging on-policy and off-policy gradient estimation for deep reinforcement learning
SS Gu, T Lillicrap, RE Turner, Z Ghahramani, B Schölkopf, S Levine
Advances in neural information processing systems, 3846-3855, 2017
502017
Temporal difference models: Model-free deep rl for model-based control
V Pong, S Gu, M Dalal, S Levine
arXiv preprint arXiv:1802.09081, 2018
442018
Sequence tutor: Conservative fine-tuning of sequence generation models with kl-control
N Jaques, S Gu, D Bahdanau, JM Hernández-Lobato, RE Turner, D Eck
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
422017
Tuning recurrent neural networks with reinforcement learning
N Jaques, S Gu, RE Turner, D Eck
332017
Video for Eyetap Wearable Computers, FPGA-Based Seeing Aids, and Glasseyes (Eyetaps)
S Mann, RCB Lo, K Ovtcharov, S Gu, D Dai, C Ngan, T Ai, R HDR
2012 25th IEEE Canadian Conference on Electrical and Computer Engineering …, 0
30
The mirage of action-dependent baselines in reinforcement learning
G Tucker, S Bhupatiraju, S Gu, RE Turner, Z Ghahramani, S Levine
arXiv preprint arXiv:1802.10031, 2018
292018
Leave no trace: Learning to reset for safe and autonomous reinforcement learning
B Eysenbach, S Gu, J Ibarz, S Levine
arXiv preprint arXiv:1711.06782, 2017
142017
Particle gibbs for infinite hidden markov models
N Tripuraneni, SS Gu, H Ge, Z Ghahramani
Advances in Neural Information Processing Systems, 2395-2403, 2015
122015
Doubly reparameterized gradient estimators for Monte Carlo objectives
G Tucker, D Lawson, S Gu, CJ Maddison
arXiv preprint arXiv:1810.04152, 2018
112018
Categorical reparametrization with gumble-softmax
E Jang, S Gu, B Poole
International Conference on Learning Representations (ICLR 2017), 2017
112017
Near-optimal representation learning for hierarchical reinforcement learning
O Nachum, S Gu, H Lee, S Levine
arXiv preprint arXiv:1810.01257, 2018
102018
The system can't perform the operation now. Try again later.
Articles 1–20