Follow
Rui Zhao
Rui Zhao
Tencent AI Lab & Robotics X
Verified email at tencent.com - Homepage
Title
Cited by
Cited by
Year
Two-stream RNN/CNN for action recognition in 3D videos
R Zhao, H Ali, P Van der Smagt
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2017
1332017
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
R Zhao, X Sun, V Tresp
2019 International Conference on Machine Learning (ICML), 2019
972019
Energy-based hindsight experience prioritization
R Zhao, V Tresp
2018 Conference on Robot Learning (CoRL) (Oral), 2018
812018
Curiosity-Driven Experience Prioritization via Density Estimation
R Zhao, V Tresp
2018 NeurIPS (NIPS) Deep Reinforcement Learning Workshop, 2019
692019
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
R Zhao, J Song, Y Yuan, H Haifeng, Y Gao, Y Wu, Z Sun, W Yang
The Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23), 2023
622023
Learning goal-oriented visual dialog via tempered policy gradient
R Zhao, V Tresp
2018 IEEE Spoken Language Technology (SLT), 868-875, 2018
352018
Mutual Information State Intrinsic Control
R Zhao, Y Gao, P Abbeel, V Tresp, W Xu
2021 International Conference on Learning Representations (ICLR) (Spotlight), 2021
322021
Lifelike agility and play in quadrupedal robots using reinforcement learning and generative pre-trained models
L Han, Q Zhu, J Sheng, C Zhang, T Li, Y Zhang, H Zhang, Y Liu, C Zhou, ...
Nature Machine Intelligence 6 (7), 787-798, 2024
202024
Semantics for Global and Local Interpretation of Deep Convolutional Neural Networks
J Gu, R Zhao, V Tresp
2021 International Joint Conference on Neural Networks (IJCNN), 2021
17*2021
Addressing hindsight bias in multigoal reinforcement learning
C Bai, L Wang, Y Wang, Z Wang, R Zhao, C Bai, P Liu
IEEE Transactions on Cybernetics 53 (1), 392-405, 2021
152021
Efficient dialog policy learning via positive memory retention
R Zhao, V Tresp
2018 IEEE Spoken Language Technology (SLT), 823-830, 2018
9*2018
Learning Highly Dynamic Behaviors for Quadrupedal Robots
C Zhang, J Sheng, T Li, H Zhang, C Zhou, Q Zhu, R Zhao, Y Zhang, L Han
arXiv preprint arXiv:2402.13473, 2024
52024
RECCraft system: Towards reliable and efficient collective robotic construction
Q Xu, Y Zhang, S Zhang, R Zhao, Z Wu, D Zhang, C Zhou, X Li, J Chen, ...
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
42022
CraftEnv: A Flexible Collective Robotic Construction Environment for Multi-Agent Reinforcement Learning
R Zhao, X Liu, Y Zhang, M Li, C Zhou, S Li, L Han
Proceedings of the 2023 International Conference on Autonomous Agents and …, 2023
32023
Auto-encoding adversarial imitation learning
K Zhang, R Zhao, Z Zhang, Y Gao
arXiv preprint arXiv:2206.11004, 2022
22022
Learning Individualized Treatment Rules with Estimated Translated Inverse Propensity Score
Z Wu, Y Yang, Y Ma, Y Liu, R Zhao, M Moor, V Tresp
2020 IEEE International Conference on Healthcare Informatics (ICHI) (Best Paper), 2020
22020
MeGraph: capturing long-range interactions by alternating local and hierarchical aggregation on multi-scaled graph hierarchy
H Dong, J Xu, Y Yang, R Zhao, S Wu, C Yuan, X Li, CJ Maddison, L Han
Advances in Neural Information Processing Systems 36, 63609-63641, 2023
12023
Maximum entropy regularised multi-goal reinforcement learning
V Tresp, R Zhao
US Patent App. 16/385,209, 2020
12020
Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning
C Chen, J Xu, W Liao, H Ding, Z Zhang, Y Yu, R Zhao
Proceedings of the AAAI Conference on Artificial Intelligence 38 (10), 11240 …, 2024
2024
Deep reinforcement learning in robotics and dialog systems
R Zhao
Ludwig Maximilian University of Munich, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–20