Parametrized deep q-networks learning: Reinforcement learning with discrete-continuous hybrid action space J Xiong, Q Wang, Z Yang, P Sun, L Han, Y Zheng, H Fu, T Zhang, J Liu, ... arXiv preprint arXiv:1810.06394, 2018 | 217 | 2018 |
Finding robust solutions to dynamic optimization problems H Fu, B Sendhoff, K Tang, X Yao Applications of Evolutionary Computation: 16th European Conference …, 2013 | 64 | 2013 |
Robust optimization over time: Problem difficulties and benchmark problems H Fu, B Sendhoff, K Tang, X Yao IEEE Transactions on Evolutionary Computation 19 (5), 731-745, 2014 | 58 | 2014 |
What are dynamic optimization problems? H Fu, PR Lewis, B Sendhoff, K Tang, X Yao 2014 IEEE congress on evolutionary computation (CEC), 1550-1557, 2014 | 38 | 2014 |
Characterizing environmental changes in robust optimization over time H Fu, B Sendhoff, K Tang, X Yao 2012 IEEE Congress on Evolutionary Computation, 1-8, 2012 | 29 | 2012 |
Actor-critic policy optimization in a large-scale imperfect-information game H Fu, W Liu, S Wu, Y Wang, T Yang, K Li, J Xing, B Li, B Ma, Q Fu, Y Wei International Conference on Learning Representations, 2021 | 28 | 2021 |
L2E: Learning to exploit your opponent Z Wu, K Li, H Xu, Y Zang, B An, J Xing 2022 International Joint Conference on Neural Networks (IJCNN), 1-8, 2022 | 25 | 2022 |
Find robust solutions over time by two-layer multi-objective optimization method Y Guo, M Chen, H Fu, Y Liu 2014 IEEE Congress on Evolutionary Computation (CEC), 1528-1535, 2014 | 24 | 2014 |
Memetic algorithm with heuristic candidate list strategy for capacitated arc routing problem H Fu, Y Mei, K Tang, Y Zhu IEEE Congress on Evolutionary Computation, 1-8, 2010 | 17 | 2010 |
Quality-similar diversity via population based reinforcement learning S Wu, J Yao, H Fu, Y Tian, C Qian, Y Yang, Q Fu, Y Wei The Eleventh International Conference on Learning Representations, 2023 | 15 | 2023 |
Heterogeneous multi-agent zero-shot coordination by coevolution K Xue, Y Wang, C Guan, L Yuan, H Fu, Q Fu, C Qian, Y Yu arXiv preprint arXiv:2208.04957, 2022 | 13 | 2022 |
Curriculum-based co-design of morphology and control of voxel-based soft robots Y Wang, S Wu, H Fu, Q Fu, T Zhang, Y Chang, X Wang The Eleventh International Conference on Learning Representations, 2023 | 9 | 2023 |
Greedy when sure and conservative when uncertain about the opponents H Fu, Y Tian, H Yu, W Liu, S Wu, J Xiong, Y Wen, K Li, J Xing, Q Fu, ... International Conference on Machine Learning, 6829-6848, 2022 | 9 | 2022 |
Automatic grouping for efficient cooperative multi-agent reinforcement learning Y Zang, J He, K Li, H Fu, Q Fu, J Xing, J Cheng Advances in Neural Information Processing Systems 36, 2024 | 8 | 2024 |
Policy space diversity for non-transitive games J Yao, W Liu, H Fu, Y Yang, S McAleer, Q Fu, W Yang Advances in Neural Information Processing Systems 36, 2024 | 8 | 2024 |
Enhance reasoning for large language models in the game werewolf S Wu, L Zhu, T Yang, S Xu, Q Fu, Y Wei, H Fu arXiv preprint arXiv:2402.02330, 2024 | 8 | 2024 |
Autocfr: Learning to design counterfactual regret minimization algorithms H Xu, K Li, H Fu, Q Fu, J Xing Proceedings of the AAAI Conference on Artificial Intelligence 36 (5), 5244-5251, 2022 | 8 | 2022 |
PreCo: Enhancing Generalization in Co-Design of Modular Soft Robots via Brain-Body Pre-Training Y Wang, S Wu, T Zhang, Y Chang, H Fu, Q Fu, X Wang Conference on Robot Learning, 478-498, 2023 | 5 | 2023 |
Combining Tree Search and Action Prediction for State-of-the-Art Performance in DouDiZhu. Y Zhang, D Yan, B Shi, H Fu, Q Fu, H Su, J Zhu, N Chen IJCAI, 3413-3419, 2021 | 5 | 2021 |
Dynamic discounted counterfactual regret minimization H Xu, K Li, H Fu, Q Fu, J Xing, J Cheng The Twelfth International Conference on Learning Representations, 2024 | 4 | 2024 |