Metadrive: Composing diverse driving scenarios for generalizable reinforcement learning Q Li, Z Peng, L Feng, Q Zhang, Z Xue, B Zhou IEEE transactions on pattern analysis and machine intelligence 45 (3), 3461-3475, 2022 | 143 | 2022 |
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning XH Liu, Z Xue, J Pang, S Jiang, F Xu, Y Yu Advances in Neural Information Processing Systems 34, 17604-17615, 2021 | 32 | 2021 |
Two-Stage Constrained Actor-Critic for Short Video Recommendation Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ... The Web Conference 2023 Research Track, 2023 | 19 | 2023 |
PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, B An Proceedings of the 29th ACM SIGKDD International Conference on Knowledge …, 2023 | 17* | 2023 |
A Large Language Model Enhanced Conversational Recommender System Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun arXiv preprint arXiv:2308.06212, 2023 | 12 | 2023 |
State Regularized Policy Optimization on Data with Dynamics Shift Z Xue, Q Cai, S Liu, D Zheng, P Jiang, K Gai, B An Advances in Neural Information Processing Systems 36, 32926--32937, 2023 | 5 | 2023 |
Guarded Policy Optimization with Imperfect Online Demonstrations Z Xue, Z Peng, Q Li, Z Liu, B Zhou The Eleventh International Conference on Learning Representations, 2023 | 3 | 2023 |
AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement Z Xue, Q Cai, T Zuo, B Yang, L Hu, P Jiang, K Gai, B An arXiv preprint arXiv:2310.03984, 2023 | 2 | 2023 |
AgentStudio: A Toolkit for Building General Virtual Agents L Zheng, Z Huang, Z Xue, X Wang, B An, S Yan arXiv preprint arXiv:2403.17918, 2024 | | 2024 |
: Energy-Based Reinforcement Learning with Stein Soft Actor Critc S Messaoud, B Mokeddem, Z Xue, B An, H Chen, S Chawla The Twelfth International Conference on Learning Representations, 2024 | | 2024 |