Baoxiang Wang
Assistant Professor, The Chinese University of Hong Kong Shenzhen
Verified email at cse.cuhk.edu.hk
Title
Cited by
Year
Contextual combinatorial cascading bandits
S Li, B Wang, S Zhang, W Chen
International conference on machine learning, 1245-1253, 2016
Cited by 139, year 2016
Privacy-preserving q-learning with functional noise in continuous spaces
B Wang, N Hegde
Advances in Neural Information Processing Systems 32, 2019
Cited by 55, year 2019
Paid: Prioritizing app issues for developers by tracking user reviews over versions
C Gao, B Wang, P He, J Zhu, Y Zhou, MR Lyu
2015 IEEE 26th international symposium on software reliability engineering …, 2015
Cited by 47, year 2015
Shapley counterfactual credits for multi-agent reinforcement learning
J Li, K Kuang, B Wang, F Liu, L Chen, F Wu, J Xiao
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021
Cited by 46, year 2021
Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control
K Young, B Wang, ME Taylor
International Joint Conference on Artificial Intelligence (IJCAI) 2019, 2018
Cited by 26*, year 2018
Deconfounded value decomposition for multi-agent reinforcement learning
J Li, K Kuang, B Wang, F Liu, L Chen, C Fan, F Wu, J Xiao
International Conference on Machine Learning, 12843-12856, 2022
Cited by 15, year 2022
Multilinear extension of k-submodular functions
B Wang, H Zhou
arXiv e-prints, arXiv: 2107.07103, 2021
Cited by 15, year 2021
Beyond winning and losing: modeling human motivations and behaviors using inverse reinforcement learning
B Wang, T Sun, SX Zheng
Artificial Intelligence and Interactive Digital Entertainment (AIIDE) 2019, 2018
Cited by 15*, year 2018
Semantically aligned task decomposition in multi-agent reinforcement learning
W Li, D Qiao, B Wang, X Wang, B Jin, H Zha
arXiv preprint arXiv:2305.10865, 2023
Cited by 10, year 2023
Improved regret bounds for linear adversarial mdps via linear optimization
F Kong, X Zhang, B Wang, S Li
arXiv preprint arXiv:2302.06834, 2023
Cited by 9, year 2023
Online policy optimization for robust MDP
J Dong, J Li, B Wang, J Zhang
arXiv preprint arXiv:2209.13841, 2022
Cited by 9, year 2022
Learning from good trajectories in offline multi-agent reinforcement learning
Q Tian, K Kuang, F Liu, B Wang
Proceedings of the AAAI Conference on Artificial Intelligence 37 (10), 11672 …, 2023
Cited by 6, year 2023
Learning adversarial linear mixture markov decision processes with bandit feedback and unknown transition
C Zhao, R Yang, B Wang, S Li
The Eleventh International Conference on Learning Representations, 2022
Cited by 6, year 2022
Algorithms and theory for supervised gradual domain adaptation
J Dong, S Zhou, B Wang, H Zhao
arXiv preprint arXiv:2204.11644, 2022
Cited by 6, year 2022
Learning fair representations via distance correlation minimization
D Guo, C Wang, B Wang, H Zha
IEEE Transactions on Neural Networks and Learning Systems, 2022
Cited by 5, year 2022
Combinatorial bandits under strategic manipulations
J Dong, K Li, S Li, B Wang
Proceedings of the Fifteenth ACM International Conference on Web Search and …, 2022
Cited by 5, year 2022
Policy optimization with second-order advantage information
J Li, B Wang
International Joint Conference on Artificial Intelligence (IJCAI) 2018 …, 2018
Cited by 4, year 2018
Online Influence Maximization under Decreasing Cascade Model
F Kong, J Xie, B Wang, T Yao, S Li
arXiv preprint arXiv:2305.15428, 2023
Cited by 3, year 2023
Private Q-Learning with Functional Noise in Continuous Spaces
B Wang, N Hegde
The Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2019
Cited by 3, year 2019
Learning adversarial low-rank markov decision processes with unknown transition and full-information feedback
C Zhao, R Yang, B Wang, X Zhang, S Li
Advances in Neural Information Processing Systems 36, 2024
Cited by 2, year 2024
Articles 1–20