Shaofeng Zou
Title
Cited by
Year
Finite-sample analysis for SARSA with linear function approximation
S Zou, T Xu, Y Liang
NeurIPS 2019, 2019
Cited by: 207, Year: 2019
Tightening mutual information-based bounds on generalization error
Y Bu, S Zou, VV Veeravalli
IEEE Journal on Selected Areas in Information Theory 1 (1), 121-130, 2020
Cited by: 201, Year: 2020
Sequential (quickest) change detection: Classical results and new directions
L Xie, S Zou, Y Xie, VV Veeravalli
IEEE Journal on Selected Areas in Information Theory 2 (2), 494-514, 2021
Cited by: 135, Year: 2021
Estimation of KL divergence: Optimal minimax rate
Y Bu, S Zou, Y Liang, VV Veeravalli
IEEE Transactions on Information Theory 64 (4), 2648-2674, 2018
Cited by: 112*, Year: 2018
Online robust reinforcement learning with model uncertainty
Y Wang, S Zou
Advances in Neural Information Processing Systems 34, 7193-7206, 2021
Cited by: 106, Year: 2021
Two time-scale off-policy TD learning: Non-asymptotic analysis over Markovian samples
T Xu, S Zou, Y Liang
Advances in Neural Information Processing Systems, 10633-10643, 2019
Cited by: 89, Year: 2019
Policy gradient method for robust reinforcement learning
Y Wang, S Zou
International conference on machine learning, 23484-23526, 2022
Cited by: 72, Year: 2022
Nonparametric Detection of Anomalous Data Streams
S Zou, Y Liang, HV Poor, X Shi
IEEE Transactions on Signal Processing 65 (21), 5785-5797, 2017
Cited by: 61*, Year: 2017
Robust multi-agent reinforcement learning with state uncertainty
S He, S Han, S Su, S Han, S Zou, F Miao
TMLR, 2023
Cited by: 44, Year: 2023
An information theoretic approach to secret sharing
S Zou, Y Liang, L Lai, S Shamai
IEEE Transactions on Information Theory 61 (6), 3121-3136, 2015
Cited by: 44, Year: 2015
Quickest change detection under transient dynamics: Theory and asymptotic analysis
S Zou, G Fellouris, VV Veeravalli
IEEE Transactions on Information Theory 65 (3), 1397-1412, 2019
Cited by: 42*, Year: 2019
Quickest detection of dynamic events in networks
S Zou, VV Veeravalli, J Li, D Towsley
IEEE Transactions on Information Theory, 2019
Cited by: 39, Year: 2019
Faster algorithm and sharper analysis for constrained Markov decision process
T Li, Z Guan, S Zou, T Xu, Y Liang, G Lan
Operations Research Letters 54, 107107, 2024
Cited by: 35, Year: 2024
Signal processing and machine learning for biomedical big data
E Sejdic, TH Falk
CRC press, 2018
Cited by: 35, Year: 2018
A robust and constrained multi-agent reinforcement learning framework for electric vehicle AMoD systems
S He, Y Wang, S Han, S Zou, F Miao
IROS 2023, 2022
Cited by: 30, Year: 2022
Sample and communication-efficient decentralized actor-critic algorithms with finite-time analysis
Z Chen, Y Zhou, RR Chen, S Zou
International Conference on Machine Learning, 3794-3834, 2022
Cited by: 30, Year: 2022
What is the solution for state-adversarial multi-agent reinforcement learning?
S Han, S Su, S He, S Han, H Yang, S Zou, F Miao
arXiv preprint arXiv:2212.02705, 2022
Cited by: 29, Year: 2022
Information-Theoretic Understanding of Population Risk Improvement with Model Compression
Y Bu, W Gao, S Zou, VV Veeravalli
AAAI, 3300-3307, 2020
Cited by: 28*, Year: 2020
Sequential algorithms for moving anomaly detection in networks
G Rovatsos, S Zou, VV Veeravalli
Sequential Analysis, 2020
Cited by: 28*, Year: 2020
Finite-sample analysis of Greedy-GQ with linear function approximation under Markovian noise
Y Wang, S Zou
Conference on Uncertainty in Artificial Intelligence, 11-20, 2020
Cited by: 27, Year: 2020