Zhuoran Yang

Cited by

	All	Since 2020
Citations	10923	10571
h-index	48	47
i10-index	94	93

Citations per year

Year	Citations
2017	29
2018	72
2019	228
2020	690
2021	1482
2022	1994
2023	2662
2024	3175
2025	555

Public access

45 articles available
1 article not available

Based on funding mandates


Zhuoran Yang

Yale University

Verified email at yale.edu

machine learning, optimization, reinforcement learning, statistics


Title	Authors	Venue	Cited by	Year
Multi-agent reinforcement learning: A selective overview of theories and algorithms	K Zhang, Z Yang, T Başar	Handbook of reinforcement learning and control, 321-384, 2021	1845	2021
A Theoretical Analysis of Deep Q-Learning			905*	2020
Provably efficient reinforcement learning with linear function approximation	C Jin, Z Yang, Z Wang, MI Jordan	Mathematics of Operations Research 48 (3), 1496-1521, 2023	902	2023
Fully decentralized multi-agent reinforcement learning with networked agents	K Zhang, Z Yang, H Liu, T Zhang, T Basar	International conference on machine learning, 5872-5881, 2018	760	2018
Is pessimism provably efficient for offline RL?	Y Jin, Z Yang, Z Wang	International Conference on Machine Learning, 5084-5096, 2021	477	2021
Provably efficient exploration in policy optimization	Q Cai, Z Yang, C Jin, Z Wang	International Conference on Machine Learning, 1283-1294, 2020	330	2020
A two-timescale stochastic algorithm framework for bilevel optimization: Complexity analysis and application to actor-critic	M Hong, HT Wai, Z Wang, Z Yang	SIAM Journal on Optimization 33 (1), 147-180, 2023	327	2023
Neural policy gradient methods: Global optimality and rates of convergence	L Wang, Q Cai, Z Yang, Z Wang	arXiv preprint arXiv:1909.01150, 2019	274	2019
Neural trust region/proximal policy optimization attains globally optimal policy	B Liu, Q Cai, Z Yang, Z Wang	Advances in neural information processing systems 32, 2019	241	2019
Multi-agent reinforcement learning via double averaging primal-dual optimization	HT Wai, Z Yang, Z Wang, M Hong	Advances in neural information processing systems 31, 2018	209	2018
Provably efficient safe exploration via primal-dual policy optimization	D Ding, X Wei, Z Yang, Z Wang, M Jovanovic	International conference on artificial intelligence and statistics, 3304-3312, 2021	191	2021
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium	Q Xie, Y Chen, Z Wang, Z Yang	Mathematics of Operations Research 48 (1), 433-462, 2023	165	2023
Provably global convergence of actor-critic: A case for linear quadratic regulator with ergodic cost	Z Yang, Y Chen, M Hong, Z Wang	Advances in neural information processing systems 32, 2019	162	2019
Neural temporal difference and Q learning provably converge to global optima	Q Cai, Z Yang, JD Lee, Z Wang	Mathematics of Operations Research 49 (1), 619-651, 2024	160*	2024
Policy optimization provably converges to Nash equilibria in zero-sum linear quadratic games	K Zhang, Z Yang, T Basar	Advances in Neural Information Processing Systems 32, 2019	152	2019
A near-optimal algorithm for stochastic bilevel optimization via double-momentum	P Khanduri, S Zeng, M Hong, HT Wai, Z Wang, Z Yang	Advances in neural information processing systems 34, 30271-30283, 2021	146	2021
On function approximation in reinforcement learning: Optimism in the face of large state spaces	Z Yang, C Jin, Z Wang, M Wang, MI Jordan	arXiv preprint arXiv:2011.04622, 2020	140*	2020
Convergent policy optimization for safe reinforcement learning	M Yu, Z Yang, M Kolar, Z Wang	Advances in Neural Information Processing Systems 32, 2019	137	2019
Networked multi-agent reinforcement learning in continuous spaces	K Zhang, Z Yang, T Basar	2018 IEEE conference on decision and control (CDC), 2771-2776, 2018	122	2018
Pontryagin differentiable programming: An end-to-end learning and control framework	W Jin, Z Wang, Z Yang, S Mou	Advances in Neural Information Processing Systems 33, 7979-7992, 2020	100	2020
