Zeyu Zheng

Zitiert von

	Alle	Seit 2019
Zitate	1361	1305
h-index	7	7
i10-index	7	7

520

260

130

390

2017201820192020202120222023202412 43 93 137 193 170 205 504

Öffentlicher Zugriff

Alle anzeigen

5 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Satinder SinghGoogle DeepMind / U. of MichiganBestätigte E-Mail-Adresse bei umich.edu
Junhyuk OhResearch Scientist, DeepMindBestätigte E-Mail-Adresse bei google.com
Eric XingPresident at Mohamed bin Zayed University of AI, Professor of Computer Science, Carnegie Mellon UBestätigte E-Mail-Adresse bei cs.cmu.edu
Hao ZhangUC San DiegoBestätigte E-Mail-Adresse bei ucsd.edu
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLBestätigte E-Mail-Adresse bei google.com
Will DabneyDeepMindBestätigte E-Mail-Adresse bei google.com
Razvan PascanuGoogle DeepMindBestätigte E-Mail-Adresse bei google.com
Wenfei FanProfessor of Web Data Management, University of EdinburghBestätigte E-Mail-Adresse bei inf.ed.ac.uk
Richard L. LewisProfessor of Psychology, Linguistics and Cognitive Science, University of MichiganBestätigte E-Mail-Adresse bei umich.edu
Zhongwen XuTencentBestätigte E-Mail-Adresse bei tencent.com
David SilverDeepMind, UCLBestätigte E-Mail-Adresse bei google.com
Matteo HesselResearch Engineer, Google DeepMindBestätigte E-Mail-Adresse bei google.com
Clare LyleGoogle DeepMindBestätigte E-Mail-Adresse bei deepmind.com
Risto VuorioUniversity of OxfordBestätigte E-Mail-Adresse bei cs.ox.ac.uk
Haozhu WangAmazonBestätigte E-Mail-Adresse bei amazon.com
Chengang JiPhD, University of Michigan-Ann ArborBestätigte E-Mail-Adresse bei umich.edu
L. Jay GuoProfessor of Electrical Engineering and Computer Science, The University of MichiganBestätigte E-Mail-Adresse bei umich.edu
Evgenii NikishinPhD student, Mila, University of MontrealBestätigte E-Mail-Adresse bei umontreal.ca
Vivek VeeriahGoogle DeepMindBestätigte E-Mail-Adresse bei google.com
Bilal PiotGoogle DeepmindBestätigte E-Mail-Adresse bei google.com

Folgen

Zeyu Zheng

DeepMind

Bestätigte E-Mail-Adresse bei deepmind.com - Startseite

artificial intelligence machine learning reinforcement learning deep learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	457	2023
Poseidon: An efficient communication architecture for distributed deep learning on {GPU} clusters H Zhang, Z Zheng, S Xu, W Dai, Q Ho, X Liang, Z Hu, J Wei, P Xie, ... 2017 USENIX Annual Technical Conference (USENIX ATC 17), 181-193, 2017	397	2017
On learning intrinsic rewards for policy gradient methods Z Zheng, J Oh, S Singh Advances in Neural Information Processing Systems, 4644-4654, 2018	193	2018
Parallelizing sequential graph computations W Fan, J Xu, Y Wu, W Yu, J Jiang, Z Zheng, B Zhang, Y Cao, C Tian Proceedings of the 2017 ACM International Conference on Management of Data …, 2017	116	2017
What Can Learned Intrinsic Rewards Capture? Z Zheng, J Oh, M Hessel, Z Xu, M Kroiss, H Van Hasselt, D Silver, S Singh International Conference on Machine Learning, 11436-11446, 2020	86	2020
Automated multi-layer optical design via deep reinforcement learning H Wang, Z Zheng, C Ji, LJ Guo Machine Learning: Science and Technology 2 (2), 025013, 2021	55	2021
Understanding plasticity in neural networks C Lyle, Z Zheng, E Nikishin, BA Pires, R Pascanu, W Dabney International Conference on Machine Learning, 23190-23211, 2023	33	2023
Adaptive Pairwise Weights for Temporal Credit Assignment Z Zheng, R Vuorio, R Lewis, S Singh Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9225-9232, 2022	7*	2022
Learning State Representations from Random Deep Action-conditional Predictions Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh Advances in Neural Information Processing Systems 34, 23679-23691, 2021	7	2021
Towards multi‐agent reinforcement learning‐driven over‐the‐counter market simulations N Vadori, L Ardon, S Ganesh, T Spooner, S Amrouni, J Vann, M Xu, ... Mathematical Finance 34 (2), 262-347, 2024	4	2024
GrASP: Gradient-Based Affordance Selection for Planning V Veeriah, Z Zheng, R Lewis, S Singh arXiv preprint arXiv:2202.04772, 2022	3	2022
Generalized Preference Optimization: A Unified Approach to Offline Alignment Y Tang, ZD Guo, Z Zheng, D Calandriello, R Munos, M Rowland, ... arXiv preprint arXiv:2402.05749, 2024	2	2024
Disentangling the Causes of Plasticity Loss in Neural Networks C Lyle, Z Zheng, K Khetarpal, H van Hasselt, R Pascanu, J Martens, ... arXiv preprint arXiv:2402.18762, 2024	1	2024
Human Alignment of Large Language Models through Online Preference Optimisation D Calandriello, D Guo, R Munos, M Rowland, Y Tang, BA Pires, ... arXiv preprint arXiv:2403.08635, 2024		2024
Towards Perpetually Trainable Neural Networks C Lyle, Z Zheng, K Khetarpal, R Pascanu, J Martens, H van Hasselt, ...		2023
Advances in Deep Reinforcement Learning: Intrinsic Rewards, Temporal Credit Assignment, State Representations, and Value-equivalent Models Z Zheng		2022
Reinforcement learning using meta-learned intrinsic rewards Z Zheng, J Oh, SS Baveja US Patent App. 17/033,410, 2021		2021
Supplementary Material: On Learning Intrinsic Rewards for Policy Gradient Methods Z Zheng, J Oh, S Singh

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–18

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren