Vivek Veeriah

Zitiert von

	Alle	Seit 2019
Zitate	1011	745
h-index	11	10
i10-index	14	11

160

120

2013201420152016201720182019202020212022202320246 9 6 30 96 104 121 129 136 154 149 55

Öffentlicher Zugriff

Alle anzeigen

5 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Satinder SinghGoogle DeepMind / U. of MichiganBestätigte E-Mail-Adresse bei umich.edu
Junhyuk OhResearch Scientist, DeepMindBestätigte E-Mail-Adresse bei google.com
Richard S. SuttonKeen, Amii, and University of AlbertaBestätigte E-Mail-Adresse bei richsutton.com
Guo-Jun Qi (齐国君), Fellow of IEEE &...Computer Science, University of Central FloridaBestätigte E-Mail-Adresse bei ucf.edu
Zhongwen XuTencentBestätigte E-Mail-Adresse bei tencent.com
David SilverDeepMind, UCLBestätigte E-Mail-Adresse bei google.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLBestätigte E-Mail-Adresse bei google.com
Matteo HesselResearch Engineer, Google DeepMindBestätigte E-Mail-Adresse bei google.com
Tom ZahavyStaff Research Scientist, Google DeepMindBestätigte E-Mail-Adresse bei deepmind.com
Richard L. LewisProfessor of Psychology, Linguistics and Cognitive Science, University of MichiganBestätigte E-Mail-Adresse bei umich.edu
Naifan ZhuangPhD student of Department of Computer Science, University of Central FloridaBestätigte E-Mail-Adresse bei knights.ucf.edu
Janarthanan RajendranAssistant Professor, Faculty of Computer Science, Dalhousie UniversityBestätigte E-Mail-Adresse bei umich.edu
Patrick M. PilarskiUniversity of Alberta, Amii (Alberta Machine Intelligence Institute)Bestätigte E-Mail-Adresse bei ualberta.ca
Iurii KemaevDeepMindBestätigte E-Mail-Adresse bei deepmind.com
Alex KearneyPhD Candidate, University of AlbertaBestätigte E-Mail-Adresse bei ualberta.ca
Jaden TravnikUniversity of Alberta, Sony AIBestätigte E-Mail-Adresse bei ualberta.ca
Shangtong ZhangUniversity of VirginiaBestätigte E-Mail-Adresse bei virginia.edu
Zeyu ZhengDeepMindBestätigte E-Mail-Adresse bei deepmind.com
Ted MoskovitzGatsby Unit, UCLBestätigte E-Mail-Adresse bei gatsby.ucl.ac.uk
Sebastian FlennerhagResearch Scientist at DeepMindBestätigte E-Mail-Adresse bei google.com

Folgen

Vivek Veeriah

Google DeepMind

Bestätigte E-Mail-Adresse bei google.com

Reinforcement learning MCTS Artificial Intelligence Language Models Planning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Differential recurrent neural networks for action recognition V Veeriah, N Zhuang, GJ Qi Proceedings of the IEEE international conference on computer vision, 4041-4049, 2015	586	2015
Discovery of useful questions as auxiliary tasks V Veeriah, M Hessel, Z Xu, J Rajendran, RL Lewis, J Oh, HP van Hasselt, ... Advances in Neural Information Processing Systems 32, 2019	94	2019
A self-tuning actor-critic algorithm T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ... Advances in neural information processing systems 33, 20913-20924, 2020	78	2020
Many-goals reinforcement learning V Veeriah, J Oh, S Singh arXiv preprint arXiv:1806.09605, 2018	55	2018
Discovery of options via meta-learned subgoals V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 29861-29873, 2021	34	2021
Face valuing: Training user interfaces with facial expressions and reinforcement learning V Veeriah, PM Pilarski, RS Sutton arXiv preprint arXiv:1606.02807, 2016	26	2016
Robust hand gesture recognition algorithm for simple mouse control V Veeriah, PL Swaminathan International Journal of Computer and Communication Engineering 2 (2), 219, 2013	26	2013
Deep Learning Architecture with Dynamically Programmed Layers for Brain Connectome Prediction V Veeriah J, R Durvasula, GJ Qi ACM KDD 2015, 2015	22	2015
Tidbd: Adapting temporal-difference step-sizes through stochastic meta-descent A Kearney, V Veeriah, JB Travnik, RS Sutton, PM Pilarski arXiv preprint arXiv:1804.03334, 2018	16	2018
Reload: Reinforcement learning with optimistic ascent-descent for last-iterate convergence in constrained mdps T Moskovitz, B O’Donoghue, V Veeriah, S Flennerhag, S Singh, T Zahavy International Conference on Machine Learning, 25303-25336, 2023	11	2023
Learning feature relevance through step size adaptation in temporal-difference learning A Kearney, V Veeriah, J Travnik, PM Pilarski, RS Sutton arXiv preprint arXiv:1903.03252, 2019	11	2019
Diversifying ai: Towards creative chess with alphazero T Zahavy, V Veeriah, S Hou, K Waugh, M Lai, E Leurent, N Tomasev, ... arXiv preprint arXiv:2308.09175, 2023	10	2023
How Should an Agent Practice? J Rajendran, R Lewis, V Veeriah, H Lee, S Singh Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5454-5461, 2020	10	2020
Forward actor-critic for nonlinear function approximation in reinforcement learning V Veeriah, H van Seijen, RS Sutton Proceedings of the 16th Conference on Autonomous Agents and MultiAgent …, 2017	10	2017
Crossprop: Learning representations by stochastic meta-gradient descent in neural networks V Veeriah, S Zhang, RS Sutton Machine Learning and Knowledge Discovery in Databases: European Conference …, 2017	9	2017
Learning state representations from random deep action-conditional predictions Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh Advances in Neural Information Processing Systems 34, 23679-23691, 2021	7	2021
Grasp: Gradient-based affordance selection for planning V Veeriah, Z Zheng, R Lewis, S Singh arXiv preprint arXiv:2202.04772, 2022	3	2022
Learning options for action selection with meta-gradients in multi-task reinforcement learning VVJ Veeraiah, TBZ Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, ... US Patent App. 17/918,365, 2023	1	2023
Discovery in Reinforcement Learning V Veeriah	1	2022
Learning representations by stochastic meta-gradient descent in neural networks V Veeriah, S Zhang, RS Sutton arXiv preprint arXiv:1612.02879, 2016	1	2016

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren