Harm van Seijen

引用先

	すべて	2019 年以来
引用	1695	1380
h 指標	17	16
i10 指標	23	20

340

170

255

2011201220132014201520162017201820192020202120222023202411 7 15 16 25 29 63 141 156 196 253 313 334 127

オープンアクセス

すべて表示

5 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Richard S. SuttonKeen, Amii, and University of Alberta確認したメールアドレス: richsutton.com
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo確認したメールアドレス: cs.ox.ac.uk
Marco WieringInstitute of Artificial Intelligence and Cognitive Engineering, University of Groningen確認したメールアドレス: rug.nl
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCL確認したメールアドレス: google.com
Patrick M. PilarskiUniversity of Alberta, Amii (Alberta Machine Intelligence Institute)確認したメールアドレス: ualberta.ca
A. Rupam MahmoodUniversity of Alberta, Amii確認したメールアドレス: ualberta.ca
Marlos C. MachadoUniversity of Alberta確認したメールアドレス: ualberta.ca

フォロー

Harm van Seijen

Sony AI

確認したメールアドレス: sony.com

reinforcement learning machine learning representation learning


タイトル引用回数順公開年順タイトル順	引用先引用先	年
Reducing network agnostophobia AR Dhamija, M Günther, T Boult Advances in Neural Information Processing Systems 31, 2018	344	2018
A theoretical and empirical analysis of Expected Sarsa H Van Seijen, H Van Hasselt, S Whiteson, M Wiering 2009 ieee symposium on adaptive dynamic programming and reinforcement …, 2009	268	2009
Hybrid reward architecture for reinforcement learning H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang Advances in Neural Information Processing Systems 30, 2017	258	2017
True online TD (lambda) H Seijen, R Sutton International Conference on Machine Learning, 692-700, 2014	126	2014
True online temporal-difference learning H Van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton Journal of Machine Learning Research 17 (145), 1-40, 2016	110	2016
Systematic generalisation with group invariant predictions F Ahmed, Y Bengio, H Van Seijen, A Courville International Conference on Learning Representations, 2020	98	2020
A Deeper Look at Planning as Learning from Replay H van Seijen, RS Sutton International Conference on Machine Learning, 2015	76	2015
Planning by prioritized sweeping with small backups H Van Seijen, R Sutton International Conference on Machine Learning, 361-369, 2013	59*	2013
Modular lifelong reinforcement learning via neural composition JA Mendez, H van Seijen, E Eaton arXiv preprint arXiv:2207.00429, 2022	40	2022
Hybrid reward architecture for reinforcement learning HH Van Seijen, SMF Booshehri, RMH Laroche, JS Romoff US Patent 10,977,551, 2021	38	2021
Using a logarithmic mapping to enable lower discount factors in reinforcement learning H Van Seijen, M Fatemi, A Tavakoli Advances in Neural Information Processing Systems 32, 2019	29	2019
Exploiting Best-Match Equations for Efficient Reinforcement Learning. H van Seijen, S Whiteson, H van Hasselt, M Wiering Journal of Machine Learning Research 12 (6), 2011	27	2011
On value function representation of long horizon problems L Lehnert, R Laroche, H van Seijen Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	26	2018
Multi-advisor reinforcement learning R Laroche, M Fatemi, J Romoff, H van Seijen arXiv preprint arXiv:1704.00756, 2017	25	2017
Effective multi-step temporal-difference learning for non-linear function approximation H van Seijen arXiv preprint arXiv:1608.05151, 2016	23	2016
Dead-ends and secure exploration in reinforcement learning M Fatemi, S Sharma, H Van Seijen, SE Kahou International Conference on Machine Learning, 1873-1881, 2019	21	2019
Efficient abstraction selection in reinforcement learning H van Seijen, S Whiteson, L Kester Computational Intelligence 30 (4), 657-699, 2014	18	2014
Learning invariances for policy generalization R Tachet, P Bachman, H van Seijen arXiv preprint arXiv:1809.02591, 2018	17	2018
Separation of concerns in reinforcement learning H van Seijen, M Fatemi, J Romoff, R Laroche arXiv preprint arXiv:1612.05159, 2016	13	2016
Towards evaluating adaptivity of model-based reinforcement learning methods Y Wan, A Rahimi-Kalahroudi, J Rajendran, I Momennejad, S Chandar, ... International Conference on Machine Learning, 22536-22561, 2022	11	2022

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者