A. Rupam Mahmood

Zitiert von

	Alle	Seit 2019
Zitate	1520	1233
h-index	17	15
i10-index	21	19

260

130

195

2013201420152016201720182019202020212022202320247 19 34 58 57 102 124 178 231 231 260 209

Öffentlicher Zugriff

Alle anzeigen

8 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Richard S. SuttonKeen, Amii, and University of AlbertaBestätigte E-Mail-Adresse bei richsutton.com
Gautham VasanAmii, University of AlbertaBestätigte E-Mail-Adresse bei ualberta.ca
Martha WhiteUniversity of AlbertaBestätigte E-Mail-Adresse bei ualberta.ca
James BergstraPrincipal Engineer, Ocado TechnologyBestätigte E-Mail-Adresse bei ocado.com
Dmytro KorenkevychMeta AIBestätigte E-Mail-Adresse bei meta.com
Qingfeng LanPhD student @ University of AlbertaBestätigte E-Mail-Adresse bei ualberta.ca
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLBestätigte E-Mail-Adresse bei google.com
Patrick M. PilarskiProfessor, University of Alberta, Amii (Alberta Machine Intelligence Institute)Bestätigte E-Mail-Adresse bei ualberta.ca
Shibhansh DoharePhD Student, University of AlbertaBestätigte E-Mail-Adresse bei ualberta.ca
Harm van SeijenSony AIBestätigte E-Mail-Adresse bei sony.com
Doina PrecupDeepMind and McGill UniversityBestätigte E-Mail-Adresse bei cs.mcgill.ca
Brent KomerPhD Student, University of WaterlooBestätigte E-Mail-Adresse bei uwaterloo.ca
Marlos C. MachadoUniversity of AlbertaBestätigte E-Mail-Adresse bei ualberta.ca
Thomas DegrisDeepMindBestätigte E-Mail-Adresse bei google.com
Fengdi Cheuniversity of albertaBestätigte E-Mail-Adresse bei ualberta.ca
Oliver LimoyoUniversity of Toronto Institute for Aerospace StudiesBestätigte E-Mail-Adresse bei mail.utoronto.ca
Bryan ChanUniversity of AlbertaBestätigte E-Mail-Adresse bei ualberta.ca
Jonathan KellyUniversity of Toronto Institute for Aerospace StudiesBestätigte E-Mail-Adresse bei utias.utoronto.ca
Mohamed ElsayedUniversity of AlbertaBestätigte E-Mail-Adresse bei ualberta.ca

Folgen

A. Rupam Mahmood

University of Alberta, Amii

Bestätigte E-Mail-Adresse bei ualberta.ca - Startseite

Continual learning reinforcement learning robot learning representation learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
An emphatic approach to the problem of off-policy temporal-difference learning RS Sutton, AR Mahmood, M White (JMLR) Journal of Machine Learning Research 17, 2016	293	2016
Benchmarking reinforcement learning algorithms on real-world robots AR Mahmood, D Korenkevych, G Vasan, W Ma, J Bergstra (CoRL) Proceedings of the 2nd Annual Conference on Robot Learning, 2018	197	2018
Weighted importance sampling for off-policy learning with linear function approximation AR Mahmood, H van Hasselt, RS Sutton (NeurIPS) Advances in Neural Information Processing Systems 27, 2014	174	2014
True online temporal-difference learning H van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton (JMLR) Journal of Machine Learning Research 17, 2016	115	2016
Setting up a reinforcement learning task with a real-world robot AR Mahmood, D Korenkevych, BJ Komer, J Bergstra (IROS) 2018 IEEE/RSJ International Conference on Intelligent Robots and …, 2018	89	2018
Tuning-free step-size adaptation AR Mahmood, RS Sutton, T Degris, PM Pilarski (ICASSP) Acoustics, Speech and Signal Processing, 2012 IEEE International …, 2012	83	2012
Maintaining plasticity in deep continual learning S Dohare, JF Hernandez-Garcia, P Rahman, AR Mahmood, RS Sutton arXiv preprint arXiv:2306.13812, 2024	69*	2024
Multi-step off-policy learning without importance sampling ratios AR Mahmood, H Yu, RS Sutton arXiv preprint arXiv:1702.03006, 2017	53	2017
Representation Search through Generate and Test AR Mahmood, RS Sutton Workshops at the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013	47	2013
Off-policy TD (λ) with a true online equivalence H van Hasselt, AR Mahmood, RS Sutton (UAI) Proceedings of the 30th Conference on Uncertainty in Artificial …, 2014	46	2014
On generalized Bellman equations and temporal-difference learning H Yu, AR Mahmood, RS Sutton (JMLR) The Journal of Machine Learning Research 19 (1), 1864-1912, 2018	40	2018
A new Q (λ) with interim forward view and Monte Carlo equivalence RS Sutton, AR Mahmood, D Precup, M CA, H van Hasselt, U CA (ICML) In International Conference on Machine Learning, 2014	40	2014
Emphatic temporal-difference learning AR Mahmood, H Yu, M White, RS Sutton In European Workshops on Reinforcement Learning, 2015	37	2015
Off-policy learning based on weighted importance sampling with linear computational complexity AR Mahmood, RS Sutton (UAI) Proceedings of the 31st Conference on Uncertainty in Artificial …, 2015	30	2015
Autoregressive policies for continuous control deep reinforcement learning D Korenkevych, AR Mahmood, G Vasan, J Bergstra (IJCAI) Proceedings of the 28th International Joint Conference on Artificial …, 2019	28	2019
Greedification operators for policy optimization: investigating forward and reverse KL divergences A Chan, H Silva, S Lim, T Kozuno, AR Mahmood, M White (JMLR) Journal of Machine Learning Research, 2022	25	2022
Incremental Off-policy Reinforcement Learning Algorithms A Mahmood University of Alberta, 2017	18	2017
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo H Ishfaq, Q Lan, P Xu, AR Mahmood, D Precup, A Anandkumar, ... (ICLR) International Conference on Learning Representations, 2024	12	2024
Asynchronous reinforcement learning for real-time control of physical robots Y Yuan, AR Mahmood (ICRA) In Proceedings of the 2022 International Conference on Robotics and …, 2022	12	2022
Structure Learning of Causal Bayesian Networks: A Survey A Mahmood Department of Computing Science, University of Alberta, Edmonton, Canada …, 2011	11	2011

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren