Doina Precup

Cited by

	All	Since 2019
Citations	34442	25379
h-index	65	57
i10-index	237	189

6000

3000

1500

4500

20022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024117 117 178 219 245 308 331 317 323 378 407 484 592 606 881 1085 1893 2607 3420 4309 5269 5962 3781

Public access

View all

58 articles

8 articles

available

not available

Based on funding mandates

Co-authors

Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaVerified email at cs.mcgill.ca
Satinder SinghGoogle DeepMind / U. of MichiganVerified email at umich.edu
Prakash PanangadenProfessor of Computer Science, McGill UniversityVerified email at cs.mcgill.ca
Tal ArbelProfessor of Electrical & Computer Engineering, McGill UniversityVerified email at cim.mcgill.ca
Riashat IslamResearch ScientistVerified email at mail.mcgill.ca
Andre BarretoResearch Scientist, Google DeepMindVerified email at google.com
Emmanuel BengioMcGill UniversityVerified email at mail.mcgill.ca
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ NvidiaVerified email at technion.ac.il
David SilverDeepMind, UCLVerified email at google.com
Jean HarbOpenAIVerified email at openai.com
Guilherme Sant AnnaProfessor (Full) of Pediatrics, McGill UniversityVerified email at mcgill.ca
Philip WarrickPerigen Inc.Verified email at perigen.com
Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Norm FernsVerified email at normferns.com
Jordan FrankSoftware Engineer, FacebookVerified email at cs.mcgill.ca
Amir-massoud FarahmandUniversity of TorontoVerified email at cs.toronto.edu
Pablo Samuel CastroGoogleVerified email at google.com
Hamid MaeiNetflixVerified email at netflix.com
Borja BalleDeepMindVerified email at google.com

Doina Precup

DeepMind and McGill University

Verified email at cs.mcgill.ca

Artificial Intelligence machine learning reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The multimodal brain tumor image segmentation benchmark (BRATS) BH Menze, A Jakab, S Bauer, J Kalpathy-Cramer, K Farahani, J Kirby, ... IEEE transactions on medical imaging 34 (10), 1993-2024, 2014	5784	2014
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning RS Sutton, D Precup, S Singh Artificial intelligence 112 (1-2), 181-211, 1999	4432	1999
Deep reinforcement learning that matters P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2346	2018
Off-policy deep reinforcement learning without exploration S Fujimoto, D Meger, D Precup International conference on machine learning, 2052-2062, 2019	1520	2019
The option-critic architecture PL Bacon, J Harb, D Precup Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1243	2017
Eligibility traces for off-policy policy evaluation D Precup Computer Science Department Faculty Publication Series, 80, 2000	954	2000
Fast gradient-descent methods for temporal-difference learning with linear function approximation RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ... Proceedings of the 26th annual international conference on machine learning …, 2009	716	2009
Learning with pseudo-ensembles P Bachman, O Alsharif, D Precup Advances in neural information processing systems 27, 2014	663	2014
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup The 10th International Conference on Autonomous Agents and Multiagent …, 2011	600	2011
Algorithms for multi-armed bandit problems V Kuleshov, D Precup arXiv preprint arXiv:1402.6028, 2014	556	2014
Reward is enough D Silver, S Singh, D Precup, RS Sutton Artificial Intelligence 299, 103535, 2021	553	2021
Off-policy temporal-difference learning with function approximation D Precup, RS Sutton, S Dasgupta ICML, 417-424, 2001	466	2001
Learning options in reinforcement learning M Stolle, D Precup Abstraction, Reformulation, and Approximation: 5th International Symposium …, 2002	464	2002
Exploring uncertainty measures in deep networks for multiple sclerosis lesion detection and segmentation T Nair, D Precup, DL Arnold, T Arbel Medical image analysis 59, 101557, 2020	460	2020
Temporal abstraction in reinforcement learning D Precup University of Massachusetts Amherst, 2000	396	2000
Metrics for Finite Markov Decision Processes. N Ferns, P Panangaden, D Precup UAI 4, 162-169, 2004	351	2004
Convergent temporal-difference learning with arbitrary smooth function approximation H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton Advances in neural information processing systems 22, 2009	345	2009
Conditional computation in neural networks for faster models E Bengio, PL Bacon, J Pineau, D Precup arXiv preprint arXiv:1511.06297, 2015	339	2015
Reproducibility of benchmarked deep reinforcement learning tasks for continuous control R Islam, P Henderson, M Gomrokchi, D Precup arXiv preprint arXiv:1708.04133, 2017	320	2017
Towards continual reinforcement learning: A review and perspectives K Khetarpal, M Riemer, I Rish, D Precup Journal of Artificial Intelligence Research 75, 1401-1476, 2022	262	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors