Doina Precup
Doina Precup
DeepMind and McGill University
Verified email at cs.mcgill.ca
TitleCited byYear
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
RS Sutton, D Precup, S Singh
Artificial intelligence 112 (1-2), 181-211, 1999
22771999
The multimodal brain tumor image segmentation benchmark (BRATS)
BH Menze, A Jakab, S Bauer, J Kalpathy-Cramer, K Farahani, J Kirby, ...
IEEE transactions on medical imaging 34 (10), 1993-2024, 2014
14262014
Deep reinforcement learning that matters
P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
4402018
Fast gradient-descent methods for temporal-difference learning with linear function approximation
RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ...
Proceedings of the 26th Annual International Conference on Machine Learning …, 2009
4202009
Eligibility traces for off-policy policy evaluation
D Precup
Computer Science Department Faculty Publication Series, 80, 2000
3462000
The option-critic architecture
PL Bacon, J Harb, D Precup
Thirty-First AAAI Conference on Artificial Intelligence, 2017
3402017
Off-policy temporal-difference learning with function approximation
D Precup, RS Sutton, S Dasgupta
ICML, 417-424, 2001
2642001
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
RS Sutton
2592019
Algorithms for multi-armed bandit problems
V Kuleshov, D Precup
arXiv preprint arXiv:1402.6028, 2014
2492014
Learning options in reinforcement learning
M Stolle, D Precup
International Symposium on abstraction, reformulation, and approximation …, 2002
2362002
Temporal abstraction in reinforcement learning.
D Precup
2312001
Convergent temporal-difference learning with arbitrary smooth function approximation
S Bhatnagar, D Precup, D Silver, RS Sutton, HR Maei, C Szepesvári
Advances in neural information processing systems, 1204-1212, 2009
1882009
Automatic basis function construction for approximate dynamic programming and reinforcement learning
PW Keller, S Mannor, D Precup
Proceedings of the 23rd international conference on Machine learning, 449-456, 2006
1762006
Metrics for Finite Markov Decision Processes.
N Ferns, P Panangaden, D Precup
UAI 4, 162-169, 2004
1542004
Theoretical results on reinforcement learning with temporally abstract options
D Precup, RS Sutton, S Singh
European conference on machine learning, 382-393, 1998
1411998
Multi-time models for temporally abstract planning
D Precup, RS Sutton
Advances in neural information processing systems, 1050-1056, 1998
1411998
Activity and gait recognition with time-delay embeddings
J Frank, S Mannor, D Precup
Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010
1382010
Intra-Option Learning about Temporally Abstract Actions.
RS Sutton, D Precup, SP Singh
ICML 98, 556-564, 1998
1361998
Learning with pseudo-ensembles
P Bachman, O Alsharif, D Precup
Advances in neural information processing systems, 3365-3373, 2014
1162014
Sparse distributed memories for on-line value-based reinforcement learning
B Ratitch, D Precup
European Conference on Machine Learning, 347-358, 2004
1152004
The system can't perform the operation now. Try again later.
Articles 1–20