Folgen
Sridhar Mahadevan
Sridhar Mahadevan
Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst
Bestätigte E-Mail-Adresse bei cs.umass.edu - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Recent advances in hierarchical reinforcement learning
AG Barto, S Mahadevan
Discrete event dynamic systems 13, 341-379, 2003
16922003
Automatic programming of behavior-based robots using reinforcement learning
S Mahadevan, J Connell
Artificial intelligence 55 (2-3), 311-365, 1992
9451992
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems.
E Even-Dar, S Mannor, Y Mansour, S Mahadevan
Journal of machine learning research 7 (6), 2006
7212006
Average reward reinforcement learning: Foundations, algorithms, and empirical results
S Mahadevan
Machine learning 22 (1), 159-195, 1996
6141996
Heterogeneous domain adaptation using manifold alignment
C Wang, S Mahadevan
IJCAI proceedings-international joint conference on artificial intelligence …, 2011
5192011
Generative multi-adversarial networks
I Durugkar, I Gemp, S Mahadevan
arXiv preprint arXiv:1611.01673, 2016
4462016
LEAP: A learning apprentice for VLSI design
TM Mitchell, S Mabadevan, LI Steinberg
Machine learning, 271-289, 1990
4411990
Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes.
S Mahadevan, M Maggioni
Journal of Machine Learning Research 8 (10), 2007
3832007
Manifold alignment using procrustes analysis
C Wang, S Mahadevan
Proceedings of the 25th international conference on Machine learning, 1120-1127, 2008
3442008
Solving semi-Markov decision problems using average reward reinforcement learning
TK Das, A Gosavi, S Mahadevan, N Marchalleck
Management Science 45 (4), 560-574, 1999
3051999
Robot learning
JH Connell, S Mahadevan
Springer Science & Business Media, 2012
2882012
Manifold alignment without correspondence.
C Wang, S Mahadevan
IJCAI 2, 3, 2009
2342009
Self-improving factory simulation using continuous-time average-reward reinforcement learning
S Mahadevan, N Marchalleck, TK Das, A Gosavi
MACHINE LEARNING-INTERNATIONAL WORKSHOP THEN CONFERENCE-, 202-210, 1997
1971997
Hierarchical multi-agent reinforcement learning
R Makar, S Mahadevan, M Ghavamzadeh
Proceedings of the fifth international conference on Autonomous agents, 246-253, 2001
1942001
Proto-value functions: Developmental reinforcement learning
S Mahadevan
Proceedings of the 22nd international conference on Machine learning, 553-560, 2005
1742005
Finite-sample analysis of proximal gradient td algorithms
B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik
arXiv preprint arXiv:2006.14364, 2020
1722020
Hierarchical multi-agent reinforcement learning
M Ghavamzadeh, S Mahadevan, R Makar
Autonomous Agents and Multi-Agent Systems 13, 197-229, 2006
1722006
Repairing disengagement with non-invasive interventions
I Arroyo, K Ferguson, J Johns, T Dragon, H Meheranian, D Fisher, A Barto, ...
AIED 2007, 195-202, 2007
1652007
A study of machine learning regression methods for major elemental analysis of rocks using laser-induced breakdown spectroscopy
TF Boucher, MV Ozanne, ML Carmosino, MD Dyar, S Mahadevan, ...
Spectrochimica Acta Part B: Atomic Spectroscopy 107, 1-10, 2015
1562015
Gaze control for face learning and recognition by humans and machines
T Shipley, P Kellman
From fragments to objects: Segmentation and grouping in vision 463, 2001
1412001
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20