Folgen
Shie Mannor
Shie Mannor
Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research
Bestätigte E-Mail-Adresse bei technion.ac.il - Startseite
Titel
Zitiert von
Zitiert von
Jahr
A Tutorial on the Cross-Entropy Method
B DE, P KROESE, S MANNOR
Annals of Operations Research 134 (1), 19-67, 2005
3257*2005
The kernel recursive least-squares algorithm
Y Engel, S Mannor, R Meir
IEEE Transactions on signal processing 52 (8), 2275-2285, 2004
12332004
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems.
E Even-Dar, S Mannor, Y Mansour, S Mahadevan
Journal of machine learning research 7 (6), 2006
7332006
Robustness and Regularization of Support Vector Machines.
H Xu, C Caramanis, S Mannor
Journal of machine learning research 10 (7), 2009
5822009
Bayesian reinforcement learning: A survey
M Ghavamzadeh, S Mannor, J Pineau, A Tamar
Foundations and Trends® in Machine Learning 8 (5-6), 359-483, 2015
5362015
Reward constrained policy optimization
C Tessler, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1805.11074, 2018
5192018
PAC bounds for multi-armed bandit and Markov decision processes
E Even-Dar, S Mannor, Y Mansour
Computational Learning Theory: 15th Annual Conference on Computational …, 2002
5172002
Reinforcement learning with Gaussian processes
Y Engel, S Mannor, R Meir
ICML, 201-208, 2005
5012005
Robustness and generalization
H Xu, S Mannor
Machine learning 86, 391-423, 2012
4932012
The sample complexity of exploration in the multi-armed bandit problem
S Mannor, JN Tsitsiklis
Journal of Machine Learning Research 5 (Jun), 623-648, 2004
4772004
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
4352017
Robust regression and lasso
H Xu, C Caramanis, S Mannor
Advances in neural information processing systems 21, 2008
3682008
Q-cut—dynamic discovery of sub-goals in reinforcement learning
I Menache, S Mannor, N Shimkin
Machine Learning: ECML 2002: 13th European Conference on Machine Learning …, 2002
3682002
Policy gradients with variance related risk criteria
A Tamar, D Di Castro, S Mannor
Proceedings of the twenty-ninth international conference on machine learning …, 2012
3652012
Risk-sensitive and robust decision-making: a cvar optimization approach
Y Chow, A Tamar, S Mannor, M Pavone
Advances in neural information processing systems 28, 2015
3582015
The cross entropy method for classification
S Mannor, D Peleg, R Rubinstein
Proceedings of the 22nd international conference on Machine learning, 561-568, 2005
3572005
Dynamic abstraction in reinforcement learning via clustering
S Mannor, I Menache, A Hoze, U Klein
Proceedings of the twenty-first international conference on Machine learning, 71, 2004
3302004
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem, S Mannor
International conference on machine learning, 1899-1908, 2016
3282016
Percentile optimization for Markov decision processes with parameter uncertainty
E Delage, S Mannor
Operations research 58 (1), 203-213, 2010
328*2010
Bayes meets Bellman: The Gaussian process approach to temporal difference learning
Y Engel, S Mannor, R Meir
Proceedings of the 20th International Conference on Machine Learning (ICML …, 2003
3032003
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20