Christoph Dann

Zitiert von

	Alle	Seit 2019
Zitate	2384	2086
h-index	21	20
i10-index	32	30

480

240

120

360

2013201420152016201720182019202020212022202320248 18 13 52 81 122 171 242 406 422 469 375

Öffentlicher Zugriff

Alle anzeigen

10 Artikel

1 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityBestätigte E-Mail-Adresse bei cs.stanford.edu
Mehryar MohriHead, Learning Theory Team, Google Research; Professor, Courant Institute of Mathematical Sciences.Bestätigte E-Mail-Adresse bei google.com
Claudio GentileGoogle Research, New York, USABestätigte E-Mail-Adresse bei google.com
Yishay MansourTel Aviv UniversityBestätigte E-Mail-Adresse bei tauex.tau.ac.il
Jan PetersProfessor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKIBestätigte E-Mail-Adresse bei ias.tu-darmstadt.de
Gerhard NeumannProfessor, Karlsruhe Institute of Technology (KIT)Bestätigte E-Mail-Adresse bei robot-learning.de
Sebastian NowozinGoogle DeepMindBestätigte E-Mail-Adresse bei deepmind.com
Lihong Li (李力鸿)AmazonBestätigte E-Mail-Adresse bei amazon.com
Philip ThomasUniversity of Massachusetts AmherstBestätigte E-Mail-Adresse bei cs.umass.edu
Peter GehlerZalandoBestätigte E-Mail-Adresse bei zalando.de

Folgen

Christoph Dann

Research Scientist, Google

Bestätigte E-Mail-Adresse bei google.com - Startseite

Reinforcement Learning Machine Learning. Sequential Decision Making under Uncertainty


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Unifying PAC and regret: Uniform PAC bounds for episodic reinforcement learning C Dann, T Lattimore, E Brunskill Advances in Neural Information Processing Systems, 5717-5727, 2017	321	2017
Policy evaluation with temporal differences: a survey and comparison. C Dann, G Neumann, J Peters Journal of Machine Learning Research 15 (1), 809-883, 2014	286	2014
Sample complexity of episodic fixed-horizon reinforcement learning C Dann, E Brunskill Advances in Neural Information Processing Systems, 2818-2826, 2015	275	2015
Scaling up behavioral science interventions in online education RF Kizilcec, J Reich, M Yeomans, C Dann, E Brunskill, G Lopez, S Turkay, ... Proceedings of the National Academy of Sciences, 2020	183	2020
Policy certificates: Towards accountable reinforcement learning C Dann, L Li, W Wei, E Brunskill International Conference on Machine Learning, 1507-1516, 2019	165	2019
On Oracle-Efficient PAC RL with Rich Observations C Dann, N Jiang, A Krishnamurthy, A Agarwal, J Langford, RE Schapire Advances in Neural Information Processing Systems, 1429-1439, 2018	125	2018
Thoughts on massively scalable Gaussian processes AG Wilson, C Dann, H Nickisch arXiv preprint arXiv:1511.01870, 2015	116	2015
RLPy: a value-function-based reinforcement learning framework for education and research. A Geramifard, C Dann, RH Klein, W Dabney, JP How Journal of Machine Learning Research 16, 1573-1578, 2015	110*	2015
Being optimistic to be conservative: Quickly learning a cvar policy R Keramati, C Dann, A Tamkin, E Brunskill Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4436-4443, 2020	86	2020
The human kernel AG Wilson, C Dann, C Lucas, EP Xing Advances in Neural Information Processing Systems, 2854-2862, 2015	83	2015
Automated matching of pipeline corrosion features from in-line inspection data MR Dann, C Dann Reliability Engineering & System Safety 162, 40-50, 2017	51	2017
A Model Selection Approach for Corruption Robust Reinforcement Learning CY Wei, C Dann, J Zimmert International Conference on Algorithmic Learning Theory, 2022	49	2022
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning C Dann, M Mohri, T Zhang, J Zimmert Advances in Neural Information Processing Systems 34, 2021	47*	2021
Regret Bound Balancing and Elimination for Model Selection in Bandits and RL A Pacchiano, C Dann, C Gentile, P Bartlett arXiv preprint arXiv:2012.13045, 2020	47	2020
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation C Dann, Y Mansour, M Mohri, A Sekhari, K Sridharan International Conference on Machine Learning, 4666-4689, 2022	45	2022
Bayesian time-of-flight for realtime shape, illumination and albedo A Adam, C Dann, O Yair, S Mazor, S Nowozin IEEE transactions on pattern analysis and machine intelligence 39 (5), 851-864, 2017	43	2017
Dynamic balancing for model selection in bandits and rl A Cutkosky, C Dann, A Das, C Gentile, A Pacchiano, M Purohit International Conference on Machine Learning, 2276-2285, 2021	37	2021
Beyond value-function gaps: Improved instance-dependent regret bounds for episodic reinforcement learning C Dann, TV Marinov, M Mohri, J Zimmert Advances in Neural Information Processing Systems 34, 2021	35	2021
Distributionally-aware exploration for cvar bandits A Tamkin, R Keramati, C Dann, E Brunskill NeurIPS 2019 Workshop on Safety and Robustness on Decision Making, 2019	35	2019
A minimaximalist approach to reinforcement learning from human feedback G Swamy, C Dann, R Kidambi, ZS Wu, A Agarwal arXiv preprint arXiv:2401.04056, 2024	29	2024

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren