Tabish Rashid

Cited by

	All	Since 2019
Citations	4225	4172
h-index	9	9
i10-index	9	9

Citations per year
Year	Citations
2018	20
2019	134
2020	310
2021	688
2022	1047
2023	1473
2024	517

Public access (based on funding mandates)

4 articles available
0 articles not available

Co-authors

Shimon Whiteson, Professor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo. Verified email at cs.ox.ac.uk
Mikayel Samvelyan, Meta AI & UCL. Verified email at meta.com
Gregory Farquhar, DeepMind. Verified email at google.com
Christian Schroeder de Witt, University of Oxford. Verified email at robots.ox.ac.uk
Jakob Foerster, Associate Professor, University of Oxford. Verified email at eng.ox.ac.uk
Philip Torr, Professor, University of Oxford. Verified email at eng.ox.ac.uk
Chia-Man Hung, University of Oxford. Verified email at robots.ox.ac.uk
Nantas Nardelli, Stealth. Verified email at arbitrarygravitas.com
Jason R.C. Nurse, Reader in Cyber Security, University of Kent. Verified email at kent.ac.uk
Ioannis Agrafiotis, Computer Science Department, University of Oxford. Verified email at cs.ox.ac.uk

Tabish Rashid

Microsoft Research

Verified email at microsoft.com


Title	Cited by	Year
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. T Rashid, M Samvelyan, CS de Witt, G Farquhar, J Foerster, S Whiteson. Journal of Machine Learning Research 21(178):1–51, 2020	2115	2020
The StarCraft Multi-Agent Challenge. M Samvelyan, T Rashid, CS de Witt, G Farquhar, N Nardelli, TGJ Rudner, ... AAMAS 2019	904	2019
MAVEN: Multi-Agent Variational Exploration. A Mahajan, T Rashid, M Samvelyan, S Whiteson. Advances in Neural Information Processing Systems, 7613-7624, 2019	369	2019
Weighted QMIX: Expanding Monotonic Value Function Factorisation. T Rashid, G Farquhar, B Peng, S Whiteson. Advances in Neural Information Processing Systems 33, 2020	314*	2020
A new take on detecting insider threats: exploring the use of hidden Markov models. T Rashid, I Agrafiotis, JRC Nurse. Proceedings of the 8th ACM CCS International Workshop on Managing Insider …, 2016	181	2016
FACMAC: Factored Multi-Agent Centralised Policy Gradients. B Peng, T Rashid, C Schroeder de Witt, PA Kamienny, P Torr, W Böhmer, ... Advances in Neural Information Processing Systems 34, 12208-12221, 2021	168	2021
Imitating human behaviour with diffusion models. T Pearce, T Rashid, A Kanervisto, D Bignell, M Sun, R Georgescu, ... arXiv preprint arXiv:2301.10677, 2023	89	2023
Optimistic Exploration even with a Pessimistic Initialisation. T Rashid, B Peng, W Böhmer, S Whiteson. International Conference on Learning Representations, 2019	46	2019
Exploration with unreliable intrinsic reward in multi-agent reinforcement learning. W Böhmer, T Rashid, S Whiteson. arXiv preprint arXiv:1906.02138, 2019	29	2019
Softmax with Regularization: Better Value Estimation in Multi-Agent Reinforcement Learning. L Pan, T Rashid, B Peng, L Huang, S Whiteson. arXiv preprint arXiv:2103.11883, 2021	5	2021
Estimating α-Rank by Maximizing Information Gain. T Rashid, C Zhang, K Ciosek. Proceedings of the AAAI Conference on Artificial Intelligence 35 (6), 5673-5681, 2021	5	2021
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. T Rashid, M Samvelyan, CS de Witt, G Farquhar, J Foerster, S Whiteson. Proceedings of the 35th International Conference on Machine Learning, 2018		2018
