Tabish Rashid

Cited by

	All	Since 2019
Citations	4236	4183
h-index	9	9
i10-index	9	9

1500

750

375

1125

201820192020202120222023202420 134 310 688 1047 1473 528

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoVerified email at cs.ox.ac.uk
Mikayel SamvelyanMeta AI & UCLVerified email at meta.com
Gregory FarquharDeepMindVerified email at google.com
Christian Schroeder de WittUniversity of OxfordVerified email at robots.ox.ac.uk
Jakob FoersterAssociate Professor, University of OxfordVerified email at eng.ox.ac.uk
Philip TorrProfessor, University of OxfordVerified email at eng.ox.ac.uk
Chia-Man HungUniversity of OxfordVerified email at robots.ox.ac.uk
Nantas NardelliStealthVerified email at arbitrarygravitas.com
Jason R.C. NurseReader in Cyber Security, University of KentVerified email at kent.ac.uk
Ioannis AgrafiotisComputer Science Department, University of OxfordVerified email at cs.ox.ac.uk

Tabish Rashid

Microsoft Research

Verified email at microsoft.com


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson Journal of Machine Learning Research 21(178):1−51, 2020, 2020	2122	2020
The StarCraft Multi-Agent Challenge M Samvelyan, T Rashid, CS de Witt, G Farquhar, N Nardelli, TGJ Rudner, ... AAMAS 2019, 2019	906	2019
Maven: Multi-agent variational exploration A Mahajan, T Rashid, M Samvelyan, S Whiteson Advances in Neural Information Processing Systems, 7613-7624, 2019	369	2019
Weighted QMIX: Expanding Monotonic Value Function Factorisation T Rashid, G Farquhar, B Peng, S Whiteson Advances in Neural Information Processing Systems 33, 2020, 2020	314*	2020
A new take on detecting insider threats: exploring the use of hidden markov models T Rashid, I Agrafiotis, JRC Nurse Proceedings of the 8th ACM CCS International Workshop on Managing Insider …, 2016	181	2016
Facmac: Factored multi-agent centralised policy gradients B Peng, T Rashid, C Schroeder de Witt, PA Kamienny, P Torr, W Böhmer, ... Advances in Neural Information Processing Systems 34, 12208-12221, 2021	169	2021
Imitating human behaviour with diffusion models T Pearce, T Rashid, A Kanervisto, D Bignell, M Sun, R Georgescu, ... arXiv preprint arXiv:2301.10677, 2023	90	2023
Optimistic Exploration even with a Pessimistic Initialisation T Rashid, B Peng, W Boehmer, S Whiteson International Conference on Learning Representations, 2019	46	2019
Exploration with unreliable intrinsic reward in multi-agent reinforcement learning W Böhmer, T Rashid, S Whiteson arXiv preprint arXiv:1906.02138, 2019	29	2019
Softmax with Regularization: Better Value Estimation in Multi-Agent Reinforcement Learning L Pan, T Rashid, B Peng, L Huang, S Whiteson arXiv preprint arXiv:2103.11883, 2021	5	2021
Estimating α-Rank by Maximizing Information Gain T Rashid, C Zhang, K Ciosek Proceedings of the AAAI Conference on Artificial Intelligence 35 (6), 5673-5681, 2021	5	2021
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning T Rashid, M Samvelyan, CS de Witt, G Farquhar, J Foerster, S Whiteson Proceedings of the 35th International Conference on Machine Learning, 2018		2018

The system can't perform the operation now. Try again later.

Articles 1–12

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors