Folgen
Tabish Rashid
Tabish Rashid
Microsoft Research
Bestätigte E-Mail-Adresse bei microsoft.com
Titel
Zitiert von
Zitiert von
Jahr
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson
Journal of Machine Learning Research 21(178):1−51, 2020, 2020
26602020
The StarCraft Multi-Agent Challenge
M Samvelyan, T Rashid, CS de Witt, G Farquhar, N Nardelli, TGJ Rudner, ...
AAMAS 2019, 2019
11262019
Maven: Multi-agent variational exploration
A Mahajan, T Rashid, M Samvelyan, S Whiteson
Advances in Neural Information Processing Systems, 7613-7624, 2019
4342019
Weighted QMIX: Expanding Monotonic Value Function Factorisation
T Rashid, G Farquhar, B Peng, S Whiteson
Advances in Neural Information Processing Systems 33, 2020, 2020
377*2020
Facmac: Factored multi-agent centralised policy gradients
B Peng, T Rashid, C Schroeder de Witt, PA Kamienny, P Torr, W Böhmer, ...
Advances in Neural Information Processing Systems 34, 12208-12221, 2021
2372021
A new take on detecting insider threats: exploring the use of hidden markov models
T Rashid, I Agrafiotis, JRC Nurse
Proceedings of the 8th ACM CCS International Workshop on Managing Insider …, 2016
2022016
Imitating human behaviour with diffusion models
T Pearce, T Rashid, A Kanervisto, D Bignell, M Sun, R Georgescu, ...
arXiv preprint arXiv:2301.10677, 2023
1632023
Optimistic Exploration even with a Pessimistic Initialisation
T Rashid, B Peng, W Boehmer, S Whiteson
International Conference on Learning Representations, 2019
522019
Regularized softmax deep multi-agent q-learning
L Pan, T Rashid, B Peng, L Huang, S Whiteson
Advances in Neural Information Processing Systems 34, 1365-1377, 2021
332021
Exploration with unreliable intrinsic reward in multi-agent reinforcement learning
W Böhmer, T Rashid, S Whiteson
arXiv preprint arXiv:1906.02138, 2019
322019
Estimating α-Rank by Maximizing Information Gain
T Rashid, C Zhang, K Ciosek
Proceedings of the AAAI Conference on Artificial Intelligence 35 (6), 5673-5681, 2021
112021
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games
L Schäfer, L Jones, A Kanervisto, Y Cao, T Rashid, R Georgescu, ...
32023
Scaling Laws for Pre-training Agents and World Models
T Pearce, T Rashid, D Bignell, R Georgescu, S Devlin, K Hofmann
arXiv preprint arXiv:2411.04434, 2024
2024
Aligning Agents like Large Language Models
A Jelley, Y Cao, D Bignell, S Devlin, T Rashid
2023
Exploration and value function factorisation in single and multi-agent reinforcement learning
T Rashid
University of Oxford, 2021
2021
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
T Rashid, M Samvelyan, CS de Witt, G Farquhar, J Foerster, S Whiteson
Proceedings of the 35th International Conference on Machine Learning, 2018
2018
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–16