Stephen McAleer

引用先

	すべて	2019 年以来
引用	2827	2823
h 指標	21	21
i10 指標	31	31

900

450

225

675

20192020202120222023202459 181 313 515 867 886

オープンアクセス

すべて表示

17 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Pierre BaldiProfessor, University of California, Irvine確認したメールアドレス: ics.uci.edu
Yaodong YangBOYA (博雅) Assistant Professor at Peking University確認したメールアドレス: pku.edu.cn
Roy FoxAssistant Professor, UC Irvine確認したメールアドレス: uci.edu
JB LanierUC Irvine確認したメールアドレス: uci.edu
Alexander ShmakovUniversity of California Irvine確認したメールアドレス: uci.edu
Forest AgostinelliAssistant Professor at the University of South Carolina確認したメールアドレス: cse.sc.edu
Tuomas SandholmAngel Jordan University Professor of Computer Science, Carnegie Mellon University確認したメールアドレス: cs.cmu.edu
Jun WangProfessor, Computer Science, University College London確認したメールアドレス: cs.ucl.ac.uk
Oliver SlumbersUniversity College London確認したメールアドレス: ucl.ac.uk
Kevin A. WangBrown University確認したメールアドレス: kevinwang.us
Gabriele FarinaAssistant Professor, Massachusetts Institute of Technology確認したメールアドレス: mit.edu
Marc LanctotResearch Scientist, Google DeepMind確認したメールアドレス: google.com
Shauharda (Shaw) KhadkaSenior Applied Scientist at Microsoft確認したメールアドレス: microsoft.com
Somdeb MajumdarIntel Corp確認したメールアドレス: intel.com
Kagan TumerOregon State University確認したメールアドレス: oregonstate.edu
Ioannis PanageasAssistant Professor, University of California, Irvine確認したメールアドレス: ics.uci.edu
Pieter AbbeelUC Berkeley | Covariant確認したメールアドレス: cs.berkeley.edu
Alexander IhlerUniversity of California, Irvine確認したメールアドレス: ics.uci.edu
Michael DennisGoogle DeepMind確認したメールアドレス: cs.berkeley.edu
Karl TuylsFounder at H company, ex-Google DeepMind, Prof at University of Liverpool確認したメールアドレス: hcompany.ai

フォロー

Stephen McAleer

OpenAI

確認したメールアドレス: openai.com - ホームページ

Artificial Intelligence


タイトル引用回数順公開年順タイトル順	引用先引用先	年
Highly accurate machine fault diagnosis using deep transfer learning S Shao, S McAleer, R Yan, P Baldi IEEE Transactions on Industrial Informatics 15 (4), 2446-2455, 2018	1121	2018
Solving the Rubik’s cube with deep reinforcement learning and search F Agostinelli, S McAleer, A Shmakov*, P Baldi Nature Machine Intelligence 1 (8), 356-363, 2019	224	2019
Language Models can Solve Computer Tasks G Kim, P Baldi, S McAleer Neural Information Processing Systems (NeurIPS), 2023	194	2023
Mastering the game of Stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	172	2022
Llemma: An Open Language Model for Mathematics Z Azerbayev, H Schoelkopf, K Paster, M Dos Santos, S McAleer, AQ Jiang, ... International Conference on Learning Representations (ICLR), 2023	122	2023
AI Alignment: A Comprehensive Survey J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang, Y Duan, Z He, J Zhou, ... arXiv preprint arXiv:2310.19852, 2023	100	2023
Solving the Rubik's Cube with Approximate Policy Iteration S McAleer, F Agostinelli, A Shmakov*, P Baldi International Conference on Learning Representations (ICLR), 2018	96*	2018
Pipeline PSRO: A scalable approach for finding approximate nash equilibria in large games S McAleer, J Lanier, R Fox, P Baldi 34th Conference on Neural Information Processing Systems (NeurIPS), 2020	74	2020
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ... 36th Conference on Neural Information Processing Systems (NeurIPS 2022 …, 2022	72	2022
Evolutionary reinforcement learning for sample-efficient multiagent coordination S Majumdar, S Khadka, S Miret, S McAleer, K Tumer International Conference on Machine Learning (ICML), 2020	63	2020
XDO: A double oracle algorithm for extensive-form games S McAleer, J Lanier, P Baldi, R Fox Advances in Neural Information Processing Systems (NeurIPS), 2021	51	2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games R Fox, S McAleer, W Overman, I Panageas AISTATS 2022, 2021	48	2021
Neural auto-curricula in two-player zero-sum games X Feng, O Slumbers, Z Wan, B Liu, S McAleer, Y Wen, J Wang, Y Yang Advances in Neural Information Processing Systems (NeurIPS), 2021	46*	2021
Alphazero-like tree-search can guide large language model decoding and training Z Wan, X Feng, M Wen, SM McAleer, Y Wen, W Zhang, J Wang Forty-first International Conference on Machine Learning, 2024	29	2024
Online Double Oracle LC Dinh, Y Yang, S McAleer, NP Nieves, O Slumbers, Z Tian, DH Mguni, ... Transactions on Machine Learning Research, 2021	28	2021
White Paper: ARIANNA-200 high energy neutrino telescope A Anker, P Baldi, SW Barwick, D Bergman, H Bernhoff, DZ Besson, ... arXiv preprint arXiv:2004.09841, 2020	28	2020
Deep-learning-based reconstruction of the neutrino direction and energy for in-ice radio detectors C Glaser, S McAleer, S Stjärnholm, P Baldi, SW Barwick Astroparticle Physics 145, 102781, 2023	27*	2023
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games S McAleer, JB Lanier, K Wang, P Baldi, R Fox, T Sandholm International Conference on Learning Representations (ICLR), 2022	23*	2022
Curiosity-Driven Multi-Criteria Hindsight Experience Replay J Lanier, S McAleer, P Baldi NeurIPS 2019 Deep RL Workshop, 2019	23	2019
Reducing variance in temporal-difference value estimation via ensemble of deep networks L Liang, Y Xu, S McAleer, D Hu, A Ihler, P Abbeel, R Fox International Conference on Machine Learning (ICML), 2022	22*	2022

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者