Vladislav Kurenkov
Lead Research Scientist @ Tinkoff
Verified email at innopolis.ru
Title
Cited by
Year
CORL: Research-oriented Deep Offline Reinforcement Learning Library
D Tarasov, A Nikulin, D Akimov, V Kurenkov, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
Cited by: 38, Year: 2024
Showing your offline reinforcement learning work: Online evaluation budget matters
V Kurenkov, S Kolesnikov
International Conference on Machine Learning, 11729-11752, 2022
Cited by: 21, Year: 2022
Anti-exploration by random network distillation
A Nikulin, V Kurenkov, D Tarasov, S Kolesnikov
International Conference on Machine Learning, 26228-26244, 2023
Cited by: 13, Year: 2023
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
A Nikulin, V Kurenkov, D Tarasov, D Akimov, S Kolesnikov
NeurIPS 2022, 3rd Offline RL Workshop: Offline RL as a "Launchpad", 2022
Cited by: 8, Year: 2022
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
D Akimov, V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov
NeurIPS 2022, 3rd Offline RL Workshop: Offline RL as a "Launchpad", 2022
Cited by: 7, Year: 2022
Revisiting the minimalist approach to offline reinforcement learning
D Tarasov, V Kurenkov, A Nikulin, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
Cited by: 6, Year: 2024
Katakomba: Tools and benchmarks for data-driven nethack
V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov
Advances in Neural Information Processing Systems 36, 2024
Cited by: 2, Year: 2024
XLand-minigrid: Scalable meta-reinforcement learning environments in JAX
A Nikulin, V Kurenkov, I Zisman, A Agarkov, V Sinii, S Kolesnikov
arXiv preprint arXiv:2312.12044, 2023
Cited by: 2, Year: 2023
Prompts and Pre-Trained Language Models for Offline Reinforcement Learning
D Tarasov, V Kurenkov, S Kolesnikov
ICLR 2022, Workshop on Generalizable Policy Learning in Physical World, 2022
Cited by: 2, Year: 2022
Learning stabilizing control policies for a tensegrity hopper with augmented random search
V Kurenkov, H Hamed, S Savin
2020 International Conference on Industrial Engineering, Applications and …, 2020
Cited by: 2, Year: 2020
Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order
V Kurenkov, B Maksudov, A Khan
arXiv preprint arXiv:1910.12354, 2019
Cited by: 2, Year: 2019
In-Context Reinforcement Learning for Variable Action Spaces
V Sinii, A Nikulin, V Kurenkov, I Zisman, S Kolesnikov
arXiv preprint arXiv:2312.13327, 2023
Cited by: 1, Year: 2023
Emergence of In-Context Reinforcement Learning from Noise Distillation
I Zisman, V Kurenkov, A Nikulin, V Sinii, S Kolesnikov
arXiv preprint arXiv:2312.12275, 2023
Cited by: 1, Year: 2023
Guiding Evolutionary Strategies by Differentiable Robot Simulators
V Kurenkov, B Maksudov
NeurIPS 2021, 4th Robot Learning Workshop, 2021
Cited by: 1, Year: 2021
Mathematical modelling of tensegrity robots with rigid rods
SI Savin, LI Vorochaeva, VV Kurenkov
Computer research and modeling 12 (4), 821-830, 2020
Year: 2020