Qingfeng Lan
PhD student @ University of Alberta
Verified email at ualberta.ca
Title
Cited by
Year
Maxmin Q-learning: Controlling the estimation bias of Q-learning
Q Lan, Y Pan, A Fyshe, M White
International Conference on Learning Representations, 2020
Cited by: 159 | Year: 2020
A deep top-k relevance matching model for ad-hoc retrieval
Z Yang, Q Lan, J Guo, Y Fan, X Zhu, Y Lan, Y Wang, X Cheng
Information Retrieval: 24th China Conference, CCIR 2018, Guilin, China …, 2018
Cited by: 16 | Year: 2018
Variational quantum soft actor-critic
Q Lan
arXiv preprint arXiv:2112.11921, 2021
Cited by: 15 | Year: 2021
Model-free Policy Learning with Reward Gradients
Q Lan, S Tosatto, H Farrahi, AR Mahmood
The 25th International Conference on Artificial Intelligence and Statistics …, 2022
Cited by: 8 | Year: 2022
Reducing selection bias in counterfactual reasoning for individual treatment effects estimation
Z Zhang, Q Lan, L Ding, Y Wang, N Hassanpour, R Greiner
NeurIPS 2019 CausalML Workshop, 2019
Cited by: 8 | Year: 2019
Memory-efficient reinforcement learning with value-based knowledge consolidation
Q Lan, Y Pan, J Luo, AR Mahmood
Transactions on Machine Learning Research, 2023
Cited by: 6* | Year: 2023
Learning to Optimize for Reinforcement Learning
Q Lan, AR Mahmood, S Yan, Z Xu
arXiv preprint arXiv:2302.01470, 2023
Cited by: 5 | Year: 2023
A PyTorch Reinforcement Learning Framework for Exploring New Ideas
Q Lan
https://github.com/qlan3/Explorer, 2019
Cited by: 5 | Year: 2019
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
H Ishfaq*, Q Lan*, P Xu, AR Mahmood, D Precup, A Anandkumar, ...
International Conference on Learning Representations, 2024
Cited by: 4 | Year: 2024
Overcoming policy collapse in deep reinforcement learning
S Dohare, Q Lan, AR Mahmood
Sixteenth European Workshop on Reinforcement Learning, 2023
Cited by: 2 | Year: 2023
Elephant Neural Networks: Born to Be a Continual Learner
Q Lan, AR Mahmood
ICML Workshop on High-dimensional Learning Dynamics, 2023
Cited by: 1 | Year: 2023
Predictive Representation Learning for Language Modeling
Q Lan, L Kumar, M White, A Fyshe
arXiv preprint arXiv:2105.14214, 2021
Year: 2021