OptiDICE: Offline policy optimization via stationary distribution correction estimation
J Lee, W Jeon, B Lee, J Pineau, KE Kim
International Conference on Machine Learning, 6120-6130, 2021
Cited by: 78

Representation balancing offline model-based reinforcement learning
BJ Lee, J Lee, KE Kim
International Conference on Learning Representations, 2020
Cited by: 49

Winning the L2RPN challenge: Power grid management via semi-Markov afterstate actor-critic
D Yoon, S Hong, BJ Lee, KE Kim
International Conference on Learning Representations, 2020
Cited by: 41

Optimizing generative dialog state tracker via cascading gradient descent
BJ Lee, W Lim, D Kim, KE Kim
Proceedings of the 15th Annual Meeting of the Special Interest Group on …, 2014
Cited by: 21

Hierarchically-partitioned Gaussian process approximation
BJ Lee, J Lee, KE Kim
Artificial Intelligence and Statistics, 822-831, 2017
Cited by: 18

Neural dialog state tracker for large ontologies by attention mechanism
Y Jang, J Ham, BJ Lee, Y Chang, KE Kim
2016 IEEE Spoken Language Technology Workshop (SLT), 531-537, 2016
Cited by: 17

Batch reinforcement learning with hyperparameter gradients
B Lee, J Lee, P Vrancx, D Kim, KE Kim
International Conference on Machine Learning, 5725-5735, 2020
Cited by: 16

Reinforcement learning for control with multiple frequencies
J Lee, BJ Lee, KE Kim
Advances in Neural Information Processing Systems 33, 3254-3264, 2020
Cited by: 16

Dialog history construction with long-short term memory for robust generative dialog state tracking
BJ Lee, KE Kim
Dialogue & Discourse 7 (3), 47-64, 2016
Cited by: 12

Cross-language neural dialog state tracker for large ontologies using hierarchical attention
Y Jang, J Ham, BJ Lee, KE Kim
IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (11 …, 2018
Cited by: 11

Residual neural processes
BJ Lee, S Hong, KE Kim
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4545-4552, 2020
Cited by: 8

Local metric learning for off-policy evaluation in contextual bandits with continuous actions
H Lee, J Lee, Y Choi, W Jeon, BJ Lee, YK Noh, KE Kim
Advances in Neural Information Processing Systems 35, 3913-3925, 2022
Cited by: 3

MARS: Multiagent Reinforcement Learning for Spatial–Spectral and Temporal Feature Selection in EEG-Based BCI
DH Shin, YH Son, JM Kim, HJ Ahn, JH Seo, CH Ji, JW Han, BJ Lee, ...
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2024
Cited by: 1

Adaptive Online Time-Series Prediction for Virtual Metrology in Semiconductor Manufacturing
S Zabrocki, PS Jo, C Park, D Yim, S Yun, BJ Lee
2023 34th Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC …, 2023
Cited by: 1

Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization
W Kim, D Ki, BJ Lee
Proceedings of the AAAI Conference on Artificial Intelligence 38 (12), 13185 …, 2024

Offline Imitation Learning by Controlling the Effective Planning Horizon
HJ Ahn, SW Shim, BJ Lee
arXiv preprint arXiv:2401.09728, 2024

Quantifying Information of Tokens for Simple and Flexible Simultaneous Machine Translation
D Lee, M Park, BJ Lee
Proceedings of the 27th Conference on Computational Natural Language …, 2023

Improving Neural Machine Translation with Offline Evaluations
MK Park, BJ Lee
Proceedings of the 13th International Joint Conference on Natural Language …, 2023

Learning variable-length skills through Novelty-based Decision Point Identification
M Kim, H Lee, JH Seo, SW Shim, BJ Lee
2023

Offline Reinforcement Learning via Weighted f-divergence
W Kim, D Ki, BJ Lee
2022