Pano-avqa: Grounded audio-visual question answering on 360deg videos H Yun, Y Yu, W Yang, K Lee, G Kim
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
55 2021 Transitional adaptation of pretrained models for visual storytelling Y Yu, J Chung, H Yun, J Kim, G Kim
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
28 2021 Multimodal knowledge alignment with reinforcement learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, P Ammanabrolu, R Zellers, ...
arXiv preprint arXiv:2205.12630, 2022
26 2022 Panoramic Vision Transformer for Saliency Detection in 360 Videos H Yun, S Lee, G Kim
European Conference on Computer Vision, 422-439, 2022
15 2022 Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, R Zellers, P Ammanabrolu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
6 2023 Character grounding and re-identification in story of videos and text descriptions Y Yu, J Kim, H Yun, J Chung, G Kim
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
6 2020 Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation H Yun, J Na, G Kim
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
1 2023 A Mobile Robot Generating Video Summaries of Seniors' Indoor Activities CY Yang, H Yun, S Varadaraj, JY Hsu
Proceedings of the 21st International Conference on Human-Computer …, 2019
2019