Follow
Yifei Huang
Yifei Huang
The University of Tokyo
Verified email at ut-vision.org - Homepage
Title
Cited by
Cited by
Year
Ego4d: Around the world in 3,000 hours of egocentric video
K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
6872022
Semantic aware attention based deep object co-segmentation
H Chen, Y Huang, H Nakayama
Asian Conference on Computer Vision, 435-450, 2018
1562018
Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition
Y Huang, M Cai, Z Li, Y Sato
Oral presentation, European Conference on Computer Vision (ECCV), 789-804, 2018
1442018
Goal-oriented gaze estimation for zero-shot learning
Y Liu, L Zhou, X Bai, Y Huang, L Gu, J Zhou, T Harada
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
1252021
Improving action segmentation via graph-based temporal reasoning
Y Huang, Y Sugano, Y Sato
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
1252020
Mutual context network for jointly estimating egocentric gaze and action
Y Huang, M Cai, Z Li, F Lu, Y Sato
IEEE Transactions on Image Processing 29, 7795-7806, 2020
682020
Manipulation-skill assessment from videos with spatial attention network
Z Li, Y Huang, M Cai, Y Sato
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2019
632019
Videollm: Modeling video sequence with large language models
G Chen, YD Zheng, J Wang, J Xu, Y Huang, J Pan, Y Wang, Y Wang, ...
arXiv preprint arXiv:2305.13292, 2023
562023
Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives
K Grauman, A Westbury, L Torresani, K Kitani, J Malik, T Afouras, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
412024
Deep convolutional neural network-aided detection of portal hypertension in patients with cirrhosis
Y Liu, Z Ning, N Örmeci, W An, Q Yu, K Han, Y Huang, D Liu, F Liu, Z Li, ...
Clinical Gastroenterology and Hepatology 18 (13), 2998-3007. e5, 2020
402020
Internvideo-ego4d: A pack of champion solutions to ego4d challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
arXiv preprint arXiv:2211.09529, 2022
382022
Interact before align: Leveraging cross-modal knowledge for domain adaptive action recognition
L Yang, Y Huang, Y Sugano, Y Sato
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
372022
Commonsense knowledge aware concept selection for diverse and informative visual storytelling
H Chen, Y Huang, H Takamura, H Nakayama
Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 999-1008, 2021
372021
Handling missing sensors in topology-aware iot applications with gated graph neural network
S Liu, S Yao, Y Huang, D Liu, H Shao, Y Zhao, J Li, T Wang, R Wang, ...
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous …, 2020
322020
Towards visually explaining video understanding networks with perturbation
Z Li, W Wang, Z Li, Y Huang, Y Sato
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021
302021
Compound Prototype Matching for Few-Shot Action Recognition
Y Huang, L Yang, Y Sato
European Conference on Computer Vision, 351-368, 2022
282022
Precise multi-modal in-hand pose estimation using low-precision sensors for robotic assembly
F von Drigalski, K Hayashi, Y Huang, R Yonetani, M Hamaya, K Tanaka, ...
2021 IEEE International Conference on Robotics and Automation (ICRA), 968-974, 2021
282021
Video mamba suite: State space model as a versatile alternative for video understanding
G Chen, Y Huang, J Xu, B Pei, Z Chen, Z Li, J Wang, K Li, T Lu, L Wang
arXiv preprint arXiv:2403.09626, 2024
272024
Internvideo2: Scaling video foundation models for multimodal video understanding
Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, J Xu, Z Wang, ...
arXiv preprint arXiv:2403.15377, 2024
232024
Weakly supervised temporal sentence grounding with uncertainty-guided self-training
Y Huang, L Yang, Y Sato
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
162023
The system can't perform the operation now. Try again later.
Articles 1–20