Yifei Huang

Cited by

	All	Since 2019
Citations	1938	1923
h-index	19	19
i10-index	25	25

700

350

175

525

201820192020202120222023202415 46 86 145 319 641 682

Public access

View all

16 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yoichi SatoProfessor, Institute of Industrial Science, The University of TokyoVerified email at iis.u-tokyo.ac.jp
Zhenqiang LiThe University of TokyoVerified email at iis.u-tokyo.ac.jp
Minjie CaiAssociate Professor @ Hunan University; The University of TokyoVerified email at iis.u-tokyo.ac.jp
Guo ChenNanjing UniversityVerified email at smail.nju.edu.cn
Yusuke SuganoInstitute of Industrial Science, The University of TokyoVerified email at iis.u-tokyo.ac.jp
Jilan XuFudan UniversityVerified email at m.fudan.edu.cn
Lin GuResearch Scientist, RIKEN AIP, University of TokyoVerified email at mi.t.u-tokyo.ac.jp
Kai KunzeKeio Media DesignVerified email at kmd.keio.ac.jp
Feng LuProfessor, Beihang UniversityVerified email at buaa.edu.cn
Weimin WangDalian University of TechnologyVerified email at dlut.edu.cn
Xiao BaiProfessor of Computer Science, Beihang UniversityVerified email at buaa.edu.cn
Ryosuke FurutaThe University of TokyoVerified email at iis.u-tokyo.ac.jp
Takuma YagiResearch Scientist, National Institute of Advanced Industrial Science and Technology (AIST)Verified email at aist.go.jp
Guangming WuCenter for Spatial Information Science, The University of TokyoVerified email at csis.u-tokyo.ac.jp
Hong ChenThe University of TokyoVerified email at nlab.ci.i.u-tokyo.ac.jp
Felix von DrigalskiMujin Inc.

Yifei Huang

The University of Tokyo

Verified email at ut-vision.org - Homepage

human computer interaction egocentric vision gaze


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Ego4d: Around the world in 3,000 hours of egocentric video K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	687	2022
Semantic aware attention based deep object co-segmentation H Chen, Y Huang, H Nakayama Asian Conference on Computer Vision, 435-450, 2018	156	2018
Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition Y Huang, M Cai, Z Li, Y Sato Oral presentation, European Conference on Computer Vision (ECCV), 789-804, 2018	144	2018
Goal-oriented gaze estimation for zero-shot learning Y Liu, L Zhou, X Bai, Y Huang, L Gu, J Zhou, T Harada Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	125	2021
Improving action segmentation via graph-based temporal reasoning Y Huang, Y Sugano, Y Sato Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	125	2020
Mutual context network for jointly estimating egocentric gaze and action Y Huang, M Cai, Z Li, F Lu, Y Sato IEEE Transactions on Image Processing 29, 7795-7806, 2020	68	2020
Manipulation-skill assessment from videos with spatial attention network Z Li, Y Huang, M Cai, Y Sato Proceedings of the IEEE International Conference on Computer Vision Workshops, 2019	63	2019
Videollm: Modeling video sequence with large language models G Chen, YD Zheng, J Wang, J Xu, Y Huang, J Pan, Y Wang, Y Wang, ... arXiv preprint arXiv:2305.13292, 2023	56	2023
Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives K Grauman, A Westbury, L Torresani, K Kitani, J Malik, T Afouras, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	41	2024
Deep convolutional neural network-aided detection of portal hypertension in patients with cirrhosis Y Liu, Z Ning, N Örmeci, W An, Q Yu, K Han, Y Huang, D Liu, F Liu, Z Li, ... Clinical Gastroenterology and Hepatology 18 (13), 2998-3007. e5, 2020	40	2020
Internvideo-ego4d: A pack of champion solutions to ego4d challenges G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ... arXiv preprint arXiv:2211.09529, 2022	38	2022
Interact before align: Leveraging cross-modal knowledge for domain adaptive action recognition L Yang, Y Huang, Y Sugano, Y Sato Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	37	2022
Commonsense knowledge aware concept selection for diverse and informative visual storytelling H Chen, Y Huang, H Takamura, H Nakayama Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 999-1008, 2021	37	2021
Handling missing sensors in topology-aware iot applications with gated graph neural network S Liu, S Yao, Y Huang, D Liu, H Shao, Y Zhao, J Li, T Wang, R Wang, ... Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous …, 2020	32	2020
Towards visually explaining video understanding networks with perturbation Z Li, W Wang, Z Li, Y Huang, Y Sato Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021	30	2021
Compound Prototype Matching for Few-Shot Action Recognition Y Huang, L Yang, Y Sato European Conference on Computer Vision, 351-368, 2022	28	2022
Precise multi-modal in-hand pose estimation using low-precision sensors for robotic assembly F von Drigalski, K Hayashi, Y Huang, R Yonetani, M Hamaya, K Tanaka, ... 2021 IEEE International Conference on Robotics and Automation (ICRA), 968-974, 2021	28	2021
Video mamba suite: State space model as a versatile alternative for video understanding G Chen, Y Huang, J Xu, B Pei, Z Chen, Z Li, J Wang, K Li, T Lu, L Wang arXiv preprint arXiv:2403.09626, 2024	27	2024
Internvideo2: Scaling video foundation models for multimodal video understanding Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, J Xu, Z Wang, ... arXiv preprint arXiv:2403.15377, 2024	23	2024
Weakly supervised temporal sentence grounding with uncertainty-guided self-training Y Huang, L Yang, Y Sato Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	16	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors