‪Haoyu Lu‬ - ‪Google Scholar‬

Get my own profile

Cited by

	All	Since 2019
Citations	485	484
h-index	9	9
i10-index	8	8

0

240

120

60

180

202120222023202411 102 226 142

Public access

5 articles

0 articles

available

not available

Based on funding mandates

Haoyu Lu

Haoyu Lu

Renmin University of China

Verified email at ruc.edu.cn

multimodal pre-training video-language modeling


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Towards artificial general intelligence via a multimodal foundation model N Fei, Z Lu, Y Gao, G Yang, Y Huo, J Wen, H Lu, R Song, X Gao, T Xiang, ... Nature Communications 13 (1), 3094, 2022	156*	2022
WenLan: Bridging vision and language by large-scale multi-modal pre-training Y Huo, M Zhang, G Liu, H Lu, Y Gao, G Yang, J Wen, H Zhang, B Xu, ... arXiv preprint arXiv:2103.06561, 2021	125	2021
Cots: Collaborative two-stream vision-language pre-training model for cross-modal retrieval H Lu, N Fei, Y Huo, Y Gao, Z Lu, JR Wen Proceedings of the IEEE/CVF conference on computer Vision and pattern …, 2022	58	2022
Deepseek llm: Scaling open-source language models with longtermism X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024	39	2024
Self-supervised video representation learning with constrained spatiotemporal jigsaw Y Huo, M Ding, H Lu, Z Lu, T Xiang, JR Wen, Z Huang, J Jiang, S Zhang, ...	19	2020
Vdt: General-purpose video diffusion transformers via mask modeling H Lu, G Yang, N Fei, Y Huo, Z Lu, P Luo, M Ding The Twelfth International Conference on Learning Representations, 2023	17*	2023
Uniadapter: Unified parameter-efficient transfer learning for cross-modal modeling H Lu, Y Huo, G Yang, Z Lu, W Zhan, M Tomizuka, M Ding arXiv preprint arXiv:2302.06605, 2023	15	2023
Learning versatile neural architectures by propagating network codes M Ding, Y Huo, H Lu, L Yang, Z Wang, Z Lu, J Wang, P Luo arXiv preprint arXiv:2103.13253, 2021	13	2021
Multimodal foundation models are better simulators of the human brain H Lu, Q Zhou, N Fei, Z Lu, M Ding, J Wen, C Du, X Zhao, H Sun, H He, ... arXiv preprint arXiv:2208.08263, 2022	9	2022
Compressed video contrastive learning Y Huo, M Ding, H Lu, N Fei, Z Lu, JR Wen, P Luo Advances in Neural Information Processing Systems 34, 14176-14187, 2021	9	2021
LGDN: Language-Guided Denoising Network for Video-Language Modeling H Lu, M Ding, N Fei, Y Huo, Z Lu Advances in Neural Information Processing Systems, 2022, 2022	8	2022
DeepSeek-VL: towards real-world vision-language understanding H Lu, W Liu, B Zhang, B Wang, K Dong, B Liu, J Sun, T Ren, Z Li, Y Sun, ... arXiv preprint arXiv:2403.05525, 2024	7	2024
Cross-modal contrastive learning for generalizable and efficient image-text retrieval H Lu, Y Huo, M Ding, N Fei, Z Lu Machine Intelligence Research 20 (4), 569-582, 2023	5	2023
Bmu-moco: Bidirectional momentum update for continual video-language modeling Y Gao, N Fei, H Lu, Z Lu, H Jiang, Y Li, Z Cao Advances in Neural Information Processing Systems 35, 22699-22712, 2022	5	2022

The system can't perform the operation now. Try again later.

Articles 1–14