Shilong Liu

引用先

	すべて	2019 年以来
引用	4122	4121
h 指標	18	18
i10 指標	22	22

2300

1150

575

1725

202220232024222 1617 2266

オープンアクセス

すべて表示

2 件の論文

1 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Lei ZhangInternational Digital Economy Academy (IDEA)確認したメールアドレス: idea.edu.cn
Feng LiPhD student, Hong Kong University of Science and Technology確認したメールアドレス: connect.ust.hk
Hao ZhangThe Hong Kong University of Science and Technology確認したメールアドレス: connect.ust.hk
Hang SuAssociated Professor, Tsinghua University確認したメールアドレス: mail.tsinghua.edu.cn
Jun ZhuProfessor of Computer Science, Tsinghua University確認したメールアドレス: mail.tsinghua.edu.cn
Tianhe RenInternational Digital Economy Academy (IDEA)確認したメールアドレス: idea.edu.cn
Lionel NiChair Professor of Data Science and Analytics, HKUST(Guangzhou)確認したメールアドレス: ust.hk
Zhaoyang ZengInternational Digital Economy Academy確認したメールアドレス: idea.edu.cn
Jianwei YangPrincipal Researcher, Microsoft Research, Redmond確認したメールアドレス: microsoft.com
Chunyuan LiMicrosoft Research, Redmond確認したメールアドレス: microsoft.com
Jie YangThe Chinese Univeristy of Hong Kong, Shenzhen確認したメールアドレス: link.cuhk.edu.cn
Heung-Yeung ShumMicrosoft確認したメールアドレス: microsoft.com
Hongyang LiSouth China University of Technology確認したメールアドレス: mail.scut.edu.cn
Xiao YangTsinghua University確認したメールアドレス: mails.tsinghua.edu.cn
Xueyan ZouPostDoc at University of California, San Diego確認したメールアドレス: wisc.edu
Xianbiao QiInternational Digital Economy Academy確認したメールアドレス: idea.edu.cn
Ailing ZengTencent確認したメールアドレス: tencent.com
Huaizhe XuHong Kong University of Science and Technology確認したメールアドレス: connect.ust.hk
Qing JiangM.S student, South China University of Technology確認したメールアドレス: mail.scut.edu.cn
Xinyu HuangPhD student, Fudan University確認したメールアドレス: fudan.edu.cn

フォロー

Shilong Liu

その他の名前刘世隆

PhD student, Tsinghua University

確認したメールアドレス: mails.tsinghua.edu.cn - ホームページ

Computer Vision Object Detection Visual Grounding Multi-Modality


タイトル引用回数順公開年順タイトル順	引用先引用先	年
DINO: Detr with improved denoising anchor boxes for end-to-end object detection H Zhang, F Li, S Liu, L Zhang, H Su, J Zhu, LM Ni, HY Shum arXiv preprint arXiv:2203.03605, 2022	967	2022
Grounding dino: Marrying dino with grounded pre-training for open-set object detection S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, C Li, J Yang, H Su, J Zhu, ... arXiv preprint arXiv:2303.05499, 2023	808	2023
Dab-detr: Dynamic anchor boxes are better queries for detr S Liu, F Li, H Zhang, X Yang, X Qi, H Su, J Zhu, L Zhang arXiv preprint arXiv:2201.12329, 2022	582	2022
DN-DETR: Accelerate detr training by introducing query denoising F Li, H Zhang, S Liu, J Guo, LM Ni, L Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	505	2022
Mask dino: Towards a unified transformer-based framework for object detection and segmentation F Li, H Zhang, H Xu, S Liu, L Zhang, LM Ni, HY Shum Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	254	2023
Query2label: A simple transformer way to multi-label classification S Liu, L Zhang, X Yang, H Su, J Zhu arXiv preprint arXiv:2107.10834, 2021	185	2021
Recognize anything: A strong image tagging model Y Zhang, X Huang, J Ma, Z Li, Z Luo, Y Xie, Y Qin, T Luo, Y Li, S Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	104	2024
A simple framework for open-vocabulary segmentation and detection H Zhang, F Li, X Zou, S Liu, C Li, J Yang, L Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	100	2023
Semantic-sam: Segment and recognize anything at any granularity F Li, H Zhang, P Sun, X Zou, S Liu, J Yang, C Li, L Zhang, J Gao arXiv preprint arXiv:2307.04767, 2023	98	2023
Grounded sam: Assembling open-world models for diverse visual tasks T Ren, S Liu, A Zeng, J Lin, K Li, H Cao, J Chen, X Huang, Y Chen, F Yan, ... arXiv preprint arXiv:2401.14159, 2024	69	2024
Llava-plus: Learning to use tools for creating multimodal agents S Liu, H Cheng, H Liu, H Zhang, F Li, T Ren, X Zou, J Yang, H Su, J Zhu, ... arXiv preprint arXiv:2311.05437, 2023	56	2023
Explicit box detection unifies end-to-end multi-person pose estimation J Yang, A Zeng, S Liu, F Li, R Zhang, L Zhang arXiv preprint arXiv:2302.01593, 2023	48	2023
Lite detr: An interleaved multi-scale encoder for efficient detr F Li, A Zeng, S Liu, H Zhang, H Li, L Zhang, LM Ni Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	45	2023
Mp-former: Mask-piloted transformer for image segmentation H Zhang, F Li, H Xu, S Huang, S Liu, LM Ni, L Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	38	2023
Vision-language intelligence: Tasks, representation learning, and large models F Li, H Zhang, YF Zhang, S Liu, J Guo, LM Ni, PC Zhang, L Zhang arXiv preprint arXiv:2203.01922, 2022	35	2022
Unsupervised part segmentation through disentangling appearance and shape S Liu, L Zhang, X Yang, H Su, J Zhu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	32	2021
Detection transformer with stable matching S Liu, T Ren, J Chen, Z Zeng, H Zhang, F Li, H Li, J Huang, H Su, J Zhu, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	25	2023
Llava-grounding: Grounded visual chat with large multimodal models H Zhang, H Li, F Li, T Ren, X Zou, S Liu, S Huang, J Gao, L Zhang, C Li, ... arXiv preprint arXiv:2312.02949, 2023	19	2023
Vidu: a highly consistent, dynamic and skilled text-to-video generator with diffusion models F Bao, C Xiang, G Yue, G He, H Zhu, K Zheng, M Zhao, S Liu, Y Wang, ... arXiv preprint arXiv:2405.04233, 2024	12	2024
DQ-DETR: Dual query detection transformer for phrase extraction and grounding S Liu, S Huang, F Li, H Zhang, Y Liang, H Su, J Zhu, L Zhang Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1728-1736, 2023	11	2023

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者