Shilong Liu
PhD student, Tsinghua University
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Year
DINO: DETR with improved denoising anchor boxes for end-to-end object detection
H Zhang, F Li, S Liu, L Zhang, H Su, J Zhu, LM Ni, HY Shum
International Conference on Learning Representations, 2023
Cited by: 1382, Year: 2023
Grounding DINO: Marrying DINO with grounded pre-training for open-set object detection
S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, C Li, J Yang, H Su, J Zhu, ...
The 18th European Conference on Computer Vision ECCV, 2024
Cited by: 1362, Year: 2024
DAB-DETR: Dynamic anchor boxes are better queries for DETR
S Liu, F Li, H Zhang, X Yang, X Qi, H Su, J Zhu, L Zhang
International Conference on Learning Representations, 2022
Cited by: 796, Year: 2022
DN-DETR: Accelerate DETR training by introducing query denoising
F Li, H Zhang, S Liu, J Guo, LM Ni, L Zhang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 710, Year: 2022
Mask DINO: Towards a unified transformer-based framework for object detection and segmentation
F Li, H Zhang, H Xu, S Liu, L Zhang, LM Ni, HY Shum
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 366, Year: 2023
Query2Label: A simple transformer way to multi-label classification
S Liu, L Zhang, X Yang, H Su, J Zhu
arXiv preprint arXiv:2107.10834, 2021
Cited by: 225, Year: 2021
Grounded SAM: Assembling open-world models for diverse visual tasks
T Ren, S Liu, A Zeng, J Lin, K Li, H Cao, J Chen, X Huang, Y Chen, F Yan, ...
arXiv preprint arXiv:2401.14159, 2024
Cited by: 182, Year: 2024
Recognize anything: A strong image tagging model
Y Zhang, X Huang, J Ma, Z Li, Z Luo, Y Xie, Y Qin, T Luo, Y Li, S Liu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 167, Year: 2024
Semantic-SAM: Segment and recognize anything at any granularity
F Li, H Zhang, P Sun, X Zou, S Liu, J Yang, C Li, L Zhang, J Gao
arXiv preprint arXiv:2307.04767, 2023
Cited by: 143, Year: 2023
A simple framework for open-vocabulary segmentation and detection
H Zhang, F Li, X Zou, S Liu, C Li, J Yang, L Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 138, Year: 2023
LLaVA-Plus: Learning to use tools for creating multimodal agents
S Liu, H Cheng, H Liu, H Zhang, F Li, T Ren, X Zou, J Yang, H Su, J Zhu, ...
European Conference on Computer Vision, 126-142, 2025
Cited by: 83, Year: 2025
Lite DETR: An interleaved multi-scale encoder for efficient DETR
F Li, A Zeng, S Liu, H Zhang, H Li, L Zhang, LM Ni
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by: 68, Year: 2023
Explicit box detection unifies end-to-end multi-person pose estimation
J Yang, A Zeng, S Liu, F Li, R Zhang, L Zhang
arXiv preprint arXiv:2302.01593, 2023
Cited by: 64, Year: 2023
MP-Former: Mask-piloted transformer for image segmentation
H Zhang, F Li, H Xu, S Huang, S Liu, LM Ni, L Zhang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 59, Year: 2023
LLaVA-Grounding: Grounded visual chat with large multimodal models
H Zhang, H Li, F Li, T Ren, X Zou, S Liu, S Huang, J Gao, C Li, J Yang
European Conference on Computer Vision, 19-35, 2025
Cited by: 43, Year: 2025
Vision-language intelligence: Tasks, representation learning, and large models
F Li, H Zhang, YF Zhang, S Liu, J Guo, LM Ni, PC Zhang, L Zhang
arXiv preprint arXiv:2203.01922, 2022
Cited by: 42, Year: 2022
Unsupervised part segmentation through disentangling appearance and shape
S Liu, L Zhang, X Yang, H Su, J Zhu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 38, Year: 2021
Detection transformer with stable matching
S Liu, T Ren, J Chen, Z Zeng, H Zhang, F Li, H Li, J Huang, H Su, J Zhu, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 31, Year: 2023
Vidu: a highly consistent, dynamic and skilled text-to-video generator with diffusion models
F Bao, C Xiang, G Yue, G He, H Zhu, K Zheng, M Zhao, S Liu, Y Wang, ...
arXiv preprint arXiv:2405.04233, 2024
Cited by: 27, Year: 2024
Visual in-context prompting
F Li, Q Jiang, H Zhang, T Ren, S Liu, X Zou, H Xu, H Li, J Yang, C Li, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 24, Year: 2024