フォロー
Hao Zhang
タイトル
引用先
引用先
DINO: Detr with improved denoising anchor boxes for end-to-end object detection
H Zhang*, F Li*, S Liu*, L Zhang, H Su, J Zhu, LM Ni, HY Shum
International Conference on Learning Representations (ICLR), 2023, 2022
6192022
DAB-DETR: Dynamic anchor boxes are better queries for DETR
S Liu, F Li, H Zhang, X Yang, X Qi, H Su, J Zhu, L Zhang
International Conference on Learning Representations (ICLR), 2022, 2022
4102022
Grounding dino: Marrying dino with grounded pre-training for open-set object detection
S Liu, Z Zeng, T Ren, F Li, H Zhang, J Yang, C Li, J Yang, H Su, J Zhu, ...
arXiv preprint arXiv:2303.05499, 2023
3992023
Dn-detr: Accelerate detr training by introducing query denoising
F Li*, H Zhang*, S Liu, J Guo, LM Ni, L Zhang
The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR …, 2022
3492022
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
F Li*, H Zhang*, S Liu, L Zhang, LM Ni, HY Shum
The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023, 2022
1702022
Segment everything everywhere all at once
X Zou*, J Yang*, H Zhang*, F Li*, L Li, J Gao, YJ Lee
NeurIPS 2023, 2023
1612023
A simple framework for open-vocabulary segmentation and detection
H Zhang*, F Li*, X Zou, S Liu, C Li, J Gao, J Yang, L Zhang
ICCV 2023, 2023
542023
Semantic-SAM: Segment and Recognize Anything at Any Granularity
F Li*, H Zhang*, P Sun, X Zou, S Liu, J Yang, C Li, L Zhang, J Gao
arXiv preprint arXiv:2307.04767, 2023
422023
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
J Yang*, H Zhang*, F Li*, X Zou*, C Li, J Gao
arXiv preprint arXiv:2310.11441, 2023
282023
Vision-Language Intelligence: Tasks, Representation Learning, and Large Models
F Li*, H Zhang*, YF Zhang, S Liu, J Guo, LM Ni, PC Zhang, L Zhang
arXiv preprint arXiv:2203.01922, 2022
262022
Lite DETR: An Interleaved Multi-Scale Encoder for Efficient DETR
F Li, A Zeng, S Liu, H Zhang, H Li, L Zhang, LM Ni
CVPR 2023, 2023
212023
MP-Former: Mask-Piloted Transformer for Image Segmentation
H Zhang, F Li, H Xu, S Huang, S Liu, LM Ni, L Zhang
The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023, 2023
192023
Llava-plus: Learning to use tools for creating multimodal agents
S Liu, H Cheng, H Liu, H Zhang, F Li, T Ren, X Zou, J Yang, H Su, J Zhu, ...
arXiv preprint arXiv:2311.05437, 2023
182023
Detection Transformer with Stable Matching
S Liu, T Ren, J Chen, Z Zeng, H Zhang, F Li, H Li, J Huang, H Su, J Zhu, ...
ICCV 2023, 2023
112023
Multi-relation message passing for multi-label text classification
M Ozmen, H Zhang, P Wang, M Coates
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
102022
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
S Liu, Y Liang, F Li, S Huang, H Zhang, H Su, J Zhu, L Zhang
AAAI 2023, 2022
62022
A Strong and Reproducible Object Detector with Only Public Datasets
T Ren, J Yang, S Liu, A Zeng, F Li, H Zhang, H Li, Z Zeng, L Zhang
arxiv, 2023
52023
detrex: Benchmarking Detection Transformers
T Ren*, S Liu*, F Li*, H Zhang*, A Zeng, J Yang, X Liao, D Jia, H Li, H Cao, ...
arXiv preprint arXiv:2306.07265, 2023
42023
Introducing Depth into Transformer-based 3D Object Detection
H Zhang, H Li, A Zeng, F Li, S Liu, X Liao, L Zhang
arXiv preprint arXiv:2302.13002, 2023
4*2023
A unified mutual supervision framework for referring expression segmentation and generation
S Huang, F Li, H Zhang, S Liu, L Zhang, L Wang
arXiv preprint arXiv:2211.07919, 2022
42022
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20