フォロー
Ronghang Hu
Ronghang Hu
Research Scientist, Meta AI
確認したメール アドレス: meta.com - ホームページ
タイトル
引用先
引用先
Learning to reason: End-to-end module networks for visual question answering
R Hu, J Andreas, M Rohrbach, T Darrell, K Saenko
Proceedings of the IEEE international conference on computer vision, 804-813, 2017
6642017
Natural language object retrieval
R Hu, H Xu, M Rohrbach, J Feng, K Saenko, T Darrell
Proceedings of the IEEE conference on computer vision and pattern …, 2016
6212016
Grounding of textual phrases in images by reconstruction
A Rohrbach, M Rohrbach, R Hu, T Darrell, B Schiele
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
5342016
Speaker-follower models for vision-and-language navigation
D Fried, R Hu, V Cirik, A Rohrbach, J Andreas, LP Morency, ...
Advances in neural information processing systems 31, 2018
4792018
Flava: A foundational language and vision alignment model
A Singh, R Hu, V Goswami, G Couairon, W Galuba, M Rohrbach, D Kiela
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
4692022
Modeling relationships in referential expressions with compositional modular networks
R Hu, M Rohrbach, J Andreas, T Darrell, K Saenko
Proceedings of the IEEE conference on computer vision and pattern …, 2017
4042017
Segmentation from natural language expressions
R Hu, M Rohrbach, T Darrell
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
3962016
LSDA: Large scale detection through adaptation
J Hoffman, S Guadarrama, ES Tzeng, R Hu, J Donahue, R Girshick, ...
Advances in neural information processing systems 27, 2014
3782014
Convnext v2: Co-designing and scaling convnets with masked autoencoders
S Woo, S Debnath, R Hu, X Chen, Z Liu, IS Kweon, S Xie
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
3642023
Learning to segment every thing
R Hu, P Dollár, K He, T Darrell, R Girshick
Proceedings of the IEEE conference on computer vision and pattern …, 2018
3452018
UniT: Multimodal Multitask Learning with a Unified Transformer
R Hu, A Singh
arXiv preprint arXiv:2102.10772, 2021
3272021
Textcaps: a dataset for image captioning with reading comprehension
O Sidorov, R Hu, M Rohrbach, A Singh
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
2512020
Grounding visual explanations
L Anne Hendricks, R Hu, T Darrell, Z Akata
Proceedings of the European Conference on Computer Vision (ECCV), 264-279, 2018
2272018
Explainable neural computation via stack neural module networks
R Hu, J Andreas, T Darrell, K Saenko
Proceedings of the European conference on computer vision (ECCV), 53-69, 2018
2132018
Iterative answer prediction with pointer-augmented multimodal transformers for textvqa
R Hu, A Singh, T Darrell, M Rohrbach
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
2112020
Language-conditioned graph networks for relational reasoning
R Hu, A Rohrbach, T Darrell, K Saenko
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
1782019
Scaling language-image pre-training via masking
Y Li, H Fan, R Hu, C Feichtenhofer, K He
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
1772023
Generating counterfactual explanations with natural language
LA Hendricks, R Hu, T Darrell, Z Akata
arXiv preprint arXiv:1806.09809, 2018
1052018
Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation
R Hu, D Fried, A Rohrbach, D Klein, T Darrell, K Saenko
arXiv preprint arXiv:1906.00347, 2019
862019
Worldsheet: Wrapping the world in a 3d sheet for view synthesis from a single image
R Hu, N Ravi, AC Berg, D Pathak
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
692021
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20