Faithscore: Evaluating hallucinations in large vision-language models L Jing, R Li, Y Chen, M Jia, X Du arXiv preprint arXiv:2311.01477, 2023 | 11 | 2023 |
Multi-source semantic graph-based multimodal sarcasm explanation generation L Jing, X Song, K Ouyang, M Jia, L Nie arXiv preprint arXiv:2306.16650, 2023 | 9 | 2023 |
Multimodal activation: Awakening dialog robots without wake words L Nie, M Jia, X Song, G Wu, H Cheng, J Gu Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 9 | 2021 |
Plug: Leveraging pivot language in cross-lingual instruction tuning Z Zhang, DH Lee, Y Fang, W Yu, M Jia, M Jiang, F Barbieri arXiv preprint arXiv:2311.08711, 2023 | 6 | 2023 |
Knowledge-enhanced memory model for emotional support conversation M Jia, Q Chen, L Jing, D Fu, R Li arXiv preprint arXiv:2310.07700, 2023 | 2 | 2023 |
Debiasing multimodal sarcasm detection with contrastive learning M Jia, C Xie, L Jing Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 18354 …, 2024 | 1 | 2024 |
Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training M Jia, Z Zhang, W Yu, F Jiao, M Jiang arXiv preprint arXiv:2404.14604, 2024 | | 2024 |
Multimodal Interaction Modeling via Self-Supervised Multi-Task Learning for Review Helpfulness Prediction HL Gong, M Jia, L Jing arXiv preprint arXiv:2402.18107, 2024 | | 2024 |
Query-Oriented Micro-Video Summarization M Jia, Y Wei, X Song, T Sun, M Zhang, L Nie IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | | 2024 |