Ms marco: A human generated machine reading comprehension dataset DF Campos, T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, ... ArXiv, abs/1611.09268, 2016 | 1909* | 2016 |
Overview of the TREC 2019 deep learning track N Craswell, B Mitra, E Yilmaz, D Campos, EM Voorhees arXiv preprint arXiv:2003.07820, 2020 | 378 | 2020 |
XGLUE: A new benchmark dataset for cross-lingual pre-training, understanding and generation Y Liang, N Duan, Y Gong, N Wu, F Guo, W Qi, M Gong, L Shou, D Jiang, ... arXiv preprint arXiv:2004.01401, 2020 | 234 | 2020 |
Leading conversational search by suggesting useful questions C Rosset, C Xiong, X Song, D Campos, N Craswell, S Tiwary, P Bennett Proceedings of the web conference 2020, 1160-1170, 2020 | 74 | 2020 |
ORCAS: 20 million clicked query-document pairs for analyzing search N Craswell, D Campos, B Mitra, E Yilmaz, B Billerbeck Proceedings of the 29th ACM International Conference on Information …, 2020 | 65 | 2020 |
The optimal bert surgeon: Scalable and accurate second-order pruning for large language models E Kurtic, D Campos, T Nguyen, E Frantar, M Kurtz, B Fineran, M Goin, ... arXiv preprint arXiv:2203.07259, 2022 | 52 | 2022 |
Open Domain Web Keyphrase Extraction Beyond Language Modeling AO Lee Xiong, Chuan Hu, Chenyan Xiong, Daniel Campos EMNLP-IJCNLP 2019, 2019 | 48* | 2019 |
Ms marco: Benchmarking ranking models in the large-data regime N Craswell, B Mitra, E Yilmaz, D Campos, J Lin Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 45 | 2021 |
TREC deep learning track: Reusable test collections in the large data regime N Craswell, B Mitra, E Yilmaz, D Campos, EM Voorhees, I Soboroff Proceedings of the 44th international ACM SIGIR conference on research and …, 2021 | 33 | 2021 |
Significant improvements over the state of the art? a case study of the ms marco document ranking leaderboard J Lin, D Campos, N Craswell, B Mitra, E Yilmaz Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 22 | 2021 |
On the reliability of test collections for evaluating systems of different types E Yilmaz, N Craswell, B Mitra, D Campos proceedings of the 43rd International ACM SIGIR Conference on Research and …, 2020 | 21 | 2020 |
Overview of the TREC 2019 deep learning track. CoRR abs/2003.07820 (2020) N Craswell, B Mitra, E Yilmaz, D Campos, EM Voorhees | 16 | 2020 |
Curriculum learning for language modeling D Campos arXiv preprint arXiv:2108.02170, 2021 | 15 | 2021 |
Keyphrase extraction beyond language modeling L Xiong, C Hu, A Overwijk, J Ahmed, DF Campos, C Xiong US Patent 11,250,214, 2022 | 14 | 2022 |
Fostering coopetition while plugging leaks: The design and implementation of the ms marco leaderboards J Lin, D Campos, N Craswell, B Mitra, E Yilmaz Proceedings of the 45th international ACM SIGIR conference on research and …, 2022 | 6 | 2022 |
Img2smi: Translating molecular structure images to simplified molecular-input line-entry system D Campos, H Ji arXiv preprint arXiv:2109.04202, 2021 | 6 | 2021 |
GAIA at SMKBP 2020-a dockerlized multi-media multi-lingual knowledge extraction, clustering, temporal tracking and hypothesis generation system M Li, Y Lin, TM Lai, X Pan, H Wen, S Li, Z Wang, P Yu, L Huang, D Lu, ... Proceedings of Thirteenth Text Analysis Conference (TAC 2020), 2020 | 6 | 2020 |
Sparse* bert: Sparse models are robust D Campos, A Marques, T Nguyen, M Kurtz, CX Zhai arXiv preprint arXiv:2205.12452, 2022 | 4 | 2022 |
To Asymmetry and Beyond: Structured Pruning of Sequence to Sequence Models for Improved Inference Efficiency D Campos, CX Zhai arXiv preprint arXiv:2304.02721, 2023 | 2 | 2023 |
Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders D Campos, A Magnani, CX Zhai arXiv preprint arXiv:2304.01016, 2023 | 2 | 2023 |