Shallow-to-deep training for neural machine translation B Li, Z Wang, H Liu, Y Jiang, Q Du, T Xiao, H Wang, J Zhu arXiv preprint arXiv:2010.03737, 2020 | 37 | 2020 |
Learning light-weight translation models from deep transformer B Li, Z Wang, H Liu, Q Du, T Xiao, C Zhang, J Zhu Proceedings of the AAAI Conference on Artificial Intelligence 35 (15), 13217 …, 2021 | 33 | 2021 |
Weight distillation: Transferring the knowledge in neural network parameters Y Lin, Y Li, Z Wang, B Li, Q Du, T Xiao, J Zhu arXiv preprint arXiv:2009.09152, 2020 | 18 | 2020 |
ODE transformer: An ordinary differential equation-inspired model for neural machine translation B Li, Q Du, T Zhou, S Zhou, X Zeng, T Xiao, J Zhu arXiv preprint arXiv:2104.02308, 2021 | 14 | 2021 |
ODE transformer: An ordinary differential equation-inspired model for sequence generation B Li, Q Du, T Zhou, Y Jing, S Zhou, X Zeng, T Xiao, J Zhu, X Liu, M Zhang arXiv preprint arXiv:2203.09176, 2022 | 12 | 2022 |
A simple and effective approach to robust unsupervised bilingual dictionary induction Y Li, Y Luo, Y Lin, Q Du, H Wang, S Huang, T Xiao, J Zhu arXiv preprint arXiv:2011.14874, 2020 | 11 | 2020 |
Handling many-to-one UNK translation for neural machine translation F Li, D Quan, W Qiang, X Tong, J Zhu Machine Translation: 13th China Workshop, CWMT 2017, Dalian, China …, 2017 | 3 | 2017 |
Learning Evaluation Models from Large Language Models for Sequence Generation C Wang, H Zhou, K Chang, T Liu, C Zhang, Q Du, T Xiao, J Zhu arXiv preprint arXiv:2308.04386, 2023 | 1 | 2023 |
Topology-Sensitive Neural Architecture Search for Language Modeling Q Du, N Xu, Y Li, T Xiao, J Zhu IEEE Access 9, 107416-107423, 2021 | 1 | 2021 |
Non-autoregressive neural machine translation with auxiliary representation fusion Q Du, K Feng, C Xu, T Xiao, J Zhu Journal of Intelligent & Fuzzy Systems 41 (6), 7229-7239, 2021 | | 2021 |