Supported policy optimization for offline reinforcement learning. J Wu, H Wu, Z Qiu, J Wang, M Long. Advances in Neural Information Processing Systems 35, 31278-31291, 2022 | 38 | 2022 |
Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures? Z Qiu, Z Huang, J Fu. arXiv preprint arXiv:2310.10908, 2023 | 3 | 2023 |
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts. H Zhao, Z Qiu, H Wu, Z Wang, Z He, J Fu. arXiv preprint arXiv:2402.12656, 2024 | | 2024 |
Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers. Z Qiu, Z Huang, Y Huang, J Fu. Tiny Paper @ ICLR 2024, 2024 | | 2024 |
Heterogenous Memory Augmented Neural Networks. Z Qiu, Z Liu, S Yan, S Zhang, J Fu. arXiv preprint arXiv:2310.10909, 2023 | | 2023 |