PaLM 2 Technical Report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1411 | 2023 |
Gemma: Open models based on Gemini research and technology Gemma Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024 | 707 | 2024 |
Gemma 2: Improving open language models at a practical size Gemma Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ... arXiv preprint arXiv:2408.00118, 2024 | 186 | 2024 |
Scaling up models and data with t5x and seqio A Roberts, HW Chung, G Mishra, A Levskaya, J Bradbury, D Andor, ... Journal of Machine Learning Research 24 (377), 1-8, 2023 | 154 | 2023 |
Beyond human data: Scaling self-training for problem-solving with language models A Singh, JD Co-Reyes, R Agarwal, A Anand, P Patil, X Garcia, PJ Liu, ... arXiv preprint arXiv:2312.06585, 2023 | 63 | 2023 |
PaLM 2 Technical Report (2023) R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 51 | 2023 |
Neural generation meets real people: Towards emotionally engaging mixed-initiative conversations A Paranjape, A See, K Kenealy, H Li, A Hardy, P Qi, KR Sadagopan, ... arXiv preprint arXiv:2008.12348, 2020 | 48 | 2020 |
Neural Generation Meets Real People: Building a Social, Informative Open-Domain Dialogue Agent EA Chi, A Paranjape, A See, C Chiam, K Kenealy, SK Lim, A Hardy, ... arXiv preprint arXiv:2207.12021, 2022 | 11 | 2022 |
Transformers and pointer-generator networks for abstractive summarization J Deaton, A Jacobs, K Kenealy, A See | 9 | 2019 |
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024 | 5 | 2024 |
Transfer Learning for Text Diffusion Models K Han, K Kenealy, A Barua, N Fiedel, N Constant arXiv preprint arXiv:2401.17181, 2024 | 2 | 2024 |
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability J Hron, L Culp, G Elsayed, R Liu, B Adlam, M Bileschi, B Bohnet, ... arXiv preprint arXiv:2408.07852, 2024 | 1 | 2024 |