Don't stop pretraining: adapt language models to domains and tasks S Gururangan, A Marasović, S Swayamdipta, K Lo, I Beltagy, D Downey, ... arXiv preprint arXiv:2004.10964, 2020 | 749 | 2020 |
Annotation artifacts in natural language inference data S Gururangan, S Swayamdipta, O Levy, R Schwartz, SR Bowman, ... arXiv preprint arXiv:1803.02324, 2018 | 698 | 2018 |
Show your work: Improved reporting of experimental results J Dodge, S Gururangan, D Card, R Schwartz, NA Smith arXiv preprint arXiv:1909.03004, 2019 | 152 | 2019 |
Realtoxicityprompts: Evaluating neural toxic degeneration in language models S Gehman, S Gururangan, M Sap, Y Choi, NA Smith arXiv preprint arXiv:2009.11462, 2020 | 150 | 2020 |
Variational pretraining for semi-supervised text classification S Gururangan, T Dang, D Card, NA Smith arXiv preprint arXiv:1906.02242, 2019 | 69 | 2019 |
All that's' human'is not gold: Evaluating human evaluation of generated text E Clark, T August, S Serrano, N Haduong, S Gururangan, NA Smith arXiv preprint arXiv:2107.00061, 2021 | 43 | 2021 |
Detoxifying language models risks marginalizing minority voices A Xu, E Pathak, E Wallace, S Gururangan, M Sap, D Klein arXiv preprint arXiv:2104.06390, 2021 | 21 | 2021 |
Analysis of graph invariants in functional neocortical circuitry reveals generalized features common to three areas of sensory cortex SS Gururangan, AJ Sadovsky, JN MacLean PLoS computational biology 10 (7), e1003710, 2014 | 17 | 2014 |
Demix layers: Disentangling domains for modular language modeling S Gururangan, M Lewis, A Holtzman, NA Smith, L Zettlemoyer arXiv preprint arXiv:2108.05036, 2021 | 12 | 2021 |
Time waits for no one! analysis and challenges of temporal misalignment K Luu, D Khashabi, S Gururangan, K Mandyam, NA Smith arXiv preprint arXiv:2111.07408, 2021 | 7 | 2021 |
Emergent coordination underlying learning to reach to grasp with a brain-machine interface M Vaidya, K Balasubramanian, J Southerland, I Badreldin, A Eleryan, ... Journal of neurophysiology 119 (4), 1291-1304, 2018 | 6 | 2018 |
Classifying locator generation kits R Hodgman, A Kuppa, S Gururangan, A Reece US Patent 10,594,655, 2020 | 1 | 2020 |
Nearest Neighbor Zero-Shot Inference W Shi, J Michael, S Gururangan, L Zettlemoyer arXiv preprint arXiv:2205.13792, 2022 | | 2022 |
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection S Gururangan, D Card, SK Drier, EK Gade, LZ Wang, Z Wang, ... arXiv preprint arXiv:2201.10474, 2022 | | 2022 |
Expected Validation Performance and Estimation of a Random Variable's Maximum J Dodge, S Gururangan, D Card, R Schwartz, NA Smith arXiv preprint arXiv:2110.00613, 2021 | | 2021 |
Neutralizing malicious locators R Hodgman, A Kuppa, S Gururangan, A Reece US Patent 10,601,846, 2020 | | 2020 |
Polyglot Text Classification with Neural Document Models S Gururangan | | 2018 |
Emergent coordination with a brain–machine interface: implications for the neural basis of motor 1 learning 2 M Vaidya, K Balasubramanian, J Southerland, I Badreldin, A Eleryan, ... | | |
Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks Open Website S Gururangan, A Marasovic, S Swayamdipta, K Lo, I Beltagy, D Downey, ... | | |