Adversarial policies: Attacking deep reinforcement learning A Gleave, M Dennis, C Wild, N Kant, S Levine, S Russell arXiv preprint arXiv:1905.10615, 2019 | 435 | 2019 |
End-to-end training of neural retrievers for open-domain question answering DS Sachan, M Patwary, M Shoeybi, N Kant, W Ping, WL Hamilton, ... arXiv preprint arXiv:2101.00408, 2021 | 98 | 2021 |
Practical text classification with large pre-trained language models N Kant, R Puri, N Yakovenko, B Catanzaro arXiv preprint arXiv:1812.01207, 2018 | 84 | 2018 |
PrefixRL: Optimization of parallel prefix circuits using deep reinforcement learning R Roy, J Raiman, N Kant, I Elkin, R Kirby, M Siu, S Oberman, S Godil, ... 2021 58th ACM/IEEE Design Automation Conference (DAC), 853-858, 2021 | 54 | 2021 |
Synthetic datasets for neural program synthesis R Shin, N Kant, K Gupta, C Bender, B Trabucco, R Singh, D Song arXiv preprint arXiv:1912.12345, 2019 | 50 | 2019 |
Helpsteer: Multi-attribute helpfulness dataset for steerlm Z Wang, Y Dong, J Zeng, V Adams, MN Sreedhar, D Egert, O Delalleau, ... arXiv preprint arXiv:2311.09528, 2023 | 39 | 2023 |
Recent advances in neural program synthesis N Kant arXiv preprint arXiv:1802.02353, 2018 | 28 | 2018 |
The Road to General Intelligence J Swan, E Nivel, N Kant, J Hedges, T Atkinson, B Steunebrink Springer Nature, 2022 | 9 | 2022 |
Polaris: A Safety-focused LLM Constellation Architecture for Healthcare S Mukherjee, P Gamble, MS Ausin, N Kant, K Aggarwal, N Manjunath, ... arXiv preprint arXiv:2403.13313, 2024 | 8 | 2024 |
Neural Inference of API Functions from Input–Output Examples R Bavishi, C Lemieux, N Kant, R Fox, K Sen, I Stoica Workshop on ML for Systems at NeurIPS, 2018 | 2 | 2018 |