Legalbench: A collaboratively built benchmark for measuring legal reasoning in large language models N Guha, J Nyarko, D Ho, C Ré, A Chilton, A Chohlas-Wood, A Peters, ... Advances in Neural Information Processing Systems 36, 2024 | 31 | 2024 |
Ai regulation has its own alignment problem: The technical and institutional feasibility of disclosure, registration, licensing, and auditing N Guha, C Lawrence, LA Gailmard, K Rodolfa, F Surani, R Bommasani, ... George Washington Law Review, Forthcoming, 2023 | 10 | 2023 |
Presto: A multilingual dataset for parsing realistic task-oriented dialogs R Goel, W Ammar, A Gupta, S Vashishtha, M Sano, F Surani, M Chang, ... arXiv preprint arXiv:2303.08954, 2023 | 8 | 2023 |