Latent jailbreak: A benchmark for evaluating text safety and output robustness of large language models H Qiu, S Zhang, A Li, H He, Z Lan arXiv preprint arXiv:2307.08487, 2023 | 39 | 2023 |
Superclue: A comprehensive chinese large language model benchmark L Xu, A Li, L Zhu, H Xue, C Zhu, K Zhao, H He, X Zhang, Q Kang, Z Lan arXiv preprint arXiv:2307.15020, 2023 | 35 | 2023 |
SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support H Qiu, H He, S Zhang, A Li, Z Lan arXiv preprint arXiv:2305.00450, 2023 | 27 | 2023 |
Understanding client reactions in online mental health counseling A Li, L Ma, Y Mei, H He, S Zhang, H Qiu, Z Lan arXiv preprint arXiv:2306.15334, 2023 | 15 | 2023 |
A benchmark for understanding dialogue safety in mental health support H Qiu, T Zhao, A Li, S Zhang, H He, Z Lan CCF International Conference on Natural Language Processing and Chinese …, 2023 | 9 | 2023 |
Towards automated real-time evaluation in text-based counseling A Li, J Ma, L Ma, P Fang, H He, Z Lan arXiv preprint arXiv:2203.03442, 2022 | 5 | 2022 |
Psychat: A client-centric dialogue system for mental health support H Qiu, A Li, L Ma, Z Lan 2024 27th International Conference on Computer Supported Cooperative Work in …, 2024 | 3 | 2024 |
Automatic Evaluation for Mental Health Counseling using LLMs A Li, Y Lu, N Song, S Zhang, L Ma, Z Lan arXiv preprint arXiv:2402.11958, 2024 | 2 | 2024 |
Predicting the Big Five Personality Traits in Chinese Counselling Dialogues Using Large Language Models Y Yan, L Ma, A Li, J Ma, Z Lan arXiv preprint arXiv:2406.17287, 2024 | | 2024 |
PsyBench: a balanced and in-depth Psychological Chinese Evaluation Benchmark for Foundation Models J Zhang, H He, N Song, S He, H Qiu, A Li, L Ma, Z Lan arXiv preprint arXiv:2311.09861, 2023 | | 2023 |