Chatgpt or grammarly? evaluating chatgpt on grammatical error correction benchmark H Wu, W Wang, Y Wan, W Jiao, M Lyu arXiv preprint arXiv:2303.13648, 2023 | 59 | 2023 |
Biasasker: Measuring the bias in conversational ai system Y Wan, W Wang, P He, J Gu, H Bai, MR Lyu Proceedings of the 31st ACM Joint European Software Engineering Conference …, 2023 | 23 | 2023 |
ChatGPT or grammarly H Wu, W Wang, Y Wan, W Jiao, M Lyu Evaluating ChatGPT on grammatical error correction benchmark. arXiv 2303, 2023 | 11 | 2023 |
A & B== B & A: Triggering logical reasoning failures in large language models Y Wan, W Wang, Y Yang, Y Yuan, J Huang, P He, W Jiao, MR Lyu arXiv preprint arXiv:2401.00757, 2024 | 2 | 2024 |
New Job, New Gender? Measuring the Social Bias in Image Generation Models W Wang, H Bai, J Huang, Y Wan, Y Yuan, H Qiu, N Peng, MR Lyu arXiv preprint arXiv:2401.00763, 2024 | | 2024 |