Hateful symbols or hateful people? Predictive features for hate speech detection on Twitter Z Waseem, D Hovy Proceedings of the NAACL Student Research Workshop, 88-93, 2016 | 1916 | 2016 |
BLOOM: A 176B-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1209 | 2022 |
Are you a racist or am I seeing things? Annotator influence on hate speech detection on Twitter Z Waseem Proceedings of the 1st Workshop on Natural Language Processing and …, 2016 | 678 | 2016 |
Understanding Abuse: A Typology of Abusive Language Detection Subtasks Z Waseem, T Davidson, D Warmsley, I Weber Proceedings of the First Workshop on Abusive Language Online, 78-85, 2017 | 494 | 2017 |
Dynabench: Rethinking Benchmarking in NLP D Kiela, M Bartolo, Y Nie, D Kaushik, A Geiger, Z Wu, B Vidgen, G Prasad, ... arXiv preprint arXiv:2104.14337, 2021 | 300 | 2021 |
HateCheck: Functional Tests for Hate Speech Detection Models P Röttger, B Vidgen, D Nguyen, Z Waseem, H Margetts, J Pierrehumbert arXiv preprint arXiv:2012.15606, 2020 | 193 | 2020 |
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection B Vidgen, T Thrush, Z Waseem, D Kiela arXiv preprint arXiv:2012.15761, 2020 | 157 | 2020 |
A Survey of Race, Racism, and Anti-Racism in NLP A Field, SL Blodgett, Z Waseem, Y Tsvetkov arXiv preprint arXiv:2106.11410, 2021 | 106 | 2021 |
Bridging the Gaps: Multi Task Learning for Domain Transfer of Hate Speech Detection Z Talat, J Thorne, J Bingel Online Harassment, C1-C1, 2018 | 105* | 2018 |
Detecting East Asian Prejudice on Social Media B Vidgen, A Botelho, D Broniatowski, E Guest, M Hall, H Margetts, ... arXiv preprint arXiv:2005.03909, 2020 | 98 | 2020 |
You reap what you sow: On the challenges of bias evaluation under multilingual settings Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ... Proceedings of BigScience Episode #5 -- Workshop on Challenges & Perspectives …, 2022 | 68 | 2022 |
On the Machine Learning of Ethical Judgments from Natural Language Z Talat, H Blix, J Valvoda, MI Ganesh, R Cotterell, A Williams Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 64* | 2022 |
Data governance in the age of large-scale data-driven language technology Y Jernite, H Nguyen, S Biderman, A Rogers, M Masoud, V Danchev, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022 | 55 | 2022 |
Evaluating the Social Impact of Generative AI Systems in Systems and Society I Solaiman, Z Talat, W Agnew, L Ahmad, D Baker, SL Blodgett, ... arXiv preprint arXiv:2306.05949, 2023 | 46 | 2023 |
Disembodied machine learning: On the illusion of objectivity in NLP Z Waseem, S Lulz, J Bingel, I Augenstein arXiv preprint arXiv:2101.11974, 2021 | 37 | 2021 |
Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models P Röttger, H Seelawi, D Nozza, Z Talat, B Vidgen arXiv preprint arXiv:2206.09917, 2022 | 32 | 2022 |
Detecting ‘dirt’ and ‘toxicity’: Rethinking content moderation as pollution behaviour N Thylstrup, Z Talat Available at SSRN 3709719, 2020 | 22 | 2020 |
Mirages: On Anthropomorphism in Dialogue Systems G Abercrombie, AC Curry, T Dinkar, Z Talat arXiv preprint arXiv:2305.09800, 2023 | 21 | 2023 |
Using TF-IDF n-gram and Word Embedding Cluster Ensembles for Author Profiling. A Poulston, Z Waseem, M Stevenson CLEF (Working notes), 2017 | 21* | 2017 |
Findings of the WOAH 5 Shared Task on Fine Grained Hateful Memes Detection L Mathias, S Nie, AM Davani, D Kiela, V Prabhakaran, B Vidgen, ... Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021), 201-206, 2021 | 19 | 2021 |