Manuel Tonneau

Ph.D. student in Social Data Science, University of Oxford

Consultant, Development Impact (DIME), The World Bank

Member of the Open Networks and Big Data Lab, New York University

manuel.tonneau@oii.ox.ac.uk

Social:

Resources:

Papers:

Bio

I am a third year Ph.D. student in Social Data Science and a Shirley Scholar at the Oxford Internet Institute of the University of Oxford where I am supervised by Ralph Schroeder and Scott Hale. I also consult for the World Bank under the supervision of Samuel Fraiberger and am affiliated with the Open Networks and Big Data Lab at New York University.

My research is at the intersection of natural language processing (NLP) and AI ethics, aiming to create inclusive NLP systems that work across cultures without perpetuating societal harms. For my PhD, I focus on AI-driven moderation of hate speech on social media, investigating how these systems may unequally protect users from online hate across cultures. Additionally, I work on identifying and reducing harms in text generated by large language models, with a particular emphasis on Global Majority contexts.

Prior to the Ph.D., I worked as a research assistant at the World Bank and the Centre Marc Bloch on various computational social science projects. I hold an Engineering degree in Statistics and Economics (eq. to MSc) from ENSAE Paris as well as an MSc in Economics from Humboldt-Universität zu Berlin.

My research is supported by a Global Merit Award from the Shirley Scholars Fund.

News

2024-11-01 🤗 Our hate speech supersets, combining all hate speech corpora in 8 languages, have reached 2K downloads on Hugging Face!
2024-10-01 🇩🇪 Just started a fellowship at the Weizenbaum Institute in Berlin, Germany, following an invitation from Elizaveta Kuznetsova
Summer 2024 💬 Presented our work on cultural bias in hate speech datasets (link) at WOAH and C3NLP
2024-05-15 📜 Paper accepted at ACL 2024: "NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data" (preprint here)
2024-04-16 📜 Paper accepted at the Workshop on Online Abuse and Harms (WOAH), co-located at NAACL 2024: "From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets" (link)
2024-03-15 📜 Paper accepted at ICWSM 2024: "280 Characters to Employment: Using Twitter to Quantify Job Vacancies"
2023-10-20 💬 Will present my work on content moderation at the upcoming workshop on “Alternative Platforms/Platform Alternatives: Comparisons and Transnational Flows” at the Weizenbaum Institute in Berlin, Germany
2023-09-15 📜 New pre-print on disparities in LLM bias between India and the West. Feedback welcome! (link)
2023-07-20 💬 Presented ongoing work on hate speech on Nigerian Twitter at IC2S2 (photo)
2023-03-15 📜 Paper accepted at ICWSM 2023: "Large-Scale Demographic Inference of Social Media Users in a Low-Resource Scenario" (link)